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Introduction 


T he 37th Annual Sanibel Symposium, orga¬ 
nized by the faculty, students, and staff of 
the Quantum Theory Project of the University of 
Florida, was held on March 1-7,1997. The meeting 
was again held at the Ponce de Leon Conference 
Center in St. Augustine, Florida. More than 300 
participants gathered for seven days of lectures 
and informal discussions. 

The symposium followed the established format 
with plenary and poster sessions. This year, the 
schedule was shortened somewhat with a compact 
seven-day integrated program of quantum biol¬ 
ogy, quantum chemistry, and condensed matter 
physics. The topics of the sessions covered by 
these proceedings include Density Functional The¬ 
ory (DFT) and Applications, DFT for Intermolecu- 
lar Forces, Clusters and Nanostructures, Theory of 
Floppy Molecules, Exotic Anions, Theoretical Inor¬ 
ganic Chemistry, Foundations of Quantum Me¬ 
chanics, Non-Linear Optics, Quantum-Classical 
Molecular Dynamics, and Industrial Application. 

The articles have been subjected to the ordinary 
refereeing procedures of the International Journal of 
Quantum Chemistry . The articles presented in the 
sessions on quantum biology and associated poster 
sessions are published in a separate issue of the 
International Journal of Quantum Chemistry. 

The organizers acknowledge the following 
sponsors for their support of the 1997 Sanibel Sym¬ 
posium: 

■ Army Research Office and U. S. Army Edge- 
wood RD&E Center through Grant 
#DAAG55-97-l-0020: 


International Journal of Quantum Chemistry, Vol. 65, 377 (1997) 
© 1997 John Wiley & Sons, Inc. 


"The views, opinions, and/or findings 
contained in this report are those of the au¬ 
thors) and should not be construed as an 
official Department of the Army position, 
policy, or decision, unless so designated by 
other documentation." 

■ The Office of Naval Research through Grant 
#N00014-97-l-0320: 

"This work relates to Department of the 
Navy Grant #N00014-97-l-0320 issued by 
the Office of Naval Research. The United 
States Government has the royalty-free 
license throughout the world in all copy¬ 
rightable material contained herein." 

■ IBM Corporation. 

■ HyperCube, Inc. 

■ Silicon Graphics. 

■ The University of Florida. 

Very special thanks go to the staff of the Quan¬ 
tum Theory Project of the University of Florida for 
handling the numerous administrative, clerical, 
and practical details. The organizers are proud to 
recognize the contributions of Mrs. Judy Parker, 
Ms. Sharon Stellato, Ms. Sandra Weakland, and 
Mr. Greg Pearl. All the graduate students of the 
Quantum Theory Project, who served as "gofers," 
are gratefully recognized for their contributions to 
the 1997 Sanibel Symposium. 

N. Y. Ohrn 
J. R. Sabin 
M. C. Zerner 
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Foundations of Quantum Mechanics: 
Building on von Neumann’s Heritage 


GERARD G. EMCH 
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ABSTRACT: From 1927 to 1943, the mathematical and conceptual foundations of 
quantum mechanics, and their prolongations into the formulation of statistical mechanics, 
were either the direct subject of or one of the primary motivations for several publications 
by von Neumann. We contrast the reception received by the work done during the first 
5 years (1927-1932) and that received by the monumental mathematical constructs 
wrought in the second period (1934-1943); we describe how the latter were to lead to the 
elaboration of a formalism generalizing the fundamental structures of quantum mechanics 
so that they can encompass and deal consistently with situations typical of the quantum 
many-body problems of statistical mechanics. © 1997 John Wiley & Sons, Inc. Int J Quant 
Chem 65: 379-387, 1997 


Introduction 

V on Neumann died almost exactly 40 years 
ago (February 8, 1957). A mathematician had 
passed away, who—like Poincare and Gauss be¬ 
fore him—had contributed to such a vast array of 
mathematical endeavors that it is a trite question 
to ask whether he was of a pure or applied persua¬ 
sion. Every participant in this symposium is aware 
of the impact von Neumann has left upon the 
methods that we rely on in our everyday scientific 
activities; to wit: his 1932 book is still recognized 
today as one of the landmarks one can turn to for a 
concise formulation of the foundations of quantum 
mechanics; and the computer, which he created, is 
an evermore powerful investigative tool that is 
pervasive in the present company. It seems there- 

International Journal of Quantum Chemistry, Vol. 65, 379-387 (1997) 
© 1997 John Wiley & Sons, Inc. 


fore proper to pay him homage here, not only for 
his having enabled us to cultivate our field so 
intensively, but chiefly for having opened the 
fences of the quantum pastures toward the greater 
outdoors, as yet not fully surveyed. 


1. The Familiar Landscape 
( 1927 - 1932 ) 

The first article [1] witnessing the interest von 
Neumann was to show in the foundations of quan¬ 
tum mechanics originated in a course given by 
Fiilbert in Gottingen during the winter semester 
1926/27; the notes were prepared by Nordheim, 
and we are told that von Neumann contributed 
"important parts" of the mathematical develop¬ 
ments. One of the overt aims was to digest, mathe¬ 
matically and conceptually, the ideas of Jordan 
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and of Dirac. The stimulating and very direct in¬ 
fluence of the pioneering work of Heisenberg, Born, 
and Jordan is explicitly acknowledged from the 
very beginning of the article, while the last section 
is devoted to the Schrodinger equation. The syn¬ 
thesis is clearly on the making, fusing into the 
theory of operators acting on a Hilbert space the two 
forms of quantum mechanics which we now rou¬ 
tinely recognize to be equivalent—as Schrodinger 
himself had pointed out only shortly before: the 
algebraic formulation, then called the "matrix-mech¬ 
anics," of quantum theory (Heisenberg), and the 
analytic formulation , via partial-differential equa¬ 
tions, the so-called "wave-mechanics" (Schrodin- 
ger). 

The same year, von Neumann published three 
other articles on the foundations of quantum me¬ 
chanics, its probabilistic background, and the ther¬ 
modynamics of quantum mechanical ensembles 
[2-4]. His interest was also directed toward more 
specific problems, such as the quantum spectral 
properties of the electron [5-7]; a discussion of the 
extension of ergodic theory and the H-theorem to 
the "new mechanics" [8]; and several other articles 
as well, with ostensibly mathematical titles (e.g., 
[9]), witnessing to the fact that von Neumann was 
profoundly involved in trying to attack the mathe¬ 
matical (or conceptual) foundations of the theory 
from as many angles as possible. 

1931 is the year of the theorem most universally 
attached to von Neumann's name in the physics 
community; it is the statement and the proof of the 
uniqueness of the Schrodinger representation [10]. 
Specifically, this theorem establishes the remark¬ 
able fact that if W 2n + 1 denotes the group gener¬ 
ated by exponentiating the position and momen¬ 
tum to obtain the Weyl form of the CCR (canonical 
commutation relations) then W 2n + 1 has exactly 
one irreducible representation, the one found by 
Schrodinger. In the spirit of the early 1930s, this 
result is worth comparing with the richness of the 
inequivalent irreducible representations that were 
known to exist for most of the other groups 
used in physics; in particular, spectroscopists knew 
how to exploit the fact that the rotation group in 
three dimensions has as many irreducible repre¬ 
sentations as there are complex, finite-dimensional 
vector spaces. By contrast, von Neumann's 
uniqueness theorem is why we can teach quantum 
mechanics and start by telling our students that 
the momentum and position operators are can¬ 
onically realized as differentiation (Pf\ r )(x) = 


~i(d x ^)(x), and as multiplication (QpP)(x) = 

x k V( x), with k — 1,2 ,,n, where W are elements 
of the Hilbert space J*f 2 (R n , C), i.e., are complex¬ 
valued functions, defined on the configuration 
space R”, and assumed to be square-integrable 
with respect to the Lebesgue measure. 

Finally, the culminating achievement arrived 
with the epoch-marking Mathematische Grundlagen 
der Quantenmechanik [11]. I often wondered why 
this jewel saw its first English translation only in 
1955; was this simply due to the towering preemi¬ 
nence of Dirac's book, or could it be a reflection 
from the forced emigration to the United States of 
so many of the scientists who were conversant 
with the subject matter in the original language? In 
the meantime, translation or not, von Neumann's 
book certainly became for a large part of the 
physics community the standard reference on the 
conceptual, as well as the mathematical, founda¬ 
tions of quantum mechanics (QM). In particular, 
its introduction of the density matrix, rather than 
the wave function, as the fundamental concept for 
a quantum mechanical state is quite remarkable; 
the discussion of the "measuring process" in¬ 
scribed there is both compelling and so profoundly 
unsatisfactory that it created a semantic contro¬ 
versy that has not yet really abated; the task of 
elucidating the problems involved in the elusive 
interpretation of the quantum measuring process 
has benefited, in particular, from the sustained 
attention of Wigner, who was in a position to add 
to his own penetrating reflections [12], the author¬ 
ity of his lifelong friendship with von Neumann. 

As far as the treatment of the mechanics of 
quantum systems with finitely many degrees of free¬ 
dom is concerned, the formulation of scattering 
theory seems to be the only postwar conceptual 
and mathematical development not anticipated in 
von Neumann's book. This lacuna has now been 
tended to in [13] (see also its predecessor [14]), and 
it can be fairly said that this did not require any 
drastic reconsiderations of the conceptual mold in 
which von Neumann's book had cast QM. 

Indeed, after the publication of von Neumann's 
book, most physicists had apparently every reason 
to "know" that the observables of QM are the 
self-adjoint operators acting in a fixed Hilbert 
space; that the states are the density matrices asso¬ 
ciated to this Hilbert space; and that this represen¬ 
tation is uniquely determined by the configuration 
space of the system to be considered. 

Well, almost every reason, as we shall now see. 
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2. Two Brief Excursions (1934-1936) 

It is commonly acknowledged that somewhere 
around the turn of the 1930s, something went 
tangy in quantum physics. A perception might 
have appeared according to which the success of 
quantum spectroscopy in atomic and molecular 
physics was not going to carry smoothly over to 
quantum electrodynamics. The rather difficult up¬ 
bringing of the still impetuous quantum field the¬ 
ory might indeed have suggested that the princi¬ 
ples and/or the formalism were not quite right. 
For instance, in an article that we shall briefly 
review later in this section [15], we read of an 
alternate "suitable starting point for a generaliza¬ 
tion of the present quantum mechanical theory. 
The need of such a generalization arises from the 
(probably) fundamental difficulties resulting when 
one attempts to apply quantum mechanics to 
questions in relativistic and nuclear phenomena." 

While the above were powerful physical moti¬ 
vations indeed, it seems that it was also for rea¬ 
sons of a different order that von Neumann be¬ 
came dissatisfied with the foundations of QM. In 
one of his other modes of operation, he had min¬ 
gled mightily into the foundations of pure mathe¬ 
matics; even to us, it is clear that the foundations 
of quantum physics were not on a footing accept¬ 
able to the likes of Hilbert, nor were they on par 
with Kolmogorov's axiomatization of probability 
theory. Note incidentally that von Neumann had 
himself contributed also to the allied field of math¬ 
ematics called measure theory. Therefore, von 
Neumann's reasons to probe further into the foun¬ 
dations of quantum physics participated signifi¬ 
cantly from longings ambient in the mathematical 
community. 

Yet, one does not need to indulge in much 
guessing or tentative reconstructions of his deepest 
motivations: the published record shows that von 
Neumann decided to concentrate his quest, at first, 
on analyzing the necessity of Hilbert spaces in the 
formulation of quantum theory. 

The first excursion in this direction is [15], from 
which the above quote was taken. It is the prede¬ 
cessor of the modern algebraic approach. The pur¬ 
pose was to focus on the set of observables and 
thus to try to abstract the structure of the set 
of all (bounded) self-adjoint operators, dispensing 
with the reference to an underlying Hilbert space. 


Since such operators do not necessarily commute, 
their products are not necessarily self-adjoint; and 
therefore do not necessarily belong to 33 sa ) the 
substitute advanced in this article is to consider 
instead of the ordinary product, AB the symmetric 
product 

A » B = |( AB + BA) = f[( A + B) 2 - A 2 - B 2 ]. 

While this product is clearly commutative, it is not 
associative; thus, points to the necessity of 
studying a new algebraic structure, which is now 
referred to as a Jordan algebra . The article pursued 
this new theory far enough to be able to classify all 
the "simple" finite-dimensional objects in this cate¬ 
gory. All of them, except one, are, in fact, the 
self-adjoint part of some matrix algebra, over the 
reals, the complex numbers, or the quaternions. 
These are called "special" Jordan algebras, to dis¬ 
tinguish them from the "exceptional" one they 
had obtained, involving Caley numbers; the latter, 
however, has not yet found useful physical appli¬ 
cations. Its interest is mostly to point to the exis¬ 
tence of Jordan algebras that are not special. For 
many years, to determine whether a Jordan alge¬ 
bra is "special" (i.e., is the self-adjoint part of an 
associative algebra) was an open problem for infi¬ 
nite—dimensional Jordan algebras, even when 
these algebras are equipped with a norm; the an¬ 
swer [16] somewhat surprisingly was reduced to 
the question of whether a consistent orientability 
condition in the state space of these algebras is 
satisfied (for an elementary description of what is 
involved, see, for instance, pp. 378-381 in [17]). 

Von Neumann's second excursion [18] in the 
direction of probing further the role of the Hilbert 
space formulation that he had described in [11] 
was to abstract the structure of the family L of all 
projectors on a Hilbert space or, equivalently, the 
structure of the set of its closed subspaces. Under 
the usual inclusion, one can define the intersection 
of two projectors as the largest projector contained 
in both of them and the union of two such projec¬ 
tors as the smallest projector containing them both. 
There is a smallest element, the zero projector, and 
a largest element, the identity map. Finally, a com¬ 
plementation is defined, attributing to every pro¬ 
jector P the projector I — P, its orthogonal comple¬ 
ment. The result of this description is that L is a 
complemented lattice, in much the same way as 
the collection of all measurable subsets (or their 
indicator functions) of a probability space is a 
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complemented lattice. There is, however, a major 
difference: the latter is Boolean, i.e., the intersec¬ 
tion and union enjoy the familiar distributive rela¬ 
tions. This property is not true for L, and this 
reflects an essential feature of QM, the noncommu¬ 
tativity of the operators acting on the Hilbert space. 
The question asked in [18] was how to characterize 
L intrinsically or, phrasing this in more physical 
terms, how to define abstractly the quantum 
equivalent of a classical probability space. Birkhoff 
and von Neumann only found a partial answer: an 
abstract property—called modularity—seemed to 
be a good substitute to replace the classical notion 
of distributivity; however, Birkhoff and von Neu¬ 
mann pointed out that this property is not suffi¬ 
cient to capture the structure of the set of all the 
projectors, since it is satisfied only when the Hilbert 
space if finite-dimensional , clearly not an acceptable 
restriction for the purpose of QM. A weakened 
version of modularity was found a quarter of cen¬ 
tury later [19]. In physical terms (i.e., here in purely 
lattice-theoretical terms), the weak modularity 
condition requires that if P implies Q (i.e., PcQ, 
or, equivalently, in the language of projections 
operators acting on a Hilbert space, PQ = P), then 
P and Q are compatible (i.e., the sublattice of L 
generated by P, Q, P 1 , Q 1 is Boolean or, equiva¬ 
lently, in the operator language, PQ = QP). To 
obtain that this weak modularity characterizes the 
lattice of all projectors in a Hilbert space, one must 
add an atomicity condition exploiting the presence 
of minimal projections (also called "atoms" or 
"points"), a condition the omission of which led 
later to interesting structures von Neumann was to 
explore; see next section. 

The lattice theoretical approach to QM captures 
enough of the fundamental structure of the theory 
that it is sufficient to define symmetries as auto¬ 
morphisms of L preserving its structure, in order 
to obtain—at long last—an elementary proof for 
the theorem by Wigner [20] according to which, in 
a Hilbert space formalism where all bounded 
self-adjoint operators correspond to physical ob¬ 
servables, symmetries are implemented either by 
unitary or by antiunitary operators; for the proof, 
see [21], and for a critical assessment, [22] and 
references therein. 

By 1936 therefore, one could hope that some 
useful information would be retrieved from the 
study of the algebra of observables or from the 
lattice of projections that generate this algebra; the 
main problem was to find the topological condi¬ 
tions allowing one to relax the conditions enough 


to encompass infinite-dimensional situations. This 
is a story for the next section. 


3. The Great Outdoors 
(1936-Onwards) 

Many problems that the traditional QM does 
not seem able to cope with in a consistent manner 
are related to the appearance of infinitely many 
(real or virtual) degrees of freedom. In the latter 
case, and in contrast with von Neumann's unique¬ 
ness theorem (see Section 2), we know [23], that 
the Fock space representation is only one among 
uncountably many inequivalent representations of 
the canonical commutation (or anticommutation) 
relations. We further know that the Fock represen¬ 
tation is attached to the vacuum state and that it 
cannot accommodate the finite density of particles 
needed for many applications to statistical me¬ 
chanics. A manifestation of this phenomenon was 
pointed out in connection with the gauge-symme¬ 
try breaking occurring in the BCS model for super¬ 
conductivity [24]. 

The natural frame to collect these phenomena 
was prepared by Murray and von Neumann in a 
series of articles [25]. The mathematical strategy of 
these articles can be explained physically as a way 
to see QM as a generalization of classical mechan¬ 
ics in the following manner. 

First, let us agree to view (a large class of) the 
observables in classical mechanics as (essentially) 
bounded functions on the classical phase space, 
mathematically a measure space (fi, p ); for in¬ 
stance, p could be the Lebesgue measure on the 
space R 2 ”. These functions are the real elements of 
an algebra sf = Lq(O) that comes naturally 
equipped with an involution (here, the complex 
conjugation) and a norm satisfying || A*A\\ — \\A\\ 2 . 
In modern terms, this is to say that sf is a C*-alge¬ 
bra. Moreover, srf is the dual of a Banach space, 
namely, the space LHfi, p ) of integrable functions 
on (fi, p). The latter property makes stf what is 
called a W*~algebra. Note that the states on these 
classical systems are given by the positive ele¬ 
ments p of L}(Ct, pi) that are normalized by requir¬ 
ing that their total weight is 1, so that p dpi is a 
probability measure. Hence, the expectation value 
of the observable A in the state p is given by 
< p; A) = fA(co)p(co) dpi. 

We can now take advantage of this perspective 
on classical mechanics to provide a natural link to 
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the formalism of quantum mechanics. Let us in¬ 
deed represent the algebra sf as the algebra of 
operators on L 2 (fi, jjl) operating by multiplication, 
i.e., for every vector ^ e L 2 (0, fi), the vector A 'F 
is defined by (A^XctO = AfcoHXco). It is easy to 
see that, in this representation, 

where,*/' = {B e<^|AB = BA VA (3.1) 

This relation implies that 

where sf" = {&'}’. (3.2) 

(In general, j/' is called the commutant of stf, and 
j/", the bicommutant of j/.) Hence, stf is a von 
Neuman algebra , i.e., an algebra of operators such 
that it is self-adjoint (i.e., A e/ implies that its 
adjoint A* also belongs to ssf) and satisfies (3.2) 
(i.e., it coincides with its bicommutant). 

Von Neumann algebras are the mathematical 
objects proposed for study in [25] under the now 
obsolete name of "rings of operators''; the current 
name seems to have come into general practice 
after the publication of [26]. 

It is of utmost importance in our argument that 
we include (3.2) but not (3.1) in the definition of a 
von Neumann algebra; indeed, (3.1) requires in 
addition that the algebra be Abelian [j/cj/'], 
which we do not want in general, since we want 
to include in this structure algebras that are not 
Abelian, such as the algebra of all bounded opera¬ 
tors on a Hilbert space. It is significant for our 
purpose that classical mechanics and ordinary QM 
appear in this language as two extreme cases: to 
classical systems are associated algebras which co¬ 
incide with their commutant [srf=srf f ), whereas 
the algebras that are associated with ordinary 
quantum systems are irreducible, i.e., their com¬ 
mutant is trivial [sf f = Cl]. Von Neumann and 
Murray concentrated on an intermediary case ob¬ 
tained when the center of «*/ is trivial [$/ r\s&' = 
Cl]; these von Neumann algebras are called fac¬ 
tors. Mathematically, they turn out to be the build¬ 
ing blocks from which one can construct all von 
Neumann algebras. Physically, they occur under 
very specific circumstances, e.g., in connection with 
the description of pure thermodynamical phases as 
extremal KMS states (see below). 

Before we arrive there, we should return briefly 
to the question of the justification for the role 
Hilbert spaces in quantum theory; or, stated differ¬ 
ently, whether one can avoid putting Hilbert spaces 
explicitly in the very axioms of the theory; or 


putting it still differently, whether one can build in 
the theory enough flexibility so that different phys¬ 
ical situations may be described by different 
Hilbert space representations. 

The answer to this question is predicated on the 
prior construction of a C*-algebra (see the begin¬ 
ning of this section), the self-adjoint elements of 
which are the observables of the system consid¬ 
ered. For instance, in the study of a Heisenberg 
magnet, i.e., of spins distributed on a lattice A, one 
can associate to every part A N c A containing N 
sites (with N finite) the C*-algebra of all 
2 N X 2 n matrices. The set-theoretical inclusion in 
the family of all finite subsets of A induces a 
natural inclusion relation in the family of all corre¬ 
sponding C*-algebras. It is then possible to con¬ 
struct (by a method called the C*-inductive limit) 
a C *-algebra sf and mutually consistent injective 
maps from each to sf, in such a manner that 
the set theoretical union of their images in sf is 
dense in stf. sf is then identified as the C*-algebra 
of the infinite lattice of spins. Note that no mention 
of a Hilbert space has been made in connection 
with the global algebra j/. 

The general GNS construction associates then to 
every state on any C*-algebra sf a specific Hilbert 
space representation. As this construction seems to 
capture much of the physics, let us sketch it. First, 
a state is viewed here as a linear map: 

4>\ A (4>; A) cC with 

<4>; A*) = <0; A)*; <0; A*A> >0 V A 

and sup Ae ^ /ljAll < 1 l<(f>; A) |= 1. (3.3) 

Note that, while the above properties are not 
mathematically independent, they all have signifi¬ 
cance for the physical interpretation of the theory; 
moreover, in the usual situation where has a 
unit I, the last of the above conditions reads sim¬ 
ply (<f>; I) = 1. 

The state 4> °n stf being given, a sesquilinear 
form on is naturally defined by 

(A, B) = <</>; A*B>. (3.4) 

A Hilbert space 

(3.5) 

is then obtained as the completion of the pre-Hil¬ 
bert space Ov/Ker^), where the kernel Ker^, = {A 
<</>; A*A> = 0} is a closed left-ideal of ,*/. We 
identify the vector <F A as the equivalence 


INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY 


383 







EMCH 


class of A ej/ modulo Ker^. As these vectors 
form a dense subspace of we can define for 
every A a bounded operator A) by 

77^(A)O b = 0 AB V B gj/. (3.6) 
Note that the map 

77 ^: A Gj/ »-> 7r^(A) G^(^p (3.7) 

is a Hilbert space representation of sf\ 

ir/cjAj + c 2 A 2 ) = c 1 77^,(A 1 ) + c 2 TT tj> (A 2 ) 

TT 4 ,{A l A 2 ) = ^(Ai)77^,(A 2 ) 

tt/A*) = ir+iAf, (3.8) 

for all c v c 2 ^ C and all A v A 2 e j/. Note also 
that 

^ = {^(A)<I>,|A ej/} (3.9) 

and 

it/A)®,) = (<!>; A) V A ey, (3.10) 

i.e., in this representation, the vector ^ is cyclic 
and represents the state 4>. These two properties 
determine the representation up to a unitary 
equivalence. 

For two familiar examples, let </> be the vacuum 
state on the algebra of the CCR; if the sys¬ 
tem considered has finitely many degrees of free¬ 
dom, the representation Tr cf) is the Schrodinger rep¬ 
resentation; and when the system considered 
has infinitely many degrees of freedom, the repre¬ 
sentation 77 ^ is the Fock space representation. The 
familiar fact that these representations are irre¬ 
ducible is a reflection on the fact that the vacuum 
state is a pure state (i.e., cannot be decomposed 
into a convex mixture of other states). Since W*-al¬ 
gebras are C*-algebras, the GNS applies to them 
as well. 

As indicated above, for the special case of the 
Heisenberg ferromagnet, it is possible to construct 
abstractly the C*-algebra corresponding to an infi¬ 
nite system. Similar procedures can be followed in 
much more general circumstances; for instance, an 
axiomatization of quantum field theory was pre¬ 
sented along such line in an article that injected a 
new vigor in the algebraic approach to quantum 
theory [27]. The general idea is as follows. Suppose 
that a C*-algebra ssf has been obtained that de¬ 
scribes the infinite system to be considered. Sup¬ 


pose further that a state </> as been selected, e.g., 
the thermodynamical limit of the canonical equi¬ 
librium states of the corresponding local systems; 
we shall discuss below a characterization of the 
states obtained in this manner. One then performs 
the GNS construction associated to this state to 
obtain the representation 77 ^ that is thus specifi¬ 
cally based on the state 0. It is now a matter of 
convenience to consider the von Neumann algebra 
acting on the Hilbert space 

The main advantage of considering the von 
Neumann algebra rather than the C* -alge¬ 

bra 77 ^(j/) is that the topology of von Neumann 
algebras is better adapted to apply some of the 
methods of functional analysis. For instance, a von 
Neumann algebra Jif is always generated by its 
projectors (whereas a C*-algebra might have no 
non-zero projector among its elements). A spectral 
theory can be formulated then, entirely at the level 
of the von Neumann algebra. In fact, the lattice 
L(/f) of the projectors of a von Neumann algebra 
essentially contains all the intrinsic information 
provided by itself. In particular, when is a 
factor, a dimension function d can be defined on 
L(/T). Up to a multiplicative constant, the range R 
of d can be of either of the following five types: 

type I n : R = {1,2,..., ft} 

type 4: R = { 1 , 2 ,..., 00 } 

type ll x \ R = [ 0,1] 

type 14: R = [0, 00 ] 

type III: R = {0,°°}. (3.11) 

From a mathematical point of view, we empha¬ 
size that the range of the dimension function for 
the factors of type I is discrete: the lattice of their 
projectors is atomic. By contrast, for the factors of 
type II V the dimension function takes all values in 
the finite closed interval [0,1]; these factors are the 
candidates for the "geometries without points." 
The search for such geometries seems to have been 
one of the motivations von Neumann had, not 
only for the series of articles under discussion, but 
also for the article with Birkhoff discussed in the 
previous section, where they had isolated the 
modularity condition. 

For the purposes of physics, factors of type II 
and III nowadays occur routinely and typically in 
the study of systems with infinitely many degrees 
of freedom, be they obtained in relativisitic quan¬ 
tum field theory or in the statistical mechanics of 
the thermodynamical limit. We will concentrate 
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our attention on the latter in the sequel. To prepare 
for this, the case of canonical equilibrium states for 
finite systems is still worth considering in some 
detail. Specifically, we want to illustrate that, as 
the GNS representation depends on the state (f >, 
one should expect that it expresses also some of 
the special circumstances of the physical situation 
described by the state 4>. 

The following explicit particular case is typical 
in the structures that it brings to light. Consider, 
indeed, the representation obtained by the GNS 
construction for the canonical equilibrium state of 
a 1 -dimensional harmonic oscillator; it acts on the 
Hilbert space known as the Liouville space of the 
system ^ = <g>^, where^ denotes the Hilbert 

space of the Schrodinger representation. This is the 
space of all Hilbert-Schmidt operators on %f 0 , i.e., 
the Hilbert space spanned by the density matrices 
on With {ifj k | k = 0, 1 , 2 ,...} denoting an or¬ 
thonormal basis of eigenvectors of the Hamilto¬ 
nian H of our oscillator, {^ kl = i[t k <S> ip l | k, l = 
0 , 1 , 2 ,...} gives an orthonormal basis in We 
have then 

77^( A) = A ® I, 

i.e., 77 ^( A)(\f) k <S> i// ; ) = (Aip k ) < 8 > i/fj (3.12) 

and 

<Lj = -/p with p = e~P H /Tre~P u , 

EvW^kk- (3-13) 

k 

This representation is evidently not irreducible, a 
reflection of the fact that the canonical equilibrium 
state p is a mixture. Moreover, it is instructive to 
notice that 

TT^y = {I< 8 >A|Aej/} and, thus, 

n 77 - 4 ,(j/)' = C, (3.14) 

i.e., tt^Q/) is a factor which is antiunitarily equiva¬ 
lent to its commutant, through the antiunitary 
operator / defined by 

/(Eca) = Hc* kl % k . (3.15) 

Notice, furthermore, that the operators U^it) de¬ 
fined on ^ 4 , by 

^( 0 ^ = 0 ) V(t)AV(~t) 

where V(t) = e~ lHt and A (3.16) 


satisfy the relations 

LZ^( ^ )<t> = and 

!i/f)7r(A)Lf/-f) = 7 r(V(t)AV(~t)) 

VA^stf. (3.17) 

These operators are realized explicitly by 

U^t) = V(t) ® V(-t) (3.18) 

so that 

fU^iOj = U^i-t) and = H 0 I - I ® H , 

(3.19) 

where the generator of U$, has therefore a 
spectrum that is symmetric around zero and, in 
particular is not bounded below. As is called 
the Liouville space of the system, H 4 , is referred to 
as the Liouville operator. 

So far, we are still in somewhat familiar terri¬ 
tory. Yet, the remarkable feature about the repre¬ 
sentation that we just discussed is that much of the 
structure to which it has pointed persists in the 
thermodynamical limit where the usual formalism 
of QM loses its power since the Fock space does 
not accommodate states with a finite particle den¬ 
sity and factors do appear that are not of type I. 
This was first noticed for the Bose gas [28]. 

This structure is now known indeed to be a 
general consequence [29] (for a textbook account, 
see [17]) of a characterization of the canonical 
equilibrium state that carries over to the thermo¬ 
dynamical limit. This is the abstract formulation 
[30] (the so-called KMS condition [31]) of an ana- 
lyticity property of thermal Green functions. 

Given a time-evolution a (i.e., a continuous, 
one-parameter group of automorphisms of the al¬ 
gebra of observables for the system considered) 
and given also a temperature = 1/kT , the KMS 
condition on the state <fi expresses that the correla¬ 
tion functions satisfy (in essence, although some 
caution on domains should be exercised) 

(</>; a(t)[ A]B) = ((f); Ba(t + i/3)[ A]) 

V A, B e/, (3.20) 

The states that satisfy this condition are called 
KMS states. 

For finite systems, it is immediate to see that 
there is only one such state and that it is given by 
the usual canonical equilibrium density matrix. 
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For infinite systems, however, this uniqueness dis¬ 
appears, but the set <9^ of the states that satisfy 
the KMS condition (for the evolution a and the 
temperature /3) form a convex set that is a sim¬ 
plex, i.e., a convex set the elements of which can 
be decomposed in a unique manner as a convex 
combination of their extreme points (as every 
chemist who has dealt with mixtures of three reac¬ 
tants knows, an equilateral triangle is a simplex; a 
circle is not). This additional freedom is exactly 
what was looked for when one wished the thermo¬ 
dynamical limit would allow one to describe phase 
transitions. The extremal KMS states are to be 
interpreted as pure thermodynamical phases, and 
it seems important for the interpretation that 
canonical equilibrium states be uniquely decom¬ 
posable in their extremal components, the pure 
thermodynamical phases. This very special prop¬ 
erty of quantum canonical equilibrium states is a 
classical feature of quantum statistical mechanics 
that is made manifest only in the thermodynami¬ 
cal limit. The GNS representation for a KMS state 
4>, which does not describe a pure thermodynami¬ 
cal phase, is not a factor; the spectrum of its 
nontrivial center then labels the pure thermody¬ 
namical phases of which is a mixture. 

The canonical decomposition of KMS states in 
extremal states satisfying the KMS condition has 
been a useful tool to discuss the phenomena of 
spontaneous symmetry breaking, see, e.g., [32]. For 
nonequilibrium situations as well, the flexibility of 
the algebraic approach has allowed some progress 
on a variety of fronts, such as 

1 . The construction of a model in which one can 
implement the four requirements Bohr 
wanted to impose on a quantum measuring 
process, namely: (i) the collapse of the wave 
packet, e.g., the collapse of the state of the 
system from a coherent superposition of 
eigenstates (relative the observable mea¬ 
sured) to an incoherent mixture of these 
eigenstates; (ii) the transfer of information 
from the system to the apparatus; (iii a) the 
measuring apparatus is macroscopic (a rather 
vague notion that can be sharpened in a 
formalism where the passage to the thermo¬ 
dynamical limit is allowed); (iii b) the mea¬ 
suring process is stable under small (i.e., lo¬ 
cal) perturbations of the initial state of the 
measuring apparatus; (iv) the process is gov¬ 
erned by the laws of Quantum Mechanics 
(which as applied to infinite systems, means 


that the dynamics is obtained as the thermo¬ 
dynamical limit of a Hamiltonian evolution 
describing completely the coupling between 
the system and the apparatus) [33]. 

2 . The extension to the quantum realm of the 
concept of Kolmogorov flows, inherited from 
classical probabilistic ergodic theory (e.g.. 
Brownian motion) [34]; this involves discus¬ 
sions of the physical manifestations of such 
ergodic behavior, including various notions 
of mixing (i.e., various forms for the decay of 
correlation functions), as well as finer notions 
such as the Lebesgue spectrum, the nonvan¬ 
ishing of dynamical entropy, and quantum 
diffusion-type equations. 

3. The exploration of quantum equivalents to 
the classical notions of Anosov systems, 
chaos, and/or Liapunov exponents; the latter 
project is still involved in some controversy 
in which the algebraic approach helped the 
formulation, and sorting out, of various op¬ 
tions [35]. 

Quantum field theory, elementary particle the¬ 
ory, braid groups, and knots, and noncommutative 
differential geometry are other fields of physics 
and of mathematics where the algebraic and ana¬ 
lytic methods pioneered by von Neumann have 
opened new conceptual vistas; these, however, are 
not within the scope of the present lecture. 
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ABSTRACT: In probability theory an event is characterized by a set of outcomes. The 
probability of an event is a prediction of the relative frequency of the outcomes that lie in 
the set, when the experiment is repeated independently. In quantum mechanics an event 
is identified with a closed subspace of Hilbert space. Do the probabilities of quantum 
events predict relative frequencies in the same way? The famous analysis of Bell shows 
that this cannot happen in general, for all quantum events. Nevertheless the probabilities 
must predict something, and if not relative frequencies then what? There are several 
answers to the question of how to interpret quantum probabilities as relative frequencies, 
but they all involve restrictions on the events that are considered. This article surveys 
three such interpretations, which correspond to three possible choices: measured events, 
macroscopic events, and position events. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 
65: 389-398, 1997 


Introduction 

Does quantrum probability predict frequency? 

If so, frequency of what? We put these questions in 
a mathematical framework and present possible 
answers. The purpose is to clarify the physical 
content of quantum mechanics. This article is ex¬ 
pository; the goal is to explain the issues in as 
elementary a way as possible. 

In ordinary probability theory there is a fixed 
set of outcomes. An event is characterized by a set 
of outcomes. The probability of an event is a num- 
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ber between zero and one. The ordinary interpre¬ 
tation of probability in physical science is the 
frequency interpretation. Suppose there are many 
independent repetitions of the same situation. Each 
time there is an actual physical outcome. The rela¬ 
tive frequency of the repetitions for which the 
outcome makes the event happen should be close 
to the probability prediction. If this frequency in¬ 
terpretation of probability is to be viable, then the 
probability assignment must satisfy basic princi¬ 
ples of probability theory, such as addivity. 

In quantum mechanics an event is identified 
with a closed subspace of Hilbert space. When the 
quantum state is specified, each quantum event 
has a probability. This again is a number between 
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zero and one. However, conjunctions of quantum 
events are defined only in special cases, when the 
events are compatible. Can the probabilities, for all 
quantum events simultaneously, be regarded as 
predictions of relative frequency? Can the quan¬ 
tum events be reinterpreted as determining sets of 
outcomes? The answer is no. The probabilities of 
quantum events violate additivity. 

The famous analysis of Bell gave a deeper per¬ 
spective on this fact. If the probability assignment 
depends on which set of compatible quantum 
events is considered, then the ordinary probability 
laws can be satisfied. The physical apparatus that 
defines the particular quantum events influences 
the probabilities. However, Bell found a new con¬ 
straint. Assume that the system satisfies two con¬ 
ditions: active locality and passive locality. Then 
again the quantum probability assignment con¬ 
flicts with probability laws. Thus if quantum me¬ 
chanical probabilities are to predict frequencies in 
the usual way, then either active locality or pas¬ 
sive locality must fail. 

In any case, if quantum probabilities are to 
predict the frequencies of certain events, then these 
events must be defined in terms of sets of out¬ 
comes. There are several answers to the question 
of how to pick the set of outcomes and the corre¬ 
sponding events, corresponding to different inter¬ 
pretations of quantum mechanics. We survey three 
such interpretations. 

According to the Copenhagen interpretation, the 
set of possible outcomes depends on an external 
choice of what to measure. Once this choice has 
been made, then there is only a small collection of 
compatible events for which the probabilities give 
frequency predictions. 

Another interpretation attempts to describe a 
single set of outcomes consisting of macroscopic 
histories. The laws of probability apply to macro¬ 
scopic events for which a decoherence condition is 
satisfied. 

Yet another class of interpretations describes the 
world on the atomic level, and the set of possible 
outcomes consists of particle trajectories. The only 
quantum events that persist in this picture are 
position events. 

So the answer is that quantum probability pre¬ 
dicts frequencies, but only of certain events. In 
each interpretation it predicts the frequencies of 
events characterized by sets of outcomes. The 
problem is that there is no agreement over what 
the outcomes should be. 


The entire discussion here is in the context of 
the elementary quantum mechanics of particles. Of 
course, current physical theory regards quantum 
fields as the fundamental dynamical entities. In 
particular, quarks and leptons are described by 
Fermi fields and interactions between them by 
gauge fields. The direct physical significance of 
these fields is subtle, and so the implication for the 
interpretation of quantum mechanics is yet ob¬ 
scure. 


Probability 

Our first task is to review standard probability 
theory in order to contrast it with quantum me¬ 
chanics. In probability there is a set fl of all 
outcomes of an experiment. An event A is a condi¬ 
tion that determines a subset of IF to which a 
probability may be assigned. The logical opera¬ 
tions and, or, not on events correspond to set 
theory operations intersection, union, complement 
on subsets. The probability P[ A] of an event A is a 
number between 0 and 1. The probability assign¬ 
ment must satisfy the usual laws of probability. 
For instance, a typical condition is additivity in the 
form 

P[ A] = P[ A and B] + P[ A and not B]. 

The frequency interpretation of probability is as a 
prediction of relative frequency. Suppose the situa¬ 
tion is repeated many times under independent 
conditions. Then the relative frequency of the repe¬ 
titions for which a given event happens should be 
close to the probability of the event. 

Here is an example of a probability assignment. 
Consider the set II consisting of the six triples 110, 
101, 100, Oil, 010, and 001. Let A be the event that 
the first element is a 1, let B be the event that the 
second element is 1, and let C be the event that the 
third element is 1. Thus B, for example, corre¬ 
sponds to the three-element subset 110, 011, 010. 
Assign the probability uniformly over the six 
points. Then the events A, B, and C and their 
negations each have probability However, note 
that events such as the compound event (A and 
not B) have probability f. (The corresponding 
subset is 101, 100, and these are two out of six.) 
Thus there is a conspiracy that makes A and B 
tend not to happen together. However, there is a 
limit to the possible magnitude of such a conspir¬ 
acy. This limit is given by the following elemen¬ 
tary probability theorem. 
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ELEMENTARY PROBABILITY THEOREM 

Consider an arbitrary probability assignment. 
Then 

P[ A and not B] + P[ B and not C] 

+ P[ C and not A] < 1. 

This theorem is completely trivial as a theorem 
of probability theory, but we shall see that there is 
an analogous result that is false in quantum me¬ 
chanics. 


Quantum Mechanics 

We briefly review this theory. The mathematical 
framework of quantum mechanics is Hilbert space. 
Each quantum event is identified with a closed 
subspace A of the Hilbert space. (Often the sub¬ 
space is identified with the orthogonal projection 
operator onto the subspace.) The quantum state is a 
fixed one-dimensional subspace of the Hilbert 
space. For each quantum event A, the probability 
of A is given by 

P[ A] = cos 2 (0), 

where cos 2 (0) is the square of the cosine of the 
angle between the one-dimensional subspace de¬ 
termining the state and the subspace correspond¬ 
ing to the A. [Of course, the Hilbert space is 
complex, but cos 2 (0) may be defined as the effect 
on the state of projecting onto A and then back.] 

In this framework, the negation not A of the 
event A is defined by the orthogonal complement 
A 1 of the subspace corresponding to A, so that 

P[not A] = sin 2 (0). 

In particular, 

P[A] + P[not A] = 1. 

If A and B are closed subspaces, the intersec¬ 
tion A n B is a closed subspace. We say that A 
and B are compatible if the four subspaces A n 
B, AnB i ,A‘ L nB, A 1 nB 1 have direct sum the 
entire Hilbert space. (This geometrical condition is 
equivalent to having the corresponding projections 
commute.) When A and B are compatible, we 
define their conjunction A and B to be the intersec¬ 
tion A n B. Thus, when A and B are compatible, 
we have all the usual probability laws, for exam¬ 


ple, additivity in the form 

P[ A] = P[ A and B] + P[ A and not B ]. 

It would be unwise to define conjunction as 
intersection of subspaces when the subspaces are 
not compatible. To see this in a dramatic way, take 
A and B to be one-dimensional subspaces that are 
not parallel or perpendicular, and take the state 
along A. The additivity equation then reads 1=0, 
as bad a failure as one could imagine. 

The relation of compatibility is not transitive. 
Say that A is compatible with B and also B is 
compatible with C. It does not necessarily follow 
that A is compatible with C. 

What is to be made of the existence of incom¬ 
patible quantities A and B? In the Copenhagen 
interpretation of quantum mechanics it is some¬ 
times said that one must choose to measure either 
A or B, and the measurement of one makes impos¬ 
sible the measurement of the other. Furthermore, a 
quantum event has no particular truth value until 
it is measured. So from this perspective it is rea¬ 
sonable there is no meaningful quantum event that 
combines A with B. 


Bell’s Theorems 

Bell discovered a remarkably simple way of 
characterizing some of the puzzling aspects of 
quantum mechanics. His essays on this subject 
have been collected [1]. In this account we draw on 
the appendix to Wick's book [2] for a particularly 
elementary account. 

Suppose that we have a fixed quantum state 
and two compatible events A and A'. We say that 
A is equivalent to A' with respect to the state if 

P[ A] = P[ A and A'] = P[ A']. 

These events will happen or not happen together, 
with probability 1. 

Bell made a remarkable discovery about the 
quantum system consisting of two spin \ particles 
in the singlet state. There are events A, B , and C 
associated with the spin of the first particle and 
events A', S', and C associated with the spin of 
the second particle. They have the following prop¬ 
erties: 

1. Each of the events A, B, C is compatible with 
each of the events A', B', C'. 
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2 . The event A is equivalent to A, the event B 
is equivalent to B', and the event C is equiv¬ 
alent to C. These events all have probability 
1 

2 * 

3 . The probabilities of the events (A and not 
B'X (B and not C'), (C and not A) are each 
§. There is a very strong conspiracy for A 
and B' not to happen together. 

This gives what we shall refer to as Bell's first 
theorem: 

BELL S FIRST THEOREM 

There can be no single probability model that 
reproduces the results of quantum mechanics for 
this system. 

Proof. If there were, we could replace the events 
A, B', and C by the equivalent events A, B, C 
and get 

P[ A and notB] + P[ B and not C] 

+ P[C and not A] = §. 

This contradicts the elementary probability theo¬ 
rem. 

This theorem is remarkable, but its implication 
for quantum mechanics is not quite as strong as it 
seems. If one admits that quantum events only 
become true or false when they are measured, then 
one must be prepared to admit that a measure¬ 
ment can have an effect on the system. Thus the 
probability assignment might be affected by what 
is measured. 

In the example of the two particles, there is a 
magnetic field in the vicinity of the first particle 
that is oriented in a particular direction. It can 
deflect the particle in one of two different ways. 
The event A corresponds to one of the two possi¬ 
ble deflections (and the event not A to the other 
one). If the magnetic field is oriented in another 
way, we get the event B and its negation. A third 
orientation defines C and its negation. Only one of 
these orientations is physically possible in a given 
situation. There are similar possible magnetic field 
arrangements for the second particle, with corre¬ 
sponding events A , B', C and their negations. 

Thus we could write P ah for the probability 
when we have decided to introduce magnetic fields 
to decide between A and not A for the first 
particle and between B' and not B' for the second 
particle. In this notation the three probabilities 
would be P ab [A and not B'], P bc [B and not C'], 


and PjC and not A']. Even though these proba¬ 
bilities are each f, there is no way of combining 
the events in a situation where a single probability 
assignment is relevant. It would seem that the 
argument would no longer apply to this situation. 
Bell, however, showed that under additional hy¬ 
potheses it still works. 

Let 9 be events associated with the first parti¬ 
cle, and let 9 1 be events associated with the sec¬ 
ond particle. Let 9$ be another collection of events; 
we think of these as occurring in the past. Let P /; - 
be the probability assignment with apparatus set¬ 
ting i for the first particle and j for the second 
particle. 

It seems reasonable that there should be no 
possibility of intervention over long distances. This 
would say that the P f/ probabilities of events in 9 
associated with the first particle do not depend on 
the apparatus setting j for the second particle. 
Correspondingly, the P I; - probabilities of events in 
9 1 associated with the second particle do not 
depend on the apparatus setting i for the first 
particle. 

The following two locality conditions refer to a 
chosen collection of past events. Active locality 
rules out intervention and also rules out influence 
on the past events. Passive locality is relative to 
the same collection of past events. 

AL: Active locality: The P {j probabilities of 
events defined by events in 9 and 9 0 do 
not depend on j. The P f . probabilities of 
events defined by events in 9* and 9 0 do 
not depend on i. (An intervention on one 
particle cannot affect the relation of the 
other particle to its past.) 

Note: It follows that probabilities of past 
events in 9 0 are computed with a probabil¬ 
ity assignment P that depends on neither 
apparatus setting. 

PL: Passive locality: If an event A in 9 is 
equivalent to an event A in 9* with re¬ 
spect to P ijf then they are both equivalent 
to an event A 0 in 9 0 . (A perfect coinci¬ 
dence must have an explanation in a com¬ 
mon past.) 

BELL’S SECOND THEOREM 

Under the hypotheses of active and passive 
locality, there can be no family of probability mod¬ 
els that reproduces the results of quantum me¬ 
chanics. 
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The proof is not difficult, and here is a sketch. 
Let A in be associated with one particle, and let 
A in be associated with the other. Take them 
equivalent. By passive locality, A is equivalent to 
a past event A 0 in with respect to the probabil¬ 
ities P aa . By active locality, for these events we get 
the same results if we compute with the probabili¬ 
ties Pab • Thus, A is equivalent to A 0 with respect 
to the probabilities P ab . 

By similar reasoning, B' is equivalent to B 0 
with respect to the probabilities P ab . 

It then follows from simple probability argu¬ 
ments that (A and not B') is equivalent to (A 0 
and not B 0 ) with respect to the probabilities P ab . 
However, the probability of the past event (A 0 
and not B 0 ) is computed with a probability assign¬ 
ment P not depending on the apparatus. 

It follows that 

P ab [ A and not B'] = P[ A 0 and not B 0 ]. 

This can be repeated with all three terms. The 
probabilities no longer depend on the decision on 
the apparatus. Thus we can now prove Bell's sec¬ 
ond theorem with exactly the argument used for 
Bell's first theorem. 

The conclusion of this theorem is that if quan¬ 
tum mechanics is a correct way to make probabil¬ 
ity predictions, then we must give up the combi¬ 
nation of active and passive locality. We shall see 
how this happens in the various interpretations. 

Notice that in order to apply the theorem in a 
specific case one must define the collection of past 
events There may be some choice in this defi¬ 
nition. Let us look at two extreme examples. 

Example 1. Assume there is no relevant past. 
This means that the only past events are trivial: the 
sure event with probability one and the impossible 
event with probability zero. In this case active 
locality is merely the condition that there is no 
possibility of intervention. If this is so, then it is 
only passive locality that fails. 

Example 2. Assume that for each pair of appara¬ 
tus settings i, j the theory is deterministic except 
for random initial conditions at the past time. The 
probability assignment P f . makes every event 
equivalent to a corresponding event in *0- Passive 
locality obviously holds, so active locality must 
fail. How does this happen? Suppose an event A 
in & is equivalent to A 0 in ^ with respect to the 
probability measure P aa . Then A may be equiva¬ 
lent to some other event A 0 with respect to the 
probability measure P ab . The change of the appara¬ 


tus setting for the second particle has changed the 
relation between past and future for the first 
particle. 

Quantum mechanics has been well tested, and 
there is now considerable experimental support for 
the strong correlations over long distances pre¬ 
dicted by the theory. Quantities accessible to ex¬ 
periment seem to be local in the sense that the 
correlations cannot be used to make interventions. 
However, according to Bell's result, there is no 
collection of past events that simultaneously sat¬ 
isfy active and passive locality. The correlations 
seem unexplainable by a common past. 


Measured Events 

The traditional textbook interpretation of quan¬ 
tum mechanics is the Copenhagen interpretation. 
In this interpretation each consistent set of quan¬ 
tum events defines probabilities having a fre¬ 
quency interpretation. But which consistent set of 
events is chosen comes from outside the theory, by 
a choice of external measuring device. 

The problem that some people see with the 
Copenhagen interpretation is that it gives no ac¬ 
count of a reality that is not subject to measure¬ 
ment. According to Heisenberg, "the idea of an 
objective real world whose smallest parts exist 
objectively in the same sense as stones and trees 
exist, independently of whether we observe 
them...is impossible." (The quote is from W. 
Heisenberg, Physics and Philosophy , Harper and 
Row, New York, 1958, p. 129). There must always 
be a measuring apparatus external to the quantum 
system. One can perhaps draw the line between 
quantum system and measuring apparatus at vari¬ 
ous levels, but there is never a theory of what 
happens in a system that is not undergoing 
measurement. 

Furthermore, it is not evident what counts as a 
suitable external measurement. Clearly a person 
with a Ph.D. in physics and a research grant is 
allowed to perform a measurement. Maybe the 
measurement can be conducted automatically with 
the experimental apparatus purchased on the grant. 
But how about the world before grants. Can a 
dinosaur perform a measurement? And how about 
the world on a grander scale? Does the universe 
have a quantum description? 

The Copenhagen interpretation does not evade 
the locality issues. It is trivially true that different 
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probability measures correspond to different 
choices of measurement. The event (A and not B') 
is not compatible with the event (B and not C). 
Thus P ab [ A and not B'] is computed with a differ¬ 
ent probability measure from that used for P bc [ B 
and not C']. However, the coincidences are still 
present, and according to Bell's second theorem 
they have no explanation in a common past. 


Macroscopic Events 

Perhaps there should be an interpretation that 
does not depend on the notion of measurement. 
There are a number of more recent proposals, due 
to various authors, that I will put in one category. 
The Gell-Mann and Hartle version is perhaps rep¬ 
resentative [3]. They key concepts are consistent 
history, macroscopic event, and decoherence. 

In Bell's theorem the original conjunctions of 
quantum events are between compatible events. 
So any peculiar features of quantum mechanics are 
already present in this case. Nevertheless, some 
authors have proposed that problems of interpre¬ 
tation may be clarified by defining the probabili¬ 
ties of conjunctions of certain incompatible events. 
Griffiths was one of the pioneers in the direction 
[4], Here is a brief description of how this works. 

We now describe an operation known as reduc¬ 
tion or collapse of the state. Consider a fixed state, 
given by a one-dimensional subspace of the Hilbert 
space. Let B be an event with nonzero probability 
P[B]. The collapsed state is the projection of the 
one-dimensional subspace onto the subspace cor¬ 
responding to B. It is a new quantum state. Let us 
denote the probability of some other event A 
computed with respect to the collapsed state by 
P[A\B]. We might want to call this the conditional 
probability of A given B. Sometimes we may 
think of B as being defined before A, so that this 
is a probability computed in the course of a 
history. 

Given the conditional probability, we might try 
to define the probability of the history by the usual 
formula 

P[ Aand B] =P[A\B]P[B]. 

It is important to realize that the probability of the 
conjunction is thought of as being computed with 
respect to the original state. The collapse is merely 
a stage in the mathematics. 

When the events are not compatible, this defini¬ 
tion can lead to paradoxes. Consider A and B to 


be one-dimensional subspaces at an angle of 7 t/ 4 . 
Take the state to be perpendicular to A but at an 
angle 77/ 4 from B. Then P[ B] = P[ A\B] = \ so 
P[ A and B] = Yet P[ A] = 0. 

All that remains of the additivity condition is 
the equation 

P[A] = P[ A and B] + P[A and not B] + /(A, B). 

The interference term satisfies - ^< I(A, B) <\ 
and measures the failure of additivity. 

Let A and B be arbitrary quantum events. We 
say that B followed by A form a consistent history 
with respect to the given quantum state if 

P[ A] = P[ A and B] + P[ A and not B], 

This says that for this particular state the interfer¬ 
ence term is zero. 

The idea now is that this should apply to a 
certain class of macroscopic events. For this class, 
the decoherence effect should make the interference 
terms vanishingly small. Decoherence is a prop¬ 
erty of large systems that says that it is impossible 
to distinguish quantum mechanics from ordinary 
probability without examining in detail every par¬ 
ticle in the system. One of the early formulations 
of decoherence is by Hepp and Lieb [5]; see [6] for 
extensions. There is survey of the subject and its 
history in the article by Zurek [7]. The book of 
Omnes [8] has an extensive discussion. The deco¬ 
herence effect is not automatic; it is supposed to be 
a consequence of the dynamics of the particular 
system. 

To see how this might work, consider a system 
with two rather separate parts. One part is associ¬ 
ated with collective events , typically macroscopic 
events defined by the values of collective coordi¬ 
nates (center of mass and so on). The other part is 
an environment, presumably part of the atomic 
world. This part is associated with environmental 
events . It is assumed that the collective events are 
compatible with the environment events. 

We consider a situation where the two parts 
have interacted in such a way that the environ¬ 
mental part has become correlated with the collec¬ 
tive part. We say that the state makes a collective 
event B decoderent if B is equivalent to some 
environmental event E with respect to the state. 
The environment thus contains a record of the 
event B, in that when B happens or does not 
happen, the corresponding E happens or does not 
happen. 
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DECOHERENCE THEOREM 

Let A and B be collective events. Suppose the 
state makes B decoherent. Then the history B 
followed by A is consistent with respect to the 
state. 

Proof. Let E be the environmental event that 
makes B decoherent in the given state. Since A 
and E are compatible, we have 

P[ A] = P[ A and E] + P[ A and not E]. 

Since B and E are equivalent in the state, the 
collapsed state with respect to B is the same as the 
collapsed state with respect to E. Not only P[ B] = 
P[E], but also P[ A|B] = P[ A|E]. Hence we can 
replace E by B (and similarly not E by not B) and 
obtain 

P[ A] = P[ A and B] + P[ A and not B]. 

The prediction of such a theory consists of prob¬ 
abilities of sets of histories of macroscopic objects. 
The histories are part of the world independent of 
anyone's choice of what to measure. The outcome 
in a particular situation is the history that actually 
occurs. 

The construction of such a theory in full detail 
is a formidable task. The purpose of the theory is 
to make frequency predictions about sets of consis¬ 
tent histories, independent of any notion of mea¬ 
surement. Therefore it must specify which particu¬ 
lar collection of consistent histories is relevant. 
Gell-Mann and Hartle [3] have given criteria that 
such a collection must satisfy. However, questions 
of the existence and of the uniqueness of such a 
collection are difficult. 

The puzzling aspect of such an interpretation is 
that it gives little role to the subatomic world 
except as a way of making predictions about the 
world on a large scale. If there were a universe of 
only a few particles, in which decoherence effects 
did not play a role, then such an interpretation 
would be vacuous. 

Note also that any such interpretation must 
confront the implications of Bell's theorems. By 
Bell's first theorem, there will not be a macroscopic 
event in the set of consistent histories correspond¬ 
ing to each quantum event. There may be macro¬ 
scopic events corresponding to certain atomic level 
quantum events, those for which a suitable cou¬ 
pling to the macroscopic world exists. An example 
of this might be the pointer reading corresponding 
to the deflection of a particle by a magnetic field. 


Perhaps there is no possibility of intervention over 
long distances. However, according to Bell's sec¬ 
ond theorem, the interpretation must admit the 
violation of the combination of active and passive 
locality. 


Position Events 

Another class of interpretation describes the 
world on the atomic scale, but only to events 
defined in terms of the position of particles. One 
can look at the positions over time, so one can 
assign probabilities to sets of particle trajectories. 
The outcome in a given situation is a particular 
particle trajectory. 

Two variants are Bohm's theory (Bohmian me¬ 
chanics) and Nelson's theory (stochastic mechan¬ 
ics). The literature on Bohm's theory can be traced 
back from a recent collection [9], and work is 
ongoing [10, 11]. Nelson's theory is explained in 
his books [12, 13]. The fact that there are two 
competing theories, giving different predictions for 
the particle trajectories, is a weakness. Perhaps 
there is some comfort in the fact that the fixed time 
position probabilities are the same in both theories 
and agree with those of quantum mechanics. And 
this may be all that is possible to measure in 
practice. 

These theories discard the usual elaborate su¬ 
perstructure of quantum mechanics, in which there 
are events of all sort. Probability is only assigned 
to position events (at various times). It is surpris¬ 
ing how little this seems to matter. It is wise to 
consider these theories before making general dec¬ 
larations on the interpretation of quantum me¬ 
chanics, since often they provide a counterexam¬ 
ple. 

The Bell theorems again apply to these theories. 
Spin is determined by introducing certain mag¬ 
netic fields, which deflect the particles, eventually 
resulting in a change in position. The introduction 
of the magnetic fields does change the probabili¬ 
ties, as envisaged in Bell's second theorem [14]. 
The main defect of these theories is that with the 
natural definition of past events they violate active 
locality, at least on the level of the trajectories. 
However, if one looks only at the final positions of 
the particles, the predictions agree with quantum 
mechanics. So again there is no practical possibility 
of active intervention for such events, and the 
situation is as for the Copenhagen interpretation. 
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I find stochastic mechanics attractive, so I will 
give a brief account. I will begin with the descrip¬ 
tion of the time-reversible stationary diffusion pro¬ 
cess associated with the quantum mechanical 
ground state. Then I will give some indication of 
the diffusion process associated with a quantum 
state that evolves in time. In both cases the out¬ 
come of the probability experiment is a particular 
particle trajectory. 

In ordinary quantum mechanics the position 
events at fixed time are compatible, but position 
events at different time are usually not compatible. 
With the diffusion process we can define the con¬ 
junction of position events at different times. Of 
course, this is a deviation from the usual interpre¬ 
tation of quantum mechanics. 

Consider the Hilbert space of square integrable 
wave functions and the Schrddinger operator 

1 ft 2 

H =-V, 2 + V(x). 

2 m 

Assume that H has a lowest energy eigenfunction 
ip > 0 with 

Hip = Eip. 

A position event at fixed time is defined by a 
subset of position space. If A is such a subset, then 
there is a corresponding indicator function. This 
function is 1 on the set A and 0 on its comple¬ 
ment. We also denote the indicator function by A. 
Multiplication by the indicator function is the pro¬ 
jection onto the corresponding closed subspace of 
Hilbert space. 

We let A(s) be the event that the trajectory of a 
particle is in the set A at time s. Let A s be the 
operator exp[-s(H - E)/h) A exp(s(H - E)/h)\. 

Now consider two such position events, A and 
B and corresponding times s < t. Then the proba¬ 
bility of the particle being A at time s and in B at 
time t is given by 

P[ A(s) and B(t)] = (if , A s B t ip ) 

= (ip, Ae~ (t - s)(H ~ E)/h Bip). 

Note that this is a well-defined probability for the 
position of the particle at two different times. For 
t = s the particles have not moved, so this is just 
the probability that the particle is in the intersec¬ 
tion of the two sets. However, as t — s becomes 
infinite, the probability approaches ( if, A if) (if, 
Bif). This says that the two particle positions be¬ 
come independent. 


The probability calculation may be reproduced 
in the language of diffusion theory. Define the 
backward diffusion operator L and forward diffusion 
operator L* by 

1 

L = --(H - E)/hip 
4 * 

and 

1 

L* = ~ip(H - E)/h -. 

4 * 

Note that LI = 0 and L*p = 0, where p = ip 2 . 

The explicit form of these operators is as fol¬ 
lows. Define the diffusion coefficient cr 2 = h/m. 
Define the drift vector field u to be 

2 v ^ 

u = a -. 

Then the backward diffusion operator is 
L = + u- V,. 

The backward diffusion operator gives the ex¬ 
pected values at future times as a function of the 
initial point. Thus e tL f(x) is the expected value of 
/ of the particle position at time t, when the 
particle is started at point x. The forward diffusion 
operator is 

L* = |cr 2 V 2 — V* • (m ). 

The forward diffusion operator describes how 
probability evolves. Thus, if g is the probability 
density at time zero, then e tL *g is the probability 
density at time t. 

The quantum mechanical operator — (H - E)/h 
and the diffusion operators L and L* all have the 
same spectrum, since they are similar. So this is a 
perfect correspondence between quantum mechan¬ 
ics and diffusion theory. 

We can now rewrite the probability that a dif¬ 
fusing particle is in A at time s and in B at time t. 
This is 

P[ A(s) and B(t)] = ( p, Ae (t ~ s)L B) 

= (e (t ~ s)L \Ap),B ). 

The backward operator maps events from future 
to present, while the forward operator maps prob¬ 
abilities from present to future. 
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We may write the forward diffusion equation 
explicitly as a partial differential equation 

dp 1 

— = L*p = — cr 2 V 2 p - V x • (up). 

This describes how probability evolves when a 
particle is diffusing randomly with diffusion con¬ 
stant a 2 and drift u. The drift vector field u is 
space-dependent. If we start the particle at a fixed 
point x 0 , then the solution is a function p(x 0 , x, t ) 
that describes the probability density after time t. 

If we start the particle at a random position 
with density p, then the probability density at a 
later time is fp(x 0 )p(x 0 , x, t) dx 0 . The condition 
for a stationary density is 

Jp(x 0 )p(x 0 , X, t) dx Q = p(x). 

This is equivalent to 

o = L*p= \a 2 V x p - V x ■ (up). 

As we have seen, the solution is p = i)/ 1 . The 
equilibrium for the diffusion is the quantum me¬ 
chanical density. In fact, the equilibrium is main¬ 
tained by the mechanism of detailed balance, which 
says that the equilibrium is due to a local balance 
between the density gradient and the drift. This is 
expressed mathematically as 

0 = \a 2 V x p - up. 

This leads to the following physical picture for a 
particle in a quantum mechanical eigenstate. The 
particle diffuses randomly, but the drift is such 
that it compensates for the diffusion and leads to 
an equilibrium. This probability for the position of 
the particle in this equilibrium agrees at every 
time with the prediction of quantum mechanics. 
However, the diffusion model provides a picture 
of the random trajectory of the diffusing particle. 
Such a picture is forbidden in the ordinary inter¬ 
pretation of quantum mechanics. 

In the general case one gets a diffusion process, 
but it is no longer homogeneous in time. Consider 
a solution ij/ of the time-dependent Schrodinger 
equation. Such a solution is in general complex. 
Define two vector fields, a drift u and a velocity, 
v, by 





These are both drift terms in the diffusion equa¬ 
tion. Sometimes u is called an osmotic velocity 
and v a current velocity. In any case, the corre¬ 
sponding diffusion process is given by the forward 
equation 

dp 1 

— = 2 0-2 y2 P “ V 

with diffusion coefficient a 2 and drift u + v. The 
coefficients are now both space and time depen¬ 
dent. Consider the solution where the particle is 
started at point x 0 at time t Q . The solution 
p(x 0 , x, t 0 , t) describes the probability density for 
diffusion to x at time t . 

If we start the solution at time t 0 with probabil¬ 
ity density p(x 0 , t 0 ) then the probability at a later 
time is 

p(x, t) = J p(x 0 , £ 0 )p(x 0 , x, t 0 , t) dx 0 . 

We can find such density explicitly by taking p = 
\ip\ 2 , the quantum mechanical prediction. Then it 
turns out that there are separate equations involv¬ 
ing u and v. The equation involving u is just the 
detailed balance equilibrium we had before, but 
with time-dependent coefficient: 

0 = \(j 2 V x p — up. 

The equation involving v is the transport equa¬ 
tion: 

dp 

Together these equations imply that p satisfies the 
forward diffusion equation with diffusion coeffi¬ 
cient a 2 and total drift u + v. 

The fact that the diffusion equation involves 
two vector fields u and v at first seems difficult to 
motivate. However, u does not change under time 
reversal, while v changes sign under time reversal. 
Thus the role of the drift u is to attempt to estab¬ 
lish equilibrium, while the role of the velocity v is 
to effect overall movement. 

Bohm's theory differs from stochastic mechanics 
in that the motion is deterministic, except for the 
random initial conditions. In Bohm's theory the 
motion is given by the velocity field v alone. 
Clearly there is a sense in which Bohm's theory is 
much simpler. However, stochastic mechanics has 
the attractive feature that it gives a natural role to 
the diffusion constant a 2 = h/m that is derived 
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from Planck's constant and the mass. It is amusing 
to note that when m is the mass of an electron, the 
diffusion constant a 2 has a rather memorable and 
reasonable value, about one square centimeter per 
second. 


Conclusion 

According to the mathematical apparatus of 
quantum mechanics, each quantum event has a 
probability. The question is whether these events 
correspond to sets of outcomes, so that the proba¬ 
bility has a frequency interpretation. Here is the 
situation in the three interpretations, (i) In the 
Copenhagen interpretation each compatible collec¬ 
tion of quantum events has a corresponding set of 
outcomes. However, the possible outcomes are not 
intrinsic to the system but depend on a choice of 
external measurement, (ii) In interpretations where 
events consist of sets of consistent histories subject 
to a decoherence condition, the outcomes are 
macroscopic histories. However, there need be no 
frequency interpretation for the probabilities of 
quantum events on the atomic scale, (iii) In the 
interpretations in which the outcomes are particle 
trajectories, the only relevant probabilities are those 
corresponding to position events at various times. 
So the probabilities of other quantum events either 
have no frequency interpretation or their interpre¬ 
tation is ultimately expressed by position events. 

The subspaces of Hilbert space describes a huge 
collection of events. Any interpretation of quan¬ 
tum mechanics must provide an additional crite¬ 
rion to select a relatively small collection of events 
for which the probabilities have a frequency inter¬ 


pretation. In the Copenhagen interpretation this 
is given by choice of a measurement and a cor¬ 
responding set of outcomes. Unfortunately, this 
choice is external to the theory. On the other hand, 
the macroscopic event and position event interpre¬ 
tations have a single set of outcomes and assign 
probabilities to at least some quantum events. They 
point the way to a possible understanding of quan¬ 
tum mechanics as a description of an isolated 
system. Perhaps that is enough. 
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ABSTRACT: Recent discoveries show that phenomena in highly excited states of 
atoms and molecules are qualitatively different from the standard "planetary" model of 
the atom and the near-rigid model of molecules, characteristic of these systems in their 
low-energy states. Highly excited states of two-electron atoms show "molecular" behavior 
of a nonrigid linear structure undergoing collective rotation and vibration. Highly excited 
states of molecules display strong effects of normal mode anharmonicity and coupling 
that induce bifurcations on the route to molecular chaos and isomerization. Hints of a 
unified view for both systems are emerging in terms of approximate dynamical symmetry 
principles and methods of nonlinear classical mechanics such as bifurcation theory. 
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Introduction 

A surge of new knowledge, experimental and 
theoretical, of highly excited quantum states 
of few-body systems is leading to a new view of 
the motion of electrons in atoms and of atoms in 
molecules. We are accustomed to thinking of atoms 
in their ground states in terms of the "planetary" 
model of independent particles, described by an 
electronic configuration, e.g., the Is 2 configuration 
of the ground state of helium. In contrast to this 
nearly uncorrelated, nonrigid structure of an atom 
in its ground state, the ground- and low-energy 
rotation-vibration states of molecules are de- 
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scribed in terms of the normal mode picture of 
small amplitude, nearly harmonic motion about a 
rigid equilibrium structure. However, efforts of 
many workers in recent years show that in highly 
excited states each system at high energy shows 
some resemblance to the behavior that describes 
the other at low energy. In highly excited states, 
the motion of electrons in atoms takes on charac¬ 
teristics similar to molecular rotation-vibration 
modes, while the motion of atoms in highly ex¬ 
cited vibrational states, especially in systems un¬ 
dergoing isomerization, shows drastic departures 
from the low-energy normal mode picture. More¬ 
over, the tools used to describe high-energy behav¬ 
ior, such as effective Hamiltonians, are beginning 
to give signs of a unified formal description of the 
two systems. 

My purpose here was to highlight some of the 
most important of these developments. I give ref¬ 
erence to some of the key original sources, in 
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which can be found many other important refer¬ 
ences that must be left out here. Elsewhere [1, 2], I 
have presented distinct points of view regarding 
some aspects of the work discussed here. 


The “Molecular Atom : 99 Rotations, 
Vibrations, and Supermultiplets 

The electrons in atoms and molecules do not 
have fixed positions relative to the atomic nuclei, 
and it is not evident that they have a "structure" 
in any geometrical sense. In the classical model of 
the hydrogen atom, the electron orbits the nucleus 
in an ellipse. In a multielectron atom, the Coulomb 
repulsion between electrons substantially perturbs 
the elliptical orbits. This is the problem of electron 
correlation. Two-electron atoms continue to elicit 
intense interest as the simplest prototype of the 
correlation problem. Conventionally, one describes 
an excited state of the atom in terms of a configu¬ 
ration. For example, a state with both electrons in 
a 2p orbital has a configuration 2 p 2 . This indepen¬ 
dent particle configuration description is the quan¬ 
tum mechanical analog of the planetary model. By 
the late 1970s, accumulating evidence was show¬ 
ing that the configuration description is severely 
inadequate for highly excited states. Nonetheless, 
it came as something of a shock [3-6] to discover 
that the excited atom had many properties in com¬ 
mon with the motion of a triatomic molecule. 
There is now compelling evidence that in states 
with both electrons excited from the ground level 
the atom behaves according to a "molecular" 
model of collective rotations and vibrations, simi¬ 
lar to a highly nonrigid linear XYX molecule, with 
the electrons correlated on opposite sides of the 
nucleus. 

I will outline how this picture has developed, 
emphasizing the physical ideas, but giving high¬ 
lights of the formal development. The interested 
reader should consult the references for the many 
technical details. 

APPROXIMATE DYNAMICAL SYMMETRY OF 

THE TWO-ELECTRON ATOM 

Classically, the electron in a one-electron atom 
with Coulomb potential follows an elliptical (Kep¬ 
ler or planetary) orbit, in which the angular mo¬ 
mentum L is an exact constant of motion. The 
three components of the angular momentum vec¬ 


tor L generate the group of rotations in three- 
dimensional space, denoted 0(3). 

The elliptical Kepler orbit has far more symme¬ 
try than demanded merely by rotational invari¬ 
ance, however, a planetary orbit that conserves 
angular momentum L could still have a precessing 
perihelion. The Kepler orbit has a constant of the 
motion [7] in addition to the angular momentum, 
the Runge-Lenz vector A. This vector points along 
the semimajor axis of the ellipse. It is defined in 
terms of the instantaneous position and momen¬ 
tum at each point in the orbit, but as a constant of 
motion, it is invariant in size and direction. The 
vector A together with the angular momentum L 
generates the group 0(4) of rotations in an abstract 
four-dimensional space, which contains the ordi¬ 
nary angular momentum group 0(3) as a sub¬ 
group. 

In the two-electron atom, because the system is 
invariant under rotations in space, the total angu¬ 
lar momentum 

L = L 1 + L 2 (1) 

is an exact constant of motion that generates a 
group 0(3) 12 . By analogy with the one-electron 
atom, one might think to combine L with the total 
Runge-Lenz vector 

A = A x + A 2 (2) 

to form a two-electron 0(4) 12 group. If the two- 
electron system behaved like a "planetary" atom, 
each electron would move in an independent ellip¬ 
tical orbit, with A = A x + A 2 a conserved quan¬ 
tity. However, this is not the case, because A is 
badly broken as a constant of motion by the strong 
Coulomb repulsion between electrons. 

Even though this 0(4) 12 group is not an exact 
symmetry, it is reasonable to try to use it as an 
approximate symmetry. One might hope this would 
be of use in ordering the spectrum, but in a less 
complete way than if O (4) 12 were exact. This would 
be very much in the spirit of the use of approxi¬ 
mate symmetries earlier in nuclear and elementary 
particle physics, where these kinds of ideas led to 
developments such as the quark model. Unfortu¬ 
nately, when this was tried for the two-electron 
atom, it was found [8] that O (4) 12 was not of much 
use even in this limited sense of an approximate 
symmetry. 

A key advance came when Wulfman [9] and, 
independently, Herrick and Sinanoglu [10, 11] 
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tried, instead A of (2), the operator 

B = A x - A 2/ (3) 

where the crucial difference is the minus sign. A 
heuristic physical picture of the electron correla¬ 
tion is shown in Figure 1. The idea is that overall 
maximizing the difference between the Runge-Lenz 
vectors A 2 and A 2 minimizes the Coulomb repul¬ 
sion between the electrons. It was found that B is 
an approximate constant of motion, unlike the 
experience with A. Furthermore, it is easily shown 

—y —) 

from the commutation relations [7] that L and B 
together generate a distinct 0(4) group. I will call 
this approximate symmetry group 0(4)' 12 to dis¬ 
tinguish it from the unsuccessful O (4) 12 of Eqs. (1) 
and (2). 

ROTATIONS AND VIBRATIONS OF THE 

“MOLECULAR” ATOM 

Of course, the heuristic picture in Figure 1 of 
electrons moving in stable ellipses is far from a 
literal image of either the classical or quantum 
mechanics of the atom. A more realistic depiction 
of a quantum state of the atom with both electrons 
in the same shell is given in Figure 2. The atomic 
structure is like that of an elongated cloud or blob 
of charge, with the density of the two electrons 
concentrated at the ends. This can be rationalized 
by saying that the repulsion of the electrons tends 
to correlate them on opposite sides of the nucleus. 
At the same time, a residual effect of the quantum 
mechanical configuration (with both electrons in 
the same shell in Fig. 2) is to concentrate the 
electrons very loosely at a most favored distance 
from the nucleus. This structure of the electron 
cloud surrounding the nucleus is that of a highly 
nonrigid system. Nonetheless, the correlation gives 



FIGURE 1. Heuristic picture of electron correlation in 
two-electron atom with both electrons in the same shell. 
Because of the electronic repulsion, the most favorable 
configuration maximizes the difference B = A 1 - A 2 of 
Runge-Lenz vectors. 



FIGURE 2. A depiction of the two-electron atom with 
both electrons in the same shell (which may be an 
excited shell). The atom has the structure of a “blob” of 
charge. The electrons are concentrated on opposite 
sides of the nucleus, because of the electronic repulsion, 
and are concentrated at some most probable distance, 
because of the shell structure. The overall structure has 
the symmetry of a linear, symmetric XYX molecule, with 
X’s the electrons and Y the nucleus. 

the atom a distinct shape—in fact, a shape with 
the symmetry of the molecular point group D xfr 
If the atom has a structure with a symmetrical 
shape, it is natural to wonder if this structure can 
exhibit collective modes, e.g., rotations and vibra¬ 
tions. The beginning of the "molecular" model of 
the atom came when Kellman and Herrick [3] used 
the O (4)' 12 group structure to classify the atomic 
levels into "rotor series," i.e., 0(4)' 12 multiplets or 
sets of states that behaved like a nonrigid rotor. 
Next [4-6], the 0(4)' 12 multiplets were combined 
into "supermultiplets." The collection of the 0(4)' 12 
rotor series into the more comprehensive super- 
multiplet classification describes bending vibra¬ 
tions of the "molecular" atom. (The role of radial 
excitations in this picture, which came to be better 
understood later, is discussed below.) Figure 3 
depicts the rotations and bending vibrations of the 
electron cloud in the "molecular" atom. The su¬ 
permultiplets give the classification of "intrashell" 
states of the atom with both electrons in the n = 2, 
3, 4, and 5 shells and also "intershell" states with 
the electrons in different shells. The supermulti¬ 
plets for both electrons in n = 3 are shown in 
Figure 4, indicating the role in the classification of 
rotational excitations, bending vibrations with vi¬ 
brational angular momentum along the "molecu¬ 
lar" symmetry axis, and bending vibrations with 
no angular momentum. Clearly, this classification 
reveals a great deal of order in a spectrum which 
does not at first show much order at all. Not only 
is the spectral pattern of Figure 4 similar to that of 
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FIGURE 3. Collective rotation-vibration modes of the 
“molecular” atom. The structure has degenerate bending 
vibrations and overall rotations, like a linear triatomic 
molecule. 

a linear, nonrigid triatomic molecule, but the pat- 

tern of terms U matches that of a symmetric 
triatomic molecule with rotor-vibrator states obey¬ 
ing the constraints imposed by Fermi statistics of 


the outer particles, i.e., the spin-orbital-parity clas¬ 
sifications of the terms derived from atomic shells 
and the symmetry restrictions of the XYX molecule 
dovetail perfectly [6]. 

There have been significant developments since 
the work of Herrick and Kellman. Some highlights: 
I showed [12] that the molecular picture could be 
applied to atoms with two electrons outside a 
closed shell, i.e., alkaline earth atoms. Berry and 
co-workers [13, 14] analyzed the atomic wave 
functions in terms of a basis with bending vibra¬ 
tion and stretching modes. In the original model of 
Kellman and Herrick, the molecule was taken to 
have rotations and bending vibrations, but not 
stretching excitations, inasmuch as the radial de¬ 
grees of freedom seemed quite reasonably to be 
encompassed in the n = 2,3,... shell structure. 
Ezra et al. [15] developed a semiclassical treatment 
of the radial motion of a collinear atom and found 
evidence for radial motions that can be described 
as an antisymmetric stretch. Rost et al. [16, 17] 
gave a unified ''molecular orbital" treatment of 
the radial, bending, and rotational motion. They 
found an intricate interweaving of bend and stretch 
excitations. These investigations strongly support 
the idea of the atom as being like a highly nonrigid 
triatomic molecule, but now understood to have a 
systematic involvement of radial motion in the 
classification. Richter et al. [18] discovered the 



FIGURE 4. Supermultiplet rotor-vibrator “molecular” classification of doubly excited states of He with both electrons 
excited to the n = 3 shell, from [4]. In the largest, d = 2 supermultiplet, the column labeled [4,0] + is a multiplet of 
O (4)' 12 . This multiplet is a “rotor series” of pure rotational excitations with L = 0, 1 , 2,3,4, built on the lowest energy 
state, which is 1 S e . Adjacent columns of the d = 2 supermultiplet are rotor series built on excitations of the bending 
vibration, which have / = 1 unit of angular momentum along the symmetry axis of the “molecular” atom. The d = 1 
supermultiplet is built on an 1 S e state with two bending quanta and / = 0 units of angular momentum along the 
symmetry axis. It has its own rotor series and / = 1 excitations. The d = 0 supermultiplet is a lone 1 S e state with four 
bending quanta and / = 0. See [4, 6] for more details. 
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existence of "frozen planet" states, in which the 
two electrons are locked in a stable configuration 
on the same side of the nucleus. Ostrovsky and 
Prudov [19] found frozen planet states in which 
radial and bending modes of the "molecule" are in 
an approximate 2:1 frequency ratio, suggestive of 
molecular Fermi resonance! Finally, as described 
later, I have been investigating some connections 
of the atomic supermultiplet classification with 
recent models of molecular vibrations that use Lie 
group symmetries. 


Order in Chaos in Highly Excited 
Molecules? 

We have seen that the spectrum of a highly 
excited two-electron atom resembles that of a lin¬ 
ear molecular structure undergoing rotation and 
vibration. This can be turned around to ask, What 
happens to a real molecule when it is excited high 
above the ground state? How does one describe 
the changes that must occur from the low-energy 
picture of small-amplitude oscillations about a 
near-rigid structure? 

The spectrum of vibrational excitations is tradi¬ 
tionally described in terms of normal modes [20]. 
However, at high energies, the modes can become 
strongly coupled to each other. As the atoms per¬ 
form progressively large excursions from the equi¬ 
librium rigid structure, the motion can become 
chaotic, in the sense of nonlinear dynamical sys¬ 
tems. Eventually, the atoms may rearrange into a 
different molecular structure or the molecule can 
fragment. The normal modes' description, ade¬ 
quate for a rational account of the spectrum at low 
energies, is no longer physically valid. But it is 
precisely here that the spectrum may become fear- 
somely complex. Spectra may be our window on 
the molecular world, through which to obtain in¬ 
formation such as the forces between atoms which 
govern molecular dynamics and reactivity, but as 
with all visualization, a great deal of sophisticated 
processing is necessary to extract the information 
encoded in the raw data of experience. The search 
for new ways to model spectra and organize them 
into meaningful patterns, if any such exist, has 
been a concerted effort of fundamental chemical 
science of the past two decades. 

Even though the standard approach starts with 
the notion of a set of normal modes [20], at all but 
the lowest energies, it is necessary to take into 


account anharmonicity of the modes. In addition, 
there are important couplings: In general, the vi¬ 
brations are not truly separable into independent 
modes. (For simplicity, I will neglect rotation. Most 
of the statements carry over when rotation is in¬ 
cluded.) At low energy, the combination of anhar- 
monicity and coupling for the most part can be 
"swept under the rug." But this becomes unten¬ 
able at higher energy, where anharmonicity and 
coupling become inexorably more important. 

Here, I will discuss two aspects of this problem 
of complex spectra of polyatomics: (1) When the 
most important couplings are taken into account, 
are there any remaining approximate constants of 
motions? If so, these intact dimensions of the 
molecular phase space act as "bottlenecks" to en¬ 
ergy flow within the molecule. This will be impor¬ 
tant, e.g., in efforts to control molecular dynamics 
with intense laser pulses. (2) What happens to the 
dimensions of phase space that are not left intact? 
This is the problem of the branching of the normal 
modes in bifurcations on the route to molecular 
chaos. I will outline some promising approaches to 
these problems, developed through intense efforts 
of many workers. 

THE SPECTROSCOPIC FITTING 

HAMILTONIAN WITH NORMAL MODE 

COUPLINGS 

I begin by describing the standard effective 
Hamiltonian used to fit experimental spectra. The 
spectroscopic Hamiltonian (SH) is intended as an 
"effective Hamiltonian" to bridge the gap between 
experimental spectral data and the fundamental 
description of the true molecular quantum me¬ 
chanics. A natural question is how the SH is re¬ 
lated to the true Hamiltonian. Fried and Ezra [21] 
and independently Sibert [22] showed how to ob¬ 
tain the SH by beginning with a potential surface 
and transforming approximately to SH form by 
means of perturbation theory. 

The SH is a quantum mechanical model of vi¬ 
brational normal modes and their couplings. The 
SH contains a number of variable parameters. 
These are optimized to give the best fit to experi¬ 
mental data. The SH is in no sense a fundamental 
description of the molecule. That involves treating 
it as a collection of nuclei and electrons and essen¬ 
tially solving for the exact quantum mechanics. 
When feasible, as for a few small molecules, that 
leads to a molecular potential energy surface, with 
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vibrations emerging as quantum mechanical mo¬ 
tion of the atoms on this potential. This is very 
difficult to carry out in practice. 

The starting point of the SH is a set of elemen¬ 
tary normal modes described by a Hamiltonian 
function H 0/ which depends on the number of 
quanta n i in each of the modes i. The simplest 
model is just a set of N harmonic normal modes. 
However, anharmonicity is evident from the spec¬ 
tra even at low energy. It is therefore standard to 
use an H 0 with harmonic terms proportional to the 
number of quanta n i and frequency a) t in each 
mode, but also anharmonic terms proportional to 
higher powers of the quantum numbers, including 
cross-terms: 

N 

H 0 = E «,(«,-) + Zxainf 

i — 1 i 

N 

+ HXi y(n,•)(«;) + — • (4) 

■'</ 

One can try to fit an experimental spectrum by 
optimizing the parameters w if etc., in H 0 . For 
many systems, especially at low energy, this may 
suffice to give a good match with experiment. 
However, a Hamiltonian of the form H 0 often is 
not adequate. In such cases, coupling between the 
normal modes must be introduced beyond the 
cross-anharmonic terms For example, 

Darling and Dennison [23] found that an interac¬ 
tion must be added to H 0 which couples the stretch 
normal modes in a molecule such as H 2 0. This is 
called "Darling-Dennison resonance" because of 
the near-resonant frequencies of the symmetric and 
antisymmetric stretches s and a . Addition of the 
Darling-Dennison coupling V S ^ D gives a Hamilto¬ 
nian H = H 0 + V S ^ D which is no longer separable 
into modes s and a. 

The Darling-Dennison coupling V DD is for two 
modes s and a which have approximately the 
same frequency. This is sometimes called a 1:1 
resonance coupling. Other couplings can arise be¬ 
tween modes with frequency ratio mm. These are 
referred to as Fermi resonances [24]. For example, 
there is a prominent 2:1 Fermi resonance in H 2 0 
between the symmetric stretch and bend. A global 
fit of the spectrum, including states with promi¬ 
nent excitation of all three modes, requires a mul¬ 
tiresonance Hamiltonian: 

h = h 0 + + vfrr- (s) 


The multiresonance character of the spectra has 
an important connection with identifying approxi¬ 
mate constants of motion of the molecule. These 
act to channel energy flow within the molecular 
phase space, simplifying the spectra and the corre¬ 
sponding intramolecular dynamics. However, 
within the energy flow channels or pathways de¬ 
fined by the constants, the resonances can produce 
bifurcations of the normal modes and classical 
chaos, with considerable complications. I will con¬ 
sider these in turn. 

POLYAD SYMMETRY, APPROXIMATE 
CONSTANTS, AND CHANNELS FOR 

MOLECULAR ENERGY FLOW 

Approximate constants of motion are quantum 
numbers that are left intact when the resonance 
couplings needed to fit the spectrum are taken into 
account. If these exist, they are very important 
because they are related to patterns that can be 
uncovered in the experimental spectrum. In dy¬ 
namical terms, on the time scale corresponding to 
the level of energy resolution in the spectral fit, 
they channel energy flow in the molecular phase 
space when energy is deposited at some specific 
location in the molecule. This could be crucial for 
efforts to control molecules with intense beams of 
coherent light pulses. The theory [21, 25] for find¬ 
ing approximate constants and energy channels 
uses a simple "resonance vector" analysis of the 
SH. The quantum numbers that are preserved are 
"polyad" numbers which are special combinations 
of the zero-order quantum numbers. 

The resonance vector method was first applied 
for experimental data [25] to absorption spectra of 
acetylene (C 2 H 2 ) of Smith and Winn [26] that 
probe the flow of energy initially deposited in the 
C—H stretch motions. There are a multiplicity of 
resonance couplings at high energy. Nonetheless, 
two good quantum numbers are found [25] to 
remain from the resonance vector analysis. The 
polyad analysis was applied by Jonas et al. [27] to 
spectra which probe the bends of acetylene, with a 
bearing on observation of hierarchies of time scales 
for the energy flow. Temsamani and Herman [28, 
29] analyzed the polyads in constructing an effec¬ 
tive SH Hamiltonian. Solina et al. [30, 31] directly 
"unzipped" the C 2 H 2 spectrum into polyads. 
When viewed as individual polyads, the spectra 
reveal a great deal of order that is not otherwise 
evident. Most recently, direct absorption and fluo¬ 
rescence data were jointly analyzed [32] from the 
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point of view of polyad constants, with the goal of 
constructing the SH from as global a view of the 
molecular phase space as possible. An open ques¬ 
tion is whether the polyads can be further " un¬ 
zipped" into subpolyads, revealing still more hid¬ 
den order in the spectrum. 

BRANCHING OF THE NORMAL MODES AND 
THE STRECTERE OF MOLECELAR CHAOS 

As shown by the need to use multiresonance 
Hamiltonians, it is evident that the couplings pre¬ 
clude using the normal modes by themselves in all 
but the coarsest description of the spectra. Some 
approximate constants of motion may be left intact 
as polyad numbers. As seen above, these are ex¬ 
tremely important, yet easy to abstract from the 
SH fit to the experimental spectrum. However, 
knowledge of the intact polyad numbers leaves 
crucial questions unanswered. What happens when 
the individual normal modes are destroyed or 
modified by couplings? What are the signatures of 
this in the spectrum? What are the dynamical 
consequences, e.g., the changes in the natural mo¬ 
tions of the molecule? These questions are impor¬ 
tant for using spectra to infer information about 
molecular potentials and may be crucial for efforts 
at coherent control of polyatomic molecules. From 
the viewpoint of classical mechanics, the destruc¬ 
tion of the normal modes occurs when they un¬ 
dergo bifurcations . When a normal mode bifur¬ 
cates, new types of anharmonic or nonlinear mo¬ 
tions are formed in place of, or in addition to, the 
original modes. The goal of recent work [1, 33-37] 
has been to characterize the bifurcations associated 
with the SH applied to data of specific molecules, 
in order to identify the new nonlinear modes and 
related spectral patterns that result at high energy. 

A difficult aspect of using bifurcation theory in 
spectral analysis is the high number of degrees of 
freedom in a polyatomic molecule. One confronts 
the not-well-understood problem of bifurcations in 
a high-dimensional classical phase space. In addi¬ 
tion, the classical dynamics, in general, shows a 
mixture of chaotic and regular motion, so one also 
faces the notorious problem of the connection be¬ 
tween chaotic classical dynamics and quantum 
states in this "mixed'' regime. 

Completely solving for the bifurcations of a 
chaotic Hamiltonian is presently impossible and 
probably not very useful. Instead, using as an 
exemplar the three-mode spectroscopic Hamilto¬ 
nian of (5) for H 2 0, work has focused [36-40] on 


obtaining and exploiting the large-scale bifurcation 
structure. By this is meant the principal branchings 
of the original normal modes as the system be¬ 
comes chaotic. This gives the principal structure 
around which the chaotic phase space is orga¬ 
nized. This information has been used in several 
methods for assignment of spectra and analysis of 
energy-level patterns [37-40]. Although success¬ 
fully carried out for H 2 0, the assignment with 
spectral pattern recognition is largely unchartered 
territory for larger molecules, with investigations 
currently in progress. 


More Alike at Higher Energy? 

We have seen that at high energy the two-elec¬ 
tron atom has spectral patterns similar to the rota¬ 
tion-vibration modes of a molecule, while in ac¬ 
tual molecules at high energy the vibrational 
modes are transformed in bifurcations because of 
resonant couplings. To a certain extent, the '■' plane¬ 
tary" atom has become like a "molecular" atom 
and the rigid molecule less regular. Still, the two 
systems could become even more similar. A real 
molecule can have excursions of the atoms com¬ 
pletely out of its equilibrium configuration, with 
the possibility of multiple minima in the potential 
energy surface, as in isomerization. Could the mo¬ 
tion of these "wandering atoms" come to resemble 
the correlated but nonrigid motion of the electrons 
in an atom? On the other hand, could electrons in 
atoms sometimes become rigidly localized to such 
an extent as to speak of Fermi resonances between 
modes of well-defined frequency? Also, could 
atoms show multiple quasi-rigid configurations, 
analogous to an isomerizing molecule? There is 
evidence that two-electron atoms have "frozen 
planet" states [18] in which the electrons really are 
locked into a second "molecular" configuration, 
with the electrons on the same side of the nucleus. 
Furthermore, there are indications [19] that the 
frequencies of the normal modes of these states are 
well defined and may sometimes be in Fermi reso¬ 
nance, i.e., have close to integral ratios. On the 
other hand, evidence about isomerizing molecules 
is now starting to accumulate [41-47], and distinct 
experimental and theoretical hallmarks are appear¬ 
ing. So, there is the prospect of soon understand¬ 
ing just what are the similarities and correspon¬ 
dences between truly nonrigid molecules and 
highly correlated atoms. 
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ISOMERIZING MOLECULES 

If spectra are one of our clearest windows on 
the microscopic world of molecules, one of the 
most elusive behaviors to "catch in the act" is 
molecular rearrangement or isomerization. How¬ 
ever, rapid progress [41-47] is being made on two 
prototype systems: the isomerization of HCP to 
HPC and that of HCN to HNC. In particular, a 
bifurcation of the type known as "tangent" or 
"saddle node" has been predicted from numerical 
studies [46] in connection with the approach of 
HCP to the isomerization barrier. Most interest¬ 
ingly, distinct changes in the spectrum, quite pos¬ 
sibly associated with this bifurcation, have re¬ 
cently been observed [46]. Calculated vibrational 
wave functions [47] on a potential surface also 
show interesting behavior. An exciting challenge 
will be the development of spectroscopic Hamilto¬ 
nians for isomerizing systems, including the de¬ 
scription of the new bifurcation phenomena. 

OF ‘FROZEN” PLANETS AND FERMI 

RESONANCES IN ATOMS 

There have been important developments in the 
study of collective behavior of the electrons in 
atoms since the rotor-vibrator supermultiplet 
model. Some highlights include investigation of 
molecular-type electronic wave functions, with 
distinct approaches pursued by Berry and co¬ 
workers [13, 14] and Rost and co-workers [16, 17]. 
Another development is the investigation of radial 
motion [15] and how radial motion fits [16,17] into 
the supermultiplet scheme. Finally, a most inter¬ 
esting development [18] is the prediction of "frozen 
planet" states. In these states, both electrons are on 
the same side of the nucleus! As illustrated in 
Figure 5, the outer, "frozen" electron hardly moves 
at all and the inner electron oscillates along the 
axis connecting the outer electron to the nucleus. 
Ostrovsky and Prudov [19] investigated the rela- 



FIGURE 5. “Frozen planet” states of a two-electron 
atom. Electron 2 is nearly frozen in place, while electron 
1 oscillates along the axis between electron 2 and the 
nucleus. See [18, 19, 48] for more discussion of frozen 
planet states. 


tive frequencies of radial and bending modes of 
the "molecular" atom in the frozen planet configu¬ 
ration. They predicted circumstances in which the 
modes are in an approximate 2:1 frequency ratio, 
suggestive of molecular Fermi resonance. This 
suggests an analogy with bifurcation phenomena 
in molecular vibrations. Interestingly, recent classi¬ 
cal calculations of Ostrovsky and Prudov [48] on 
the frozen planet states show phase space behavior 
similar to that observed by Li et al. [33, 34] in 
resonant molecular vibrations. Further interesting 
parallels may be expected between the frozen 
planet states and the isomerization of linear 
molecules, including the novel bifurcation phe¬ 
nomena in the latter. 


A Unified Model? 

We have seen that highly excited vibrations and 
rotations of molecules involve significant depar¬ 
tures from the orderly, rigid structures of the 
ground state, while highly excited states of elec¬ 
trons in atoms behave in some ways like highly 
nonrigid molecular structures. The relationships 
discussed so far consist of analogies between phys¬ 
ical descriptions that are rooted in very different 
formal descriptions: in the case of atoms, the O (4) 
hydrogenic shell structure, and in the case of 
molecules, the motion of atoms on a potential 
energy surface, abstracted in the spectroscopic 
Hamiltonian (SH). But if both types of systems 
display more and more features in common, such 
as collective modes, resonances with bifurcations, 
and multiple stable configurations, it is reasonable 
to ask. Can there be a unified formal description 
encompassing both types of system? These ques¬ 
tions are in the early stages of exploration, and I 
do not believe definitive answers or conclusions 
about the choice of formalism are anywhere near 
at hand. Nonetheless, I would like to discuss one 
possible approach to at least a partially unified 
description. This approach uses Lie algebras to 
describe the spectra of both atoms and molecules. 
For atoms, this is not surprising, given the promi¬ 
nence of the 0(4) algebra in the work on atoms 
already described. Perhaps more surprising, for¬ 
mally related approaches have been developed for 
molecular rotation-vibration spectra. One of these 
has come to be known as the vibron model [49-51]; 
another uses so-called quantum groups [52, 53]. I 
have used a vibron-type model for an investiga- 
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tion of the atomic supermul tip lets [54] that is de¬ 
scribed below. The reason that such models are 
candidates for a unified picture is that they start 
with a group theoretical shell structure, similar to 
the O (4) description of the atom, yet they encom¬ 
pass an effective rotation-vibration Hamiltonian, 
much like the standard spectroscopic Hamiltonian 
(SH). 

THE VIBRON MODEL 

The "vibron" model [49-51] is related to early 
work on rigid and nonrigid molecular systems in 
terms of correlation diagrams [55, 56]. The basic 
idea is to represent the rotation-vibration spec¬ 
trum of each bonded pair of atoms in a molecule 
in terms of representation of a continuous group. 
The group is the unitary group li(4), which is 
related to the O (4) rotation group described above 
for atoms. The unitary group can be thought of as 
the generalization of rotations to spaces of com¬ 
plex numbers. In the vibron model, a set of n 
bonded pairs is described by first coupling a sepa¬ 
rate Lf(4) for each bond: L/(4) 1 X U( 4) 2 X X 
U( 4)„. 

Then, subgroup chains are used to describe op¬ 
erators which resemble terms in the standard 
zero-order Hamiltonian (4). Couplings analogous 
but not identical to the Darling-Dennison or Fermi 
resonance couplings can also be defined in terms 
of the group operators. Although this procedure 
might appear far different from the SH, there is a 
term-by-term correspondence of a vibron Hamilto¬ 
nian to a similar Hamiltonian built in the standard 
way. It is therefore not surprising that sufficiently 
elaborate versions of the vibron model (with 
enough independent terms in the Hamiltonian) 
can give spectral fits of comparable accuracy to the 
SH. 

A RELATION TO THE SLPERMULTIPLETS? 

The vibron model also turns out to have a 
possible relation to the supermultiplet description 
of two-electron atoms. Despite the spectral pat¬ 
terns revealed by the supermultiplets and the com¬ 
pelling physical motivation, one fundamental as¬ 
pect of the supermultiplets was mysterious from 
the beginning. This was the formal group theoreti¬ 
cal basis of the supermultiplet classification. The 
supermultiplets were constructed by collecting to¬ 
gether the approximate two-electron O (4)' 12 multi- 
plets into larger entities—the supermultiplets. The 


mystery was that despite the beauty of their struc¬ 
ture and "molecular" interpretation the supermul¬ 
tiplets did not appear to correspond to any larger 
mathematical group structure encompassing the 
two-electron 0(4)' 12 . In a recent article [54], I was 
able to shed some light on this by embedding the 
0(4) group structure in a larger group, U(4). The 
ii(4) lends its structure to a larger organization 
scheme, the supermultiplets, than can be obtained 
from 0(4) alone. This embedding of the atomic 
problem in a U( 4) structure is similar to the way 
the vibron model embeds the molecular states. In 
both cases, the suitability of the embedding seems 
related to the observation that the pattern of corre¬ 
lation from nonrigid to rigid states in linear 
molecules [56] results in groupings analogous to 
the atomic supermultiplets. 

FUTURE PROSPECTS 

One aspect of the 11(4) embedding of the super¬ 
multiplets is still puzzling, but possibly connected 
to important future insight into the atomic prob¬ 
lem and maybe even isomerizing molecules. This 
is that the U(4) embedding leads to "extra" states 
beyond those obtained from the 0(4) a X 0(4) 2 
structure assumed in starting from the single-par¬ 
ticle "planetary" model of the atom. In the 11(4)! 
X lK4) 2 scheme [54], I took pains to try to ratio¬ 
nalize away these extra states. However, it would 
not be necessary to do this if the "extra" states 
really were new physical states not accounted for by 
assuming an O (4) x X O (4) 2 planetary structure as 
a starting point. Is there any evidence for such 
states? Some but not all of the "frozen planet" 
states really appear to be new states. This is not 
surprising, because it is hard to see why frozen 
planet states would originate from an 0(4) x X 
0(4) 2 independent particle picture. Suggestively, 
the first frozen planet state in a shell corresponds 
to the highest ordinary supermultiplet state in the 
shell, i.e., the 1 S e supermultiplet state with the 
most bending excitation. It is as if the frozen planet 
sequences are taking off from where the shell-based 
supermultiplets leave off—hinting that both are 
part of some larger classification. Of course, this 
does not prove that the frozen planet states are the 
"extra" states of the 11(4)! X LT(4) 2 scheme. In¬ 
deed, that scheme predicts many more new states 
than the frozen planet states already known. It is 
unknown how many dynamically stable states will 
eventually be discovered or what the systematics 
of their classification will look like. However, it is 
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an exciting possibility to try to use group theoreti¬ 
cal schemes, such as the li(4) 1 X U( 4) 2 model, to 
make predictions for computational and experi¬ 
mental searches for novel states. 

Another challenge is the development of spec¬ 
troscopic Hamiltonians for isomerizing systems, 
including the description of the novel bifurcation 
phenomena now being uncovered in experiments 
and numerical studies. This opens up many excit¬ 
ing possibilities for the development not only of 
spectroscopic Hamiltonians of the standard molec¬ 
ular form, but also effective Hamiltonians using 
Lie algebraic methods. It will be fascinating in 
coming years to watch the interplay of these new 
developments in highly excited electron correla¬ 
tion in atoms and highly excited, nonrigid 
molecules. These undertakings are likely to lead to 
a new understanding of fundamental problems of 
correlation, of general interest not only for atoms 
and molecules but throughout the many areas of 
physical science where correlation problems are 
important. 

ACKNOWLEDGMENT 

Support of the Department of Energy under 
Basic Energy Sciences Grant DE-FG06-92ER14236 
is gratefully acknowledged. 


References 

1. M. E. Kellman, Annu. Rev. Phys. Chem. 46, 395 (1995). 

2. M. E. Kellman, Proc. Natl. Acad. Sci. U.S.A. 93, 14287 
(1996). 

3. M. E. Kellman and D. R. Herrick, J. Phys. B 24, L755 (1978). 

4. D. R. Herrick and M. E. Kellman, Phys. Rev. A 21, 418 
(1980). 

5. D. R. Herrick, M. E. Kellman, and R. D. Poliak, Phys. Rev. 
A 22, 1517 (1980). 

6. M. E. Kellman and D. R. Herrick, Phys. Rev. A 22, 1536 
(1980). 

7. B. G. Wybourne, Classical Groups for Physicists (Wiley, New 
York, 1974). 

8. E. Chacon, M. Moshinsky, O. Novaro, and C. E. Wulfman, 
Phys. Rev. A 3, 166 (1971). 

9. C. E. Wulfman, Chem. Phys. Lett. 23, 370 (1973). 

10. O. Sinanoglu and D. R. Herrick, J. Chem. Phys. 62, 886 
(1975). 

11. D. R. Herrick and O. Sinanoglu, Phys. Rev. A 11, 97 (1975). 

12. M. E. Kellman, Phys. Rev. Lett. 55, 1738 (1985). 


13. J. L. Krause and R. S. Berry, J. Chem. Phys. 83, 5153 (1985). 

14. J. E. Hunter III and R. S. Berry, Phys. Rev. A 36, 3042 (1987). 

15. G. S. Ezra, K. Richter, G. Tanner, and D. Wintgen, J. Phys. B 
24, L413 (1991). 

16. ]. M. Rost, R. Gersbacher, K. Richter, J. S. Briggs, and D. 
Wintgen, J. Phys. B 24, 2455 (1991). 

17. J. M. Rost and J. S. Briggs, J. Phys. B 24, 4293 (1991). 

18. K. Richter, J. S. Briggs, D. Wintgen, and E. A. Solov'ev, J. 
Phys. B 25, 3929 (1992). 

19. V. N. Ostrovsky and N. V. Prudov, Phys. Rev. A 51, 1936 
(1995). 

20. G. Herzberg, Infrared and Raman Spectra (Van Nostrand, 
New York, 1950). 

21. L. E. Fried and G. S. Ezra, J. Chem. Phys. 86, 6270 (1987). 

22. E. L. Sibert III, J. Chem. Phys. 90, 2672 (1989). 

23. B. T. Darling and D. M. Dennison, Phys. Rev. 57,128 (1940). 

24. E. Fermi, Z. Phyz. 71, 250 (1931). 

25. M. E. Kellman, J. Chem. Phys. 93, 6330 (1990); M. E. Kell¬ 
man and G. Chen, J. Chem. Phys. 95, 8671 (1991). 

26. B. C. Smith and J. S. Winn, J. Chem. Phys. 94, 4120 (1991). 

27. D. M. Jonas, S. A. B. Solina, B. Rajaram, R. J. Silbey, R. W. 
Field, K. Yamanouchi, and S. Tsuchiya, J. Chem. Phys. 99, 
7350 (1993). 

28. M. A. Temsamani and M. Herman, J. Chem. Phys. 102, 6371 
(1995). 

29. M. A. Temsamani and M. Herman, J. Chem. Phys. 105, 1355 
(1996). 

30. S. A. B. Solina, J. P. O'Brien, R. W. Field, and W. F. Polik, 
Ber. Bunsenges. Phys. Chem. 99, 555 (1995). 

31. S. A. B. Solina, J. P. O'Brien, R. W. Field, and W. F. Polik, J. 
Phys. Chem. 100, 7797 (1996). 

32. M. A. Temsamani, M. Herman, S. A. B. Solina, J. P. O'Brien, 
and R. W. Field, J. Chem. Phys. 105, 11357 (1996). 

33. Z. Li, L. Xiao, and M. E. Kellman, J. Chem. Phys. 92, 2251 
(1990). 

34. L. Xiao and M. E. Kellman, J. Chem. Phys. 93, 5805 (1990). 

35. J. Svitak, Z. Li, J. Rose, and M. E. Kellman, J. Chem. Phys. 
102, 4340 (1995). 

36. Z.-M. Lu and M. E. Kellman, Chem. Phys. Lett. 247, 195 
(1995). 

37. Z.-M. Lu and M. E. Kellman, J. Chem. Phys. 107, 1 (1997). 

38. J. P. Rose and M. E. Kellman, J. Chem. Phys. 105, 7348 
(1996). 

39. S. Keshavamurthy and G. S. Ezra, Chem. Phys. Lett. 259, 81 
(1996). 

40. S. Keshavamurthy and G. S. Ezra, J. Chem. Phys, 107, 156 
(1997). 

41. K. K. Lehmann, S. C. Ross, and L. L. Lohr, J. Chem. Phys. 
82, 4460 (1985). 

42. Y.-T. Chen, D. M. Watt, R. W. Field, and K. K. Lehmann, J. 
Chem. Phys. 93, 2149 (1990). 

43. X. Yang, C. A. Rogaski, and A. M. Wodtke, J. Chem. Phys. 
92,2111 (1990). 


408 


VOL. 65, NO. 5 



NONRIGID SYSTEMS IN CHEMISTRY 


44. X. Yang, C. A. Rogaski, and X. Yang, J. Opt. Soc. Am. B 7, 
1835 (1990). 

45. H. Ishikawa, C. Nagao, N. Mikami, and R. W. Field, J. 
Chem. Phys. 106, 2980 (1997). 

46. S. C. Farantos, H.-M. Keller, R. Schinke, K. Yamanuchi, and 
K. Morokuma, J. Chem. Phys. 104, 10055 (1996). 

47. J. M. Bowman and B. Gazdy, Chem. Phys. Lett. 237, 27 
(1995). 

48. V. N. Ostrovsky and N. V. Prudov, J. Phys. B 30, 151 (1997). 

49. R. D. Levine and C. Wulfman, Chem. Phys. Lett. 60, 372 
(1979). 


50. F. Iachello and R. D. Levine, Algebraic Theory of Molecules 
(Oxford University Press, New York, 1994). 

51. S. Oss, Adv. Chem. Phys. 93, 455 (1996). 

52. D. Bonatsos and C. Daskaloyannis, Phys. Rev. A 48, 3611 
(1993). 

53. D. Bonatsos, C. Daskaloyannis, and P. Kolokotronis, J. 
Chem. Phys. 106, 605 (1997). 

54. M. E. Kellman, Phys. Rev. Lett. 73, 2543 (1994). 

55. M. E. Kellman, F. Amar, and R. S. Berry, J. Chem. Phys. 73, 
2387 (1980). 

56. M. E. Kellman, Chem. Phys. 48, 89 (1980). 


INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY 


409 



Kinetic Energy Functional Derivative 
for the Thomas-Fermi Atom in 
D Dimensions 


NORMAN H. MARCH/* SABRE KAIS 2 

1 

Oxford University, Oxford, England 

2 Department of Chemistry, Purdue University, West Lafayette, Indiana 47907 
Received 26 March 1997; accepted 29 April 1997 


ABSTRACT: The self-consistent Thomas-Fermi atom satisfying Poisson's equation in 
D dimensions has a functional derivative of the kinetic energy T with respect to the 
ground-state density n(r) proportional to n 2/D . But the Poisson equation relates n l ~ 2/D 
to "reduced" density derivatives n/dr \ Thus ST/Sn can be written also, quite 

compactly, solely in terms of these derivatives. An analytic solution to the Thomas-Fermi 
equation in D dimensions can be presented as an expansion about the known analytic 
solution at D = 2. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 411-413, 1997 


Introduction 



h 2 D 

2 m(D + 2 ) 


' D ' 


2/D 


(i) 


T here is a good deal of interest currently in the 
role played by electron density gradients in 
density functional theory [1]. The purpose of the 
present study is to explicitly demonstrate that the 
self-consistent Thomas-Fermi (TF) atom discussed 
in D dimensions by Kventsel and Katriel [2, 3] has 
the following property. The kinetic energy T, of 
the form [4] 


where the constant K D satisfies the recurrence 
relation 


K 


D 4- 2 


K 


D 


2irD ' 




1 

2tt 


( 2 ) 


has a functional derivative 8T/8n that can be 
written either in the conventional "local" form 


r (D) = 4 D) j n x + 2/D dT, 
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8T (D) l 2 
~Sn~ C + D 


4 D >n 2 / D 


(3) 


or in a quite equivalent form in terms solely of 
n~ 1 (dn/dr) and n~ 1 (d 2 n/dr 2 ). 


International Journal of Quantum Chemistry, Vol. 65, 411-413 (1997) 
© 1997 John Wiley & Sons, inc. 


CCC 0020-7608 / 97 / 050411 -03 



MARCH AND KAIS 


Differential Equation for 
Ground-State Density 


The first step in the proof of the above state¬ 
ment is to establish the differential equation satis¬ 
fied by the ground-state density n(r) in D dimen¬ 
sions. The starting point to do this is the D-dimen- 
sional Poisson equation [1, 2], which from phase 
space considerations takes the form 


d 2 ^ D - 1 dV 

—_ _|_ - 

dr r dr 


/3 D ^ D/2 . 


(4) 


Here the chemical potential fi , constant through¬ 
out the entire electronic charge cloud of the atom, 
and the self-consistent potential energy V(r), are 
related to in Eq. (4) by 


fi-V = eV 


( 5 ) 


while the constant p D is given by 


f3 D = 87re 


2 77 me \ D/ " 2 


h 2 


F11 + 


D 


( 6 ) 


Explicitly, for a point charge of magnitude Ze, 
'P(r) behaves as r 0 as 


'f(r) 


47 r Ze 

(D — 2)fl D 


,2-D 


D > 3 (7) 


with 


a 


D 


Dt7 D / 2 


r 



( 8 ) 


The integral of n(r) through all space diverges 
for D > 4, but as one of us has pointed out previ¬ 
ously [3], this trouble can be "cured" by introduc¬ 
ing a finite nucleus. 

Evidently, from Eq. (4), the electron density 
n(r ) is related to ^(r) by [4] 

n(r) = — V D/2 . (9) 

4:77 e 

Writing Eq. (9) as 'P(r) a n(r) 2/D , one can sub¬ 
stitute for ^(r) in Eq. (4) to obtain the differential 
equation satisfied by the electron density n(r) in 
D dimensions. After a short calculation, this can be 


obtained in the form 

'l d 2 n D - 1 / 1 dn\ 12 1/1 dn\ 2 

n dr 2 r \n dr) \D J\n dr J 

= -(• (10) 

This generalizes the known result [5] for the 
original TF atom with D — 3, namely 


1 

£ 

CM 

> 

1 1 

Vn \ 2 ' 

n 1/3 

i 

1 ~ 4 ' 


n 

" 3 

« ) 

= ~T' 



( 11 ) 

since from Eq. (6), j8 3 = (327r 2 e/3h 3 )(2me) 3/2 , to D 
dimensions. 


Analytic Solution to the TF Equation in 
D Dimensions 


Using the spatial dimension D as a free param¬ 
eter has been widely employed to obtain an ap¬ 
proximate results for D = 3 for a variety of physi¬ 
cal problems [6]. The TF equation in D = 3 is a 
nonlinear equation. But the case D = 2 from Eq. 
(9) yields a linear equation which has been solved 
previously in terms of Bessel functions [2]. 

Recently, Bender et al. [7] have proposed a new 
approach to the analytic solution of a variety of 
nonlinear problems. Laurenzi [8] has shown that 
one can use this approach to obtain an analytic 
solution for the TF equation. 

In this section we would like to show that one 
can use the D-perturbation expansion method to 
obtain an analytic solution for the D dimensional 
TF equation. We do this by replacing Eq. (4) with 


d 2 V 26 + 1 df 
dx 2 x dx 




( 12 ) 


where x = /3^ /2 r and e = (D — 2)/2 is a measure 
of the nonlinearity of the TF equation. The poten¬ 
tial M* is then expanded in a power series in e: 

V = % + e% + e 2 % + •••. (13) 


This, in turn, produces a set of inhomogeneous 
equations for the perturbation functions 


d 2 % 

dx 2 


1 'Vn 
x dx 


- % = X „( x ) 


(14) 
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where the first few inhomogeneous terms are given 
by: 

Xo = 0 . ( 15 ) 

2 d% 

(16) 

x ax 

2 1 

*2 = % ln% - - + % + -% ln% 2 . (17) 

The zero-order equation, which is the TF equa¬ 
tion at D = 2, has been solved previously in terms 
of modified Bessel functions I 0 (x) and K 0 (x ) 

% = a a I 0 (x) + a 2 K 0 (x), (18) 

where the constants oq and a 2 can be fixed by the 
boundary conditions at zero and infinity [2]. By 
eliminating the first derivative d x ¥ n /dx in the set 
of linear equations, Eq. (14), one can use the Green's 
function approach to construct a general solution 
for [8]. This generalizes to D dimensions the 
previous work of Laurenzi [8] for the three-dimen¬ 
sional solutions of the TF equation. 


Functional Derivative of Kinetic Energy 
T in D Dimensions 


Returning to Eq. (3), one can raise Eq. (9) to the 
power 2/(D — 2) to obtain a different, though, of 
course, equivalent, form of the functional deriva¬ 
tive 8T^ D) /8n(r). The result is 


ST (D) 

8n(r) 


-(D) 


1 d 2 n (D~ 1 \ 1 dn 

+ 


n dr 2 

2 


1 dn 

+ 1 — — 1 

D l\ n dr 


r i n dr 

212 / 0-2 


, (19) 


where 


-(D) _ 


2 \ (2 \ 2 / (D ~ 2) 

1+ d) c Hd) 


X(477e)" 2/D /3y D(2 - D >. (20) 


This is the main result of the present work and 
generalizes to D dimensions the previous work of 
one us for the three-dimensional TF neutral atom, 
namely 


<5T (3) 
8n(r ) 



’ v 2 fi 

i - 

Vn \ 2 ' 

n 

’ 3' 

» / 


( 21 ) 


Conclusion 


The conclusion is that for the self-consistent TF 
atom in D dimensions, the functional derivative of 
the kinetic energy T can be expressed in terms of 
"reduced" density gradient terms (1 /n)(dn/dr) 
and (1 /n)(d 2 n/dr 2 ). The local form (3) and the 
density gradient (nonlocal) form (21) are, of course, 
entirely equivalent. This is because of the differen¬ 
tial Eq. (10) satisfied by the self-consistent 
ground-state electron density in the TF atom in D 
dimensions. 
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ABSTRACT: A method for computing the (molecular) detour matrix is proposed. 
© 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 415-419, 1997 


T he detour matrix was introduced into the 
mathematical literature in 1969 [1] and 25 
years later into the chemical literature under the 
name of the maximum path matrix of a molecular 
graph by Ivanciuc and Balaban [2]. 

The detour matrix can be used to compute the 
so-called detour index [3, 4] in the same way as 
the distance matrix [5] can be employed to gener¬ 
ate the Wiener index [6], The detour index, which 
is a Wiener-like index, was also introduced by 
Ivanciuc and Balaban [2] as the half-sum of the 
maximum path sums and independently by John 
[7] and us [3]. Lukovits, who introduced the term 
"the detour index," studied the properties of this 
index [4, 8]. He was also first to use this index in 
structure-property modeling [4]. The present re¬ 
port was stimulated by Lukovit's remark [8a]: "The 
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usefulness of the detour index is deflated by the fact that 
to date no method (but inspection ) is available to 
compute this index." Or, in other words, there is no 
method available to compute the detour matrix 
but by hand. In attempting to remedy the above, 
we propose here a computer method for construct¬ 
ing the detour matrix for graphs of moderate sizes 
and a limited number of cycles. 

The detour matrix A = A(G) of a labeled con¬ 
nected (molecular) graph G is a real symmetric 
N X N matrix whose (/, ;)-entry is the length of 
the longest path [9a] from vertex i to vertex j: 


(A ),*/ 


(<Tij if i * j 
\0 if i=j’ 


( 1 ) 


where er /; . is the number of steps in a longest path 
(i.e., the maximum number of edges) in G between 
vertices i and j. 

This definition is exactly the "opposite" of the 
definition of the traditional graph-theoretical dis¬ 
tance matrix, whose off-diagonal entries are the 
lengths of the shortest paths between the vertices 
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in G [5, 9b]: 


(D)*7 


dq if i ¥= j 
0 if i — j 


( 2 ) 


where d { • is the number of steps in a shortest path 
(i.e., the minimum number of edges) in G between 
vertices i and /. As examples, the distance and 
detour matrices for a simple bicyclic labeled graph 
G are given in Figure 1. 

The detour index II is defined in the same as is 
the Wiener index W [10], i.e., the detour index is 
equal to the half-sum of the elements of the detour 
matrix A [2-4]: 


n=(l/2)£E(A)ij (3) 

' i 

W= (1/2) E E(D)-7. (4) 

i j 

The Wiener index and the detour index for a 
bicyclic graph G are also given in Figure I. 

It is obvious from (3) and (4) that the Wiener 
index and the detour index are identical for acyclic 
structures. For polycyclic structures, they are not 
particularly intercorrelated indices. For example, 
the linear correlation between W and fl for a set of 
37 diverse polycyclic structures has a modest cor¬ 
relation coefficient (r = 0.79) [3a]. 


The computation of the detour matrix is a non¬ 
trivial problem of long standing [11]. We propose a 
procedure for computing the (molecular) detour 
matrix from the (molecular) distance matrix since 
there are several algorithms available in the litera¬ 
ture for computing the distance matrix [9b]. We 
will also use the adjacency matrix to generate the 
distance matrix. The procedure is based on the 
computation of distance matrices for sets of se¬ 
lected subgraphs of a considered graph G. These 
subgraphs are created from G by removing succes¬ 
sively edges whose vertices are at least divalent 
until the set of spanning trees is reached. For each 
subgraph, the distance matrix is computed and is 
compared to the starting distance matrix corre¬ 
sponding to G. If any of the distances d i - in the 
newly created distance matrix is larger than the 
corresponding element (d f -) 0 in the initial distance 
matrix, then (d i; ) 0 should be replaced with d { -. 
The final distance matrix, obtained after the con¬ 
sidered set of subgraphs is inspected, is the detour 
matrix. After the detour matrix is obtained, the 
computation of the detour index is straightfor¬ 
ward. 

The block diagram of our computational proce¬ 
dure is given in Figure 2. As an illustrative exam¬ 
ple, we computed the detour matrix for the same 
bicyclic graph G that was considered before in 
Figure 1. This computation is shown in Figure 3. 


©— 

—© 

©— 

—© 


G 




0 1 2 3 2 1 
10 12 12 

2 10 12 3 

3 2 10 12 
2 12 10 1 
1 2 3 2 1 0 

W = 25 


0 5 4 5 4 5 

5 0 5 4 3 4 

4 5 0 5 4 5 

5 4 5 0 5 4 

4 3 4 5 0 5 

5 4 5 4 5 0 

Q = 67 


FIGURE 1 . The distance matrix (D) and the detour matrix (A) of a labeled bicyclic graph G. 
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FIGURE 2. A block-diagram of the computer program. 
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1 2 3 


6 5 4 

G 

1) Generate the adjacency matrix A 0 and calculate distance matrix D 0 of graph G. 


0 

1 

0 

0 

0 

— 

1 


0 

i 

2 

3 

2 

1 

1 

0 

1 

0 

1 

0 


1 

0 

1 

2 

1 

2 

0 

0 

1 

0 

0 

1 

1 

0 

0 

1 

0 

0 

o 

o 

II 

2 

3 

1 

2 

0 

1 

1 

0 

2 

1 

3 

2 

0 

1 

0 

1 

0 

1 


2 

1 

2 

1 

0 

1 

1 

0 

0 

0 

1 

0 


L i 

2 

3 

2 

1 

0 


2 ) 


Find edges e(G) with both vertices at least 
edges (m = 1). 


edge 

vertices 

valencies 

1 

1-2 

2-3 

2 

1-6 

2-2 

3 

2-3 

3-2 

4 

2-5 

3-3 

5 

3-4 

2-2 

6 

4-5 

2-3 

7 

5-6 

3-2 


divalent and copy them to array of m-tuples of 



7 6 

edge-labels 


3) For every m-tuple, disconnect edge(s) in the m-tuple, disregarding graphs that contain 
disconnected vertices and paths of length one. 


labels' 6 (1) (2) <3) (4) (5> (6) (7) 

m-j = 1 n-i (number of m-tuples) = 7 


4) From m-tuples, form the (m+1 )-tu pies by adding the edges (found in step 2) with the edge 
serial numbers higher than the serial number of the last m-tuple edge. 




(1.3) 



(2.6) 




0.4) 


(1.5) 


( 1 . 6 ) 


(2.3) 


(2,4) 


(3.4) 


(3.7) 


(4.5) 


(4.6) 


(4,7) 


(6.7) 

m 2 (number of m+1-tuples) = 2 n 2 (number of m+1 -tuples) = 15 

(a) 


(2,5) 


(5,7) 


FIGURE 3. Computation of the detour matrix for the bicyclic graph G using the proposed method. 
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5) Copy (m+1 )-tuples to array of m-tuples, and go to the step 3. Continue repeating steps 3 and 
4 until spanning trees disconnecting level is achieved (i.e., m+1 = cyclomatic number). 

In the present example, there is no need to go back to the step 3. 


6) Calculate the distance matrix D for every subgraph obtained by m+1 disconnections. If any 
of the distances dy is larger than the corresponding element in D 0 , replace (dy) 0 with dy. 


7) The detour matrix A is equal to the final distance matrix. 


A = 


0 5 4 5 4 5 
5 0 5 4 3 4 

4 5 0 5 4 5 

5 4 5 0 5 4 

4 3 4 5 0 5 

5 4 5 4 5 0 


8) Calculate the detour index, 
n = 67 

(b) 

FIGURE 3. (Continued). 


The CPU time need to construct the detour matrix 
for G in Figure 3 was 1 s on the workstation 
HP-735. 
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ABSTRACT: The classical theory of chemical valence, first, is naturally formalized in 
mathematics in the area of graph theory and, second, finds an extension in quantum 
mechanics in terms of the Heitler-London-Pauling "valence-bond" (VB) theory. Thus, 
VB theory stands in a fairly unique position, although in quantum chemistry, there often 
has been a preference for the alternative (perhaps even "complementary") molecular 
orbital (MO) theory, presumably in large part because of computational efficacy for 
general molecular structures. Indeed, as formulated by Pauling and others, VB theory 
was described as a configuration interaction (Cl) problem when there were multiple 
relevant classical valence structures for the same molecular structure. Also, as now 
recognized, a direct assault on Cl is computationally intensive, prone to size-inconsistency 
problems, and effectively limited to smaller systems—whereas indirect approaches, e.g., 
via wave-function cluster expansions or renormalization-group theory, often neatly avoid 
these problems. Thus, what is (and perhaps always has been) needed is "many-body" 
schemes for VB computations (as well as for higher-order MO-based approaches, too). 
Here, then, certain such many-body VB-amenable computational schemes are to be 
discussed, in the context of semiempirical (explicitly correlated) graphical models. The 
collection of models are described and interrelated in a fairly comprehensive systematic 
manner. A selection of many-body cluster expansion methods are then discussed with 
special reference to resonating VB wave functions and the fundamental graph-theoretic 
nature of the consequent problems (such as also are noted to arise in lattice-discretized 
statistical-mechanical problems, too). Some examples are described incorporating 
resonance among exponentially great numbers of VB structures as applied: for large 
icosahedral-symmetry fullerenic structures, for the (polyacetylenic) linear chain, and for 
ladderlike conjugated polymers. It is contended that practicable many-body VB-theoretic 
methods are now available, retaining clear links to classical chemical valence theory. 
Hopefully, too, these methods may soon find use beyond the semi-empirical framework. 
© 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 421-438, 1997 
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1. Introductory Survey 

T he idea of valence structures goes back to 
classical chemistry. Also, indeed, then it was 
at the heart of chemistry, although the theory was 
primitive in a quantitative mode. There are both 
brief surveys [1] encompassing this history, as well 
as whole books on the subject [2]. 

Soon after the advent of quantum mechanics, 
several valence-bond developments were made: 

■ First, with the ab initio H 2 computations of 
Heitler and London [3] who presented a fun¬ 
damental quantum mechanical description 
of the covalent bond; 

■ Second, with Rumer's [4] realization of a 
basis of such covalent electronic configura¬ 
tions in suggestive correspondence with 
classical chemical-bonding structures; 

■ Third, with Pauling's identification [5] of ori¬ 
ented hybrid orbitals as precursors for bond 
orbitals such as could be used in conjunction 
with Rumer basis functions; and 

■ Fourth, with Pauling and others' develop¬ 
ment [6] of such Rumer-type valence-bond 
bases as a foundation for a classically con¬ 
nected theory of molecular electronic struc¬ 
ture with extension to incorporate resonance 
to describe delocalized bonding. 

Thus, a quantum mechanically founded va¬ 
lence-bond (VB) theory had auspicious beginnings, 
although as reviewed just a bit more completely 
elsewhere [7, 8], a slow period of development 
followed. The alternative molecular orbital (MO) 
theory began development roughly at the same 
time by Hund, Huckel, and Mulliken, but contin¬ 
ued with further theoretical developments by 
Coulson and co-workers while a significant com¬ 
putational advance was made by Roothaan and 
Flail in the form of a discrete-orbital-basis 
self-consistent-field theory. Several supposed diffi¬ 
culties with VB, both computationally and concep¬ 
tually (e.g., involving nonorthogonality and size- 
consistency), were brought to attention. Presum¬ 
ably, largely due to the greater computational effi¬ 
cacy of MO theory, it came to dominance—then 
further various semiempirically formulated MO- 
based theorems were developed, e.g., by Longuet- 


Higgins and Dewar, and, finally, the MO-based 
Woodward-Hoffmann rules only further en¬ 
hanced this dominance of MO theory. The fact that 
VB theory related so nicely in a conceptual (but 
seemingly noncomputational) way back to classi¬ 
cal chemical-bonding ideas may have actually dis¬ 
inclined some researchers to this approach—it be¬ 
ing viewed that the numerically correct answers 
should come out of "pure" quantum mechanics 
(i.e., solely out of physics) without folding in any 
classical chemical concepts, at least if one expends 
sufficient computational effort. 

Moreover, in physics, many-body methods were 
developed largely in terms of the MO (or band- 
orbital-theoretic or momentum-space) picture. 
Some conceptual problems concerning size- 
consistency were resolved in this context and re¬ 
lated to powerful computational procedures: 
many-body perturbation theory, Green's-function 
decoupling techniques, and cluster-expanded wave 
functions. Also, ultimately, these computational 
techniques were all adapted (in the 1970s and 
1980s) for extensive ab initio molecular computa¬ 
tions. Also, MO-related density functional theory 
was developed in a powerful computational for¬ 
mat. 

Through the 1950s, 1960s, and 1970s, there were 
still a few researchers in the area of VB theory 
(including Coulson, Daudel, Gallup, Goddard, 
Lowdin, Matsen, McConnell, McWeeny, Ooster- 
hoff, and Simonetta, although most of these re¬ 
searchers had much diversified interests). But es¬ 
pecially starting in the 1980s (primarily with a new 
group of researchers), it became clear that VB 
theory is far from dead. On the ab initio side, there 
are the especially notable accurate small-molecule 
computations of Gerratt and co-workers [9,10]. On 
a more theoretical side, Anderson [11] suggested 
that a VB picture may be crucial to understanding 
high-temperature superconductivity, so that now 
there are a variety of formal many-body resonating 
VB developments made by physicists, e.g., as re¬ 
viewed in [12]. Also, on the semiempirical chemi¬ 
cal side, there have been (and are reviewed [13]) 
many developments, amusingly often linking up 
with the ideas now developed in conjunction with 
high-temperature superconductivity. 

Now, it seems that there are many-body tech¬ 
niques which are fairly widely recognized to en¬ 
compass either MO- or VB-based approaches. One 
such is the renormalization-group-theoretic ideas, 
although an efficacious general computational 
scheme for general molecular structures has yet to 
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have reached any general recognition. Another 
powerful computational approach is via the Monte 
Carlo statistical techniques, which, again, have yet 
to reach any general utility, although in the hands 
of experts, there are some high-accuracy results on 
selected systems. Also, more and more, it seems 
that techniques earlier thought to be MO-based 
can be formulated as well in a VB-format. 

Here, wave-function cluster expansions devel¬ 
oped in the context of semiempirical VB-theoretic 
models are to be considered. Notably, it is be¬ 
lieved that the semiempirical context is not neces¬ 
sary. But before describing the different types of 
cluster expansions based on local structures, some 
further general theoretical characterization of VB- 
theoretic ideas in the next section is appropriate. 
Beyond the description of different types of locally 
represented cluster-expanded wave functions. Sec¬ 
tion 3 goes on to indicate the form of the overlap 
and Hamiltonian matrix elements, as generating 
functions for the counting of certain types of sub¬ 
graphs of the parent molecular graph. Section 4 
notes certain qualitative features of the different 
types of wave functions, with attention to the 
structural circumstances for best applicability and 
to types of long-range order which can arise. Fi¬ 
nally, Section 5 reports and compares numerical 
results on three different types of structures: icosa- 
hedral-symmetry fullerenes, the infinite linear 
(polyacetylenic) chain, and the infinite ladder. 


2 . VB Theory: Bases, Models, 
and Resonance 

As the first step, some of the nomenclature and 
framework should be clarified. A VB basis consists 
of configurations of spin-paired localized orbitals, 
at least for the overall spin singlet case, while for 
higher spins, there is augmentation by unpaired 
spins. Typical VB spin-pairing diagrams in corre¬ 
spondence with the basis elements for the case of 
six electrons in six orbitals (as for the six occupied 
atomic 7 r-orbitals of benzene) appear as in Figure 1 
and display an evident close analogy to the pri- 


1 








FIGURE 1 . Five independent covalent VB structures 
for six orbitals. These VB structures are also the Rumer 
structures. 


mary classical valence structures for benzene, espe¬ 
cially if augmented with the rest of the spin¬ 
pairing diagram for the cr-orbitals. Rumer [4] 
focused on the singlet case with orbitals singly 
occupied, it being recognized that the extension to 
states with doubly occupied orbitals is straightfor¬ 
ward. Also, Pauling [6] pointed out early the ready 
extension to the overall spin doublet case, and the 
approach taken hinted at further extension, e.g., as 
done in Simonetta's group [14]. But it is to be 
emphasized that there are other possibilities—an¬ 
ticipated to be most significant when the VB struc¬ 
tures as formalized by Rumer, Weyl, and Teller 
[15] do not correspond so closely to classical VB 
structures. I.e., this Rumer basis is built in terms of 
pairings around a cycle, and in many cases, a 
chemical structure may have little to do with a 
simple cycle. More important from a chemical 
viewpoint should be spin-pairing patterns involv¬ 
ing pairing between nearest pairs of localized or¬ 
bitals, independently of a relation to the formal 
basis of Rumer et al. [15]. 

Granted a VB basis, a semiempirical model rep¬ 
resented on such a basis often is described as a 
"VB" model. Of course, such models can be repre¬ 
sented on any basis spanning the same space, 
although the semiempirical integral approxima¬ 
tions are (usually) motivated from considerations 
in terms of the localized (atomic) orbitals as they 
appear in VB basis states. Here, a VB model is 
taken to be a model where the atomic orbitals 
which appear are viewed to be non-orthogonal. 
One such circumstance is the VB model of Pauling 
and Wheland [16] expressed by the covalent space 
of configurations built from singly occupied or¬ 
bitals. Indeed, the model so developed turns out to 
be essentially the Heisenberg spin Hamiltonian, 
more often expressed by the basis of what is essen¬ 
tially Slater determinants of atomic orbitals, and 
each determinant is distinguished solely via the 
spin assignment. Now, most commonly, the model 
is re-represented equivalently to act just on spin 
space; hence, the Hamiltonian takes the general 
form 


H = Y,J P P spin (2.1) 

p 

where P spin are spin space permutations and J P 
are parameters. The simplest (and usual) approxi¬ 
mation retains just the terms involving a transposi¬ 
tion P = (ij) exchanging the ith and jth indices; 
hence, the exchange parameter can [17] be approx- 
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imated to be of the form 

hi) = {ES, 7 -m(ij)Hm}/{l - Sf t }, (2.2) 

with E = (<F|H|0>, S f y = (OKz/OI^), and $ most 
simply being taken as a product of nonorthogonal 
atomiclike orbitals, one assigned to each center. 
Further, the interaction may be represented in 
terms of ordinary spin operators: 

( ij) spin = 2s { • Sj + 1/2 

= 2 sfsf + sfs]~ + s“s+ +1/2. (2.3) 

Then, the Hamiltonian (with a further shift of the 0 
of energy) is written as 

H = LJ^s-Sj, (2.4) 

i~j 

which is the more commonly presented form; 
hence, it is known as the Heisenberg spin Hamilto¬ 
nian, often with i and j restricted to nearest neigh¬ 
bors. Evidently, reference to Rumer or VB bases is 
now quite hidden, and many other bases are used. 
Bases involving "spin-waves" are not uncommon 
in physics, but working with representations on 
such alternative bases can lead to nonlocally inter¬ 
preted wave functions, and results can end up 
being rather far divorced from apparent classical 
chemical connection and interpretation. 

A variety of VB models arise, with possibly the 
most natural hierarchy indicated in Figure 2. There, 
also, a systematic derivational scheme is indi¬ 
cated, based on successive steps which are either 
restrictions to ever smaller spaces or orthogonal- 
izations of natural (initially nonorthogonal) many- 
particle bases for these spaces. It should be noted 
that older criticisms concerning non-orthogonality 
or size-consistency "catastrophes" connected with 
several of the intermediate models are largely ir¬ 
relevant or misleading, as addressed elsewhere 
[18]. 

Resonating VB theory should make explicit use 
of VB states, especially the more chemical corre¬ 
sponding to the more local spin-pairing patterns. 
This really is the focus of this review, especially 
for the many-body case. The use of the phrase VB 
theory (without the adjective resonating ) should be 
more general, but it might best be taken to de¬ 
scribe a treatment with connection back to the 
classical chemical-bonding ideas, using a VB basis 
adapted to the molecular structure under consider¬ 
ation, although, often, researchers refer to any- 







FIGURE 2. A hierarchy of VB-theoretic models. 


thing using the Rumer basis as "VB theory," i.e., 
when a system has a structure little related to a 
single cycle, then the standard Rumer basis often 
does not include many locally spin-paired struc¬ 
tures. The Pauling-Wheland resonance-theoretic model 
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(of Fig. 2) is obtained as the restriction of the 
covalent-space (Heisenberg) VB model just to the 
subspace of Kekule structures which are those VB 
structures with spin-pairings solely between near¬ 
neighboring pairs of sites. The molecular structure 
is conveniently mathematically encoded in terms 
of a molecular graph G which is specified by the 
sites (which here are the various 7r-orbitals) and by 
the set of pairs of neighbor sites. The Kekule struc¬ 
tures are nonorthogonal, but they may be sub¬ 
jected to Lowdin symmetric orthogonalization in a 
systematic fashion [19] to yield what is known as 
the Herndon-Simpson model [20], which most 
simply takes the form 

Hhs-KiEA. + ^EA,, (2.5) 

a j3 

where and R 2 are parameters, the a- and 
/3-sums are over the 6- and 10-cycles of G, and A a 
and Ap are operators which give a nonzero result 
only if they act on a Kekule structure with a 
pattern of alternating rr bonds around the cycle (a 
or /3), in which case they reverse the pattern of 
alternation around that cycle. Then, utilizing a 
ground-state wave-function Ansatz as an equally 
weighted sum of all Kekule structures, there re¬ 
sults [21] a resonance energy expressed in an ele¬ 
gant graph-theoretic form, which, in fact, can also 
be motivated [22] from purely classical chemical¬ 
bonding ideas. 

Additional models are also indicated in Figure 
2. Any of these models can be expressed in a 
graph-theoretic form, and all involve explicit elec¬ 
tron correlation. Also, although a number of 
many-body solution techniques are possible as 
briefly discussed elsewhere [23], focus here is to be 
on a class of wave-function cluster expansions 
where local structures are manifest. Indeed, three 
types of such wave-function cluster expansions are 
here described in some detail. 


3. Wave-function Cluster Expansions 

One general type of approach to the solution of 
such models is via the localized cluster expansion 
of system wave functions. Now, wave-function 
cluster expansions seem (at least in an ab initio 
framework) most commonly to be carried out in 
the coupled-cluster format [24] based on a single- 
determinantal MO reference, although it is under¬ 


stood [25] that the technique can be applied in a 
local format based on a reference built from local¬ 
ized orbitals. Also, such localized-orbital coupled- 
cluster techniques have quite successfully been 
utilized on the Heisenberg model (as well as the 
Hubbard model). But, especially for the lower 
models in the hierarchy of Figure 2, this has not 
been done, and, indeed, it is often not so clear 
what role such orbitals would play. Here, then, we 
consider a general type of local substructure clus¬ 
ter expansion for wave functions. Considerations 
here are limited to developments for the Heisen¬ 
berg spin Hamiltonian and the Herndon-Simpson 
Hamiltonian, although much the same sort of ideas 
should be rather much more generally applicable. 
Even within the stated limitations, there are differ¬ 
ent types of local cluster expansions, and they are 
here discussed separately, but the three discussed 
are formulated in graph-theoretic language. 

One presumes a molecular graph G, which may 
be thought of as the classical molecular connectiv¬ 
ity pattern. The graph G is presumed to be con¬ 
nected in the sense that there exists a sequence of 
edges leading from any site to any other. Further, 
often, it is here assumed that G is bipartite in the 
sense that the set of sites may be partitioned into 
two subsets termed starred and unstarred subsets 
such that sites of one subset have neighbors solely 
in the other subset. For instance, all the classical 
benzenoids form bipartite networks, although sev¬ 
eral lattices arising in solid-state problems are also. 
It is to be emphasized that the presumption of a 
graph does not in itself engender any limita¬ 
tions—but, rather, it guides the form of the expan¬ 
sions, which when truncated in low order engen¬ 
ders an approximation. Different types of sub¬ 
graphs composed from subsets of sites and subsets 
of edges of G are of relevance, and often these can 
be disconnected. In particular, each Kekule struc¬ 
ture K is represented by a subgraph where the 
subset of sites for K is the full set of G and the 
subset of edges of K is to be such that every site 
has exactly one edge incident thereon 
in K. 

3.1. NEEL-STATE-BASED 

CLUSTER EXPANSION 

The first type of wave-function cluster expan¬ 
sion to be considered is the most nearly conven¬ 
tional and has at least in its simple two-site form 
been proposed several times [26] independently 
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and now used [27-30] quite successfully. For a 
bipartite graph G for the Heisenberg model, this 
systematic cluster expansion is based on the so- 
called Neel state which is a product of spins with 
a spins on the starred subset of sites and (3 spins 
on the unstarred subset of sites: 

| 0 > = na(f)fl/3(;). (3.1.1) 

i i 

Even by itself, this may be viewed to be a zero- 
order estimate for the ground state of the antiferro- 
magnetically signed Heisenberg model. But build¬ 
ing from this, a cluster-expanded wave function 
may be considered of the form 

W) = ex Pli (T + )|0>, (3.1.2) 

where the subscript U indicates a slightly modi¬ 
fied exponential function to be explained a little 
later and T is an excitation operator which may be 
written as a sequence of terms involving sub¬ 
graphs of one-, two-, three-, four-sites and beyond: 

T + = Zx,sr + YsXijSfsJ 

* 

+ E + x KjV tsS'sp 

i~j~k 

+ E (XijuSfsfsffi + X im sfsfsf 

l 

+ x Kj)ki s ? s f s k + x i(jk)iSfsp + ..., (3.1.3) 

where the various x# are variational parameters 
and the s% are spin-lowering or spin-raising opera¬ 
tors s~ or for site m, i.e., the s° change the 
spin occurring in the Neel state T> at site m . Also, 
note that within the three- and four-site terms 
there are what would usually be called pair excita¬ 
tions but involve pairs of sites as far apart as those 
which appear in the "proper'' three- and four-site 
terms appearing within these same terms. This 
may be recast in a general form thusly: 

£G y 

r + = E E*^nsf, (3.1.4) 

y i 

where the first sum is over (connected) subgraphs 
y of G, the second sum is over subsets £7 of sites 
of y such that the set <9* occurs in no smaller 
subgraph, and the product is over the sites in £?. 
Then, the wave function may be expanded in terms 
of subgraphs 7 which have many components 


disconnected from one another, it now being un¬ 
derstood that the subscript U on the exponential in 
Eq. (3.1.2) indicates the deletion of any terms in the 
Taylor series expansion of the exponential involv¬ 
ing products of subgraph terms with any sites in 
common. Then, the resultant "disentangled" form 
for the wave function is 

Gy 

|¥> = E EXCY^risfl®)/ (3.1.5) 

y SP * 

where £7 is a subset of the sites of 7 and the 
coefficient X(y,S*) breaks up into a product of 
x y/S * labeled by the different components y of 7 
and by the sets of sites of S? which occur in 7 : 

y JZ’&y 

x(v,^) = n (3.1.6) 

y SP 

Really, Eqs. (3.1.5) and (3.1.6) specify the present 
Ansatz quite completely, the form of Eqs. 
(3.1.1)—(3.1.4) simply being more like the usual 
manner of presentation in the conventional cou¬ 
pled-cluster literature. But, a further important 
point (which much simplifies matters) is that if the 
net number of spins is to be conserved (as for the 
isotropic Heisenberg model) then the excitations 
not so conserving this number may be omitted, 
i.e., all the terms involving an odd number of 
excitations may be omitted, and, in addition, real¬ 
izing that the graph G is presumed to be bipartite, 
terms without a balance of starred and unstarred 
sites [such as that associated with x i( j )k ] may be 
omitted. Thus, Eq. (3.1.4) reduces to 

r+ = E x u s^ + £ (x ijkl s?sjsffi 

i~j 

+ x i{jk)l s ? s ?') + •••/ (3.1.7) 

although, again, all the simplification may be en¬ 
visaged in the form of Eq. (3.1.5). Finally, with 
symmetry, many of the x-parameters may be iden¬ 
tified as equal. With all sites translationally equiv¬ 
alent, all the Xjj can be taken as the same. 

3.2. HULTHEN-KASTELEYN ANSATZ 

This approach originally considered by Hulthen 
[31] and Kasteleyn [32] and evidently indepen¬ 
dently by Huse and Elser [23] may be viewed to 
be also based on Neel-state-related ideas. Again, 
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special note of neighbor pairs of a and /3 spins is 
made as they occur in a general N-site product, 

\a(N)) = |a- 1 (l)cr 2 (2)cT3(3) ... <r N (N)), (3.2.1) 

of such sites spins = a, jS; i.e., with s(n(N), G) 
the number of /3 spins assigned to unstarred sites 
in cr(N ), the Hulthen-Kasteleyn cluster-expansion 
Ansatz is 

o-(n) 

(3.2.2) 

where the variational x-parameters take the value 
1 if crj # o-j . The phase factor here is convenient 
since it is known [34] that upon inclusion of this in 
the product basis vectors the ground-state wave 
function (for the nearest-neighbor Heisenberg on a 
bipartite G) appears nodeless , in the sense that 
the correct expansion coefficients for the 
(-1 ) s(<t(N),G) |<x(N)> can all be chosen non-nega¬ 
tive. If all the neighbor pairs are translationally 
equivalent (while also there is spin-symmetry un¬ 
der the interchange of a and /3), then all the 
x ( Xaa) = x i; (/3j3) = x for i — j. Now, if 
#(a/3; cr(N), G) is defined as the number of a, /3 
pairs in cr(N) which are neighbors in G, then the 
ground-state Ansatz becomes 

I^HK>= £ x #(^;-(N),G) ( _ 1 ) ^(N);G) k(N)>< 
cr(N) 

(3.2.3) 

Moreover, one can imagine a way to generalize 
this Ansatz to higher orders, with variational pa¬ 
rameters x (cr(y)) dependent on the spin configu¬ 
ration <r(y) on a (connected) subgraph of type y, 
i.e., we let #(cr(y), y; o-(N), G) denote the number 
of subgraphs of type y in G such that the spin 
configuration cr(y) appears in |cr(AO), and then 
we take the general ground state to be expressed 
as 


form given, there is an overabundance of parame¬ 
ters—one need not use subgraphs y which are 
included in any other retained subgraphs. (Or if 
such smaller subgraphs are kept, the associated 
variational parameters can be chosen, say, to take 
the values as determined when there were no 
larger subgraphs included in the earlier truncated 
duster expansion.) 

3.3. RESONATING VB CLUSTER EXPANSION 

Neither of the more nearly conventional 
cluster-expansion Ansdtze of the preceding two 
sections really incorporate VB-theoretic ideas for 
wave functions, but this now is to be done. 
Strangely, though the most closely related to clas¬ 
sical chemical ideas, it seems to be the most re¬ 
cently considered [29, 35-37]. This VB-theoretic 
Ansatz is definable in terms of standard singlet 
spin-pairing functions 

0[i,/] = a(i)p(j) - /3(0a(;). (3.3.1) 

Then, presuming that G has at least a nonzero 
number of fully paired Kekule structures K, the 
lowest-order resonating VB wave function is taken 
to be 

I^rvb) s £ ( O O jr /(-K)]/ (3.3.2) 

K \ i I j 

where the sum is over Kekule structures of G, the 
two products are over the starred sites of G, and 
i(K) denotes the site to which i is paired in K. For 
a higher-order Ansatz , one considers the Rumer 
structures R with spin-pairing between more dis¬ 
tant sites but always such that starred and un¬ 
starred sites are paired. The wave function makes 
use of the idea of convex subgraphs y of G, which 
is such that for every pair of vertices i and j of y 
every shortest path of G between i and j is also in 
y. Then, the general resonating VB wave-function 
takes the form 


l*>= E n{x>(y))} #(ff(r) ' 7;CT(N),G) 

a(N) y 

(3.2.4) 

If subgraphs up to the size of G are kept, then this 
includes the exact wave-function. In fact, in the 


G /RAG \ ★ 


FI x y n if y(-R)]/ (3.3.3) 


r \ y J j 


where the y-product here is over the components 
of the smallest (convex) subgraph of G such that R 
entails the pairing of no sites in different compo- 
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nents. Of course, the variational parameters x y are 
expected to become ever smaller as the (con¬ 
nected) subgraph component y becomes ever 
larger, and usually one will truncate the Ansatz to 
keep just smaller size y. 

3.4. OTHER POSSIBLE CLUSTER-EXPANSION 

ANSATZE 

There are slightly different ways to set up each 
of the types of cluster expansions noted in the 
preceding sections. First, for the Neel-state-based 
Ansatz , one need not utilize the "unlinked" sub¬ 
script U on the exponential, although, then, the 
"disentangling" in matrix elements as in the next 
subsection would not be quite so complete (and 
the results would be numerically different at a 
given order of truncation), at least as far as the 
higher-order Ansatze are concerned. Second, for 
the lattice-gas Ansatz , one could seek to utilize the 
extra freedom in the x y (cr(y)) imposing extra con¬ 
ditions, say by "factoring" the variational coeffi¬ 
cients to pair factors (cr ? o- ; ) independently of 
whether i and j are neighbors or not; hence, the 
overlap reduces to the partitions function for an 
lattice-gas model solely with pair interactions. 
Third, for the reasoning VB model, the freedom 
connected with the overcompleteness of the full 
set of Rumer structures might be sought to be 
utilized, say again by a similar sort of factorization 
to pair terms. Although neither of these pair factor¬ 
ization assumptions is known to include the exact 
ground state, there is some empirical evidence [29, 
33] of very high accuracy. 

But there are at least a couple of other some¬ 
what different sorts of possibilities for cluster- 
expansion Ansatze, although seemingly less well 
developed. One "sequentially" spin-adapted clus¬ 
ter expansion is related to the Young-Yamanouchi 
basis in somewhat the same sense that the resonat¬ 
ing VB Ansatz is related to the set of locally paired 
VB structures. In this approach, the product basis 
|o“(N)> again is considered, and the z-component 
of spin at site i is denoted by m i (= +1/2 for 
cr i = a and = — 1 /2 for = /3). Also, it is conve¬ 
nient to define 

i 

Mj = £ m, (3.4.1) 

/= 1 

and we let S, denote a total spin (quantum num¬ 
ber) for indices 1 -» i. Then, in a lower-order ap¬ 


proximation, 

/ N— 1 

+e q >= £ E ru-(s„s, +1 ) 

o-(N) S(N) l , = 1 

X(S/M,- X o- i+1 |S /+1 M j+1 )Jk(N» / (3.4.2) 

where S(N) = (Sj, S 2 , ..., S N ), the x(S i ,S i+1 ) are 
variational parameters, and the ( S i M i X cr /+1 | 
S Z+1 M /+1 ) are Clebsch-Gordon (or spin-coupling) 
coefficients. More generally, in place of the two-site 
approximation with just x/S,-, S i+1 ), one would (at 
the k + 1-site approximation) instead have 
Xf(S f , S i+1 , S i+2 ,..., S i+k ). But, further, in invoking 
an approximation, one limits the values which the 
Sj may take—e.g., variational x-parameters with 
Sj arguments exceeding some selected value are 
taken as 0. Presumably, though, because of the 
manner of the spin-coupling, this Ansatz (in its 
lower-order forms) is really only well adapted to 
quasi-l-dimensional systems (i.e., the chemical 
graph-theoretic circumstance is rather special). 

Also, finally, a cluster-expansion on a "basis" of 
cluster-expanded wave functions might be 
mentioned as a possibility. Granted a localized 
cluster-expansion as any in the preceding sections, 
the local x-parameters can be viewed as separate 
variables which in the ground state for translation- 
ally symmetric systems fall into a relatively few 
equivalence classes, so that changing one at a time 
away from the ground-state value to give a result 
orthogonal to the ground state yields what may be 
viewed as candidate excited states or, alterna¬ 
tively, as new basis states. But in the local picture, 
this gives ~ N such "single" excitations with 
changes to just one local x-parameter, and one can 
inquire what happens when two or more such 
changes are made. Of course, when such changes 
are made for well-separated regions, additional 
"multiple" excitations arise so that something ap¬ 
proximating a new cluster-expanded basis seems 
to be arising. With such a new basis, a new repre¬ 
sentation for the Hamiltonian can be made, such 
new representation still being essentially local, and 
any standard many-body solution technique might 
be attempted on this new basis. In particular, a 
new cluster expansion might be made. Sometimes, 
the basis functions may not fully span the original 
space, as is the case with the Neel-state-based 
Ansatz if the excitations are built from a low-order 
primary cluster expansion, although, even so, the 
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secondary cluster expansion can provide some de¬ 
gree of improvement. The same limitation occurs 
with a low-order resonating VB Ansatz, but no¬ 
tably for the lattice-gas Ansatz for a linear chain, 
the new basis is essentially complete, as discussed 
elsewhere [38]. 

3.5. OVERLAP MATRIX-ELEMENT 
COMBINATORICS 

Of course, the manner in which one computes 
with the cluster-expanded wave functions of the 
preceding sections is of crucial importance. The 
most standard approach at least for the more con¬ 
ventional MO-based cluster expansions is via 
Cizek's coupled-cluster method [24]. Also, indeed, 
this can conceivably be done for the localized-site 
cluster expansions as suggested previously [39] 
and as implemented [25, 30] in practice for (what 
is a slightly modified version of) the Neel-state- 
based Ansatz of Section 3.1. But, in fact, at least for 
quasi-1-dimensional (or finite) systems, all the 
Ansatze of the preceding subsections can be treated 
in a purely variational manner, which emphasizes 
interesting statistical mechanical analogies. Hence, 
one sees clearly possibilities for different types of 
long-range orders (as discussed in a following sec¬ 
tion)—and there are possibilities for novel types of 
phase transitions. 

Here, the variational combinatorial form might 
be briefly considered for the Neel-state-based 
Ansatz of 3.1. Most simply, restriction might be 
made to just the two-site Ansatz , for which the 
overlap matrix element takes the form 

<W> = 

7 S’ 7 ' S" 

-j. 

X<*| FK UsfWW ,s*) 

' i j 

- L E Ex(7,m(7',^). (3.5.D 

7^7' 

This may be refined further in a general form in 
terms of "superposition" graphs (composed from 
pairs y and 7 '). But as an introduction, the 
graph-theoretic elegance might be illustrated with 
consideration of the case with just nearest-neigh¬ 
bor symmetry-equivalent pair excitations. For this 
circumstance, then, the overlap may be repre¬ 
sented as a sum over terms associated to different 
subgraphs of G, each subgraph corresponding to 


suitable products of the x fy - corresponding to the 
edges of the subgraph. Each site-excitation in the 
ket needs to be matched by one in the bra, so that 
two types of connected components can arise in 
these subgraphs: even cycles with pair excitations 
from ket and bra alternating around the cycle and 
isolated edges where a pair excitation in the ket 
and one in the bra exactly match. Hence, the over¬ 
lap is a weighted sum over such sesquivalent sub¬ 
graphs S, and the weights are a product over 
weights for each component: 2x M for an even 
M-cycle of S, with the 2 arising because there are 
two ways to have alternation of the pair excita¬ 
tions around the cycle between ket and bra, and 
x 2 for an isolated edge of S. Then, 

G 

<w> = £2 c(S) x e(S)+i(S) , (3.5.2) 

s 

where the sum is over sesquivalent subgraphs of 
G, c(S) is the number of cycles in S, c(S) is the 
total number of edges in S, and i(S) is the number 
of isolated edges in S. This may be expressed [40] 
in terms of the permanent of a weighted molecular 
graph G: 

<W> = per{i + X A}, (3.5.3) 

where A is the adjacency matrix of G. Graph 
theoretically, this is a type of subgraph-generating 
function associated to the parent graph G—the 
parameter x appears as the dummy variable in 
this generating function. In a statistical mechanical 
context, the mathematical form of ('\P'|M r > is the 
same as that of a partition function for allowed 
subgraph conformations, with x corresponding to 
a statistical mechanical activity . 

In fact, the graph-theoretic and statistical forms 
of <T r | A T) appearing like that of a generating func¬ 
tion or a partition function persists to higher-order 
or alternative cluster-expansion wave-function 
Ansatze. The simple resonating VB Ansatz which 
involves just Kekule structures leads [36, 37] to an 
overlap given as a different sum over spanning 
sesquivalent graphs. The simple Ansatz for the 
conjugated-circuit case yields an overlap which is 
just a count of the Kekule structures. For the near¬ 
est-neighbor Hulthen-Kastelyen Ansatz , the over¬ 
lap becomes [35] essentially the Ising model, and 
higher-order such Ansatze are just elaborated 
"lattice-gas" models. 
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3.6. HAMILTONIAN MATRIX-ELEMENT 
COMBINATORICS 

To obtain the energy, one considers the Hamil¬ 
tonian matrix element. For the Heisenberg spin 
Hamiltonian, it is a sum over matrix elements of 
pair interactions ( ij) spin for neighbor sites i ~ j. 
The Hamiltonian matrix element may then be de¬ 
veloped in a diagrammatic form with a special 
type of edge (perhaps represented by a wiggly line 
segment) for each pair interaction. For the nearest- 
neighbor Heisenberg model utilizing the simplest 
Neel-state based Ansatz, this, then, involves just 
replacing one of the x-parameters by an exchange 
parameter from the Hamiltonian; hence, one ulti¬ 
mately finds [40] 

<^|H|^> - /^<W>. (3.6.1) 

But now if we hark back to the permenental for¬ 
mula for <4 / |4 / >, one can obtain a formula in terms 
of a sum of ratios of permanental minors to the 
full permanent. In statistical mechanical contexts, 
just such ratios appear as logarithmic derivatives 
(with respect to activities). In a graph theoretic 
context, these ratios appear in average (weighted) 
subgraph counts. Amusingly, also, such a ratio is 
much reminiscent of those that appear in bosonic 
SCF problems. 

But such graph-theoretic and statistical mechan¬ 
ical analogies apply for both higher-order Ansdtze 
and for the other cluster-expansion Ansdtze earlier 
discussed. Of course, the higher-order Ansdtze be¬ 
come much more elaborate with many addi¬ 
tional "activities." For the nearest-neighbor 
Hulthen-Kastelyen Ansatz , the Hamiltonian ma¬ 
trix appears as a type of a derivative of the parti¬ 
tion function (or overlap) of the Ising model, and 
higher-order such Ansdtze similarly involve just 


elaborated "lattice-gas" models. For the nearest- 
neighbor resonating VB Ansatz "derivatives" of 
the overlap subgraph, generating polynomial arise 
[36]. In general [39], the analogies arising in these 
problems for the overlap matrix elements and for 
the Hamiltonian expectation values may be sum¬ 
marized as in Figure 3. Rather, generally, both 
overlaps and partition functions represent sums 
over relative probabilities which tend to correlate 
only nearer sites, and the Hamiltonian expecta¬ 
tions should involve much the same probability 
structure modified in some manner to give a size- 
extensive result, as is done by a logarithmic 
derivative. 

The analogies of Figure 3, of course, mean that 
methods used to solve one problem can be utilized 
in solving another. Thus, e.g., the statistical me¬ 
chanical transfer-matrix method [41, 42] to deal 
with partition functions and associated expectation 
values can be utilized for the wave-function prob¬ 
lems, too—quite generally in a computationally 
amenable fashion for quasi-l-dimensional prob¬ 
lems [43]. Or the Metropolis Monte Carlo method 
[44] can be neatly translated [29, 33] to the varia¬ 
tional wave-function-based sort of problem indi¬ 
cated here. But, besides computational methods, 
there are conceptual many-body features, such as 
long-range order and phase transitions. 


4. Qualitative Results 

Results for the various cluster expansion Ansdtze 
of the preceding sections might be compared to 
one another and several other approximation 
schemes in some global way. But, too, the Neel- 
based and resonating VB-theoretic approaches may 
be understood to apply for different structural 


(TIT) - {overlaps} > j {P artition functions} s Z 
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FIGURE 3. A correspondence among quantities: first, for cluster-expanded wave functions; second, for graph-theoretic 
generating functions; and, third, for statistical mechanics. 
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circumstances. This was perhaps first suggested in 
the context of a particular case (namely, the trian¬ 
gular lattice) by Anderson and Fazekas [45], but 
the argument can be made [46] much more gener¬ 
ally, i.e., one may compare the (0-order) energies 
for a simple Neel-state and a single Kekule struc¬ 
ture. The energy per site for a Neel state on a 
bipartite graph is readily obtained as 

£ (0) (Neel) = -Jz/4, (4.1) 

where z is the mean number of neighbor sites (i.e., 
the mean coordination number). The energy per 
site for a single Kekule structure (on a graph 
which admits at least one Kekule structure) is 
readily obtained as 

£ (0) (Kekule) = -J 3/4. (4.2) 

Hence, one can summarize the situation in the plot 
of Figure 4, where also is shown the results of 
further refinement of the resonance-theoretic ap¬ 
proach. For the Kekule-structure-based approach, 
a resonating VB result based on an equally 
weighted combination of all Kekule structures as 
applied [36] to a special sequence of benzenoid 
polymer strips (of varying mean coordination 


energy per site 



FIGURE 4. Plots of Neel-state energy per site and 
Kekule-structure energy per site for the nearest-neighbor 
Heisenberg (or VB) model as a function of the mean 
coordination number z. The Neel-state result is for 
bipartite graphs, while that for a Kekule structure 
presumes that the graph admits at least one Kekule 
structure. 


number) is included in the plot. As a result, one 
sees that 

■ Resonance-theoretic descriptions are most 
appropriate for (benzenoid or aromatic) sys¬ 
tems with low mean coordination number 
(z < 3) and with many Kekule structures 
per site. 

■ Neel-state-based descriptions are most ap¬ 
propriate for (3-dimensional) structures with 
high mean coordination numbers and low 
extents of frustration (i.e., with few small 
odd cycles of antiferromagnetically-signed 
interactions). 

Indeed, these conclusions are quite naturally sup¬ 
ported by the experimental successes of 
resonance-theoretic ideas in organic chemistry in 
application to the types of systems of the first 
proposition, and of Neel-state-based ideas in 
solid-state magnetic materials science in applica¬ 
tion to the types of systems of the second proposi¬ 
tion. 

Another distinction between resonance-theoretic 
and Neel-state-based approaches is that each seem 
to exhibit a qualitative difference as regards long- 
range ordering. Indeed, the Neel-state-based 
Ansatz involves the "classical" type of Neel anti¬ 
ferromagnetic ordering with order parameter 

m(i,j ) = ( + )({s? - <sf»{s? - <s?>}) -> m a , 

as d(i, j ) -> o°, (4.3) 

where the ± sign applies as the sites are or are not 
of the same type (starred or unstarred), d(i, j ) is 
the distance between sites i and j, and the expec¬ 
tations {sf ) and (s 2 > can be anticipated to be 0 (in 
the absence of a magnetic field). Many of the 
conventional many-body techniques (Green's func¬ 
tions, spin waves, Neel-state-based many-body 
perturbation theory, etc.) lead to this type of long- 
range ordering, although each different scheme 
can lead to different nonzero values. But the res¬ 
onating VB-theoretic approaches turn out to lead 
[29, 34, 47] to another type of long-range spin-pair¬ 
ing order involving a spin-pairing correlation func¬ 
tion for pairs of bonds, say e and /, 

<t(e,/) = ({s a(e) • s b(e) - p(e)}{s a(f) • s b(f) -p(/)}), 

(4.4) 
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where a(e ) and b(e ) are the sites at the two ends of 
a bond e, and p(e ) is the pairing expectation 
<s a(e) • s b(e) ). Then, this correlation function can 
approach different limits for distant bonds, de¬ 
pending on the equivalence classes of the two 
bonds e and f. Indeed, this type of long-range 
order can be readily appreciated from considera¬ 
tion of classical neighbor VB-bonding diagrams or 
Kekule structures. For example, consider the two 
Kekule structures shown in Figure 5 for a polyan- 
thracenoid polymer. In either case, there are many 
other related Kekule structures that can be drawn 
by rearranging a few double bonds independently 
within each anthracene unit, while the bond inter¬ 
connecting anthracene units must remain always 
single or always double, i.e., if for one Kekule 
structure the bond order for one of these intercon¬ 
necting bonds is known, then it is known for all 
such interconnecting bonds (even those in the 
long-range limit). Also, of course, the Kekule 
structures in different bond-order classes are dif¬ 
ferent in every monomer unit so that in the long- 
polymer limit they are noninteracting (via any 
finite-particle interaction); hence, each class gives 
rise to its own ground state. Finally, this long-range 
order persists with the inclusion of VB structures 
beyond Kekule structures so long as the range of 
the spin-pairings is limited (or even if it is not 
limited but the contributions of longer spin-paired 
structures falls off sufficiently fast, as one might 
expect if the resonance-theoretic description is to 



FIGURE 5. Two Kekule structures for a polymeric string 
of adjoined anthracene units. Note that with the first 
bond between adjoining anthracenes being single all the 
remaining interunit bonds are single, whereas with the 
first bond between adjoining anthracenes being double, 
all the remaining adjoining bonds are double. 


be reasonable). For polymers with multiple inter¬ 
connections between monomer units, the bond lo¬ 
calization often is not so severe but still occurs to 
some degree differently for different classes of 
Kekule structures. For example, consider the two 
Kekule structures of the polynaphthalenoid poly¬ 
mer of Figure 6, where the Kekule structures are 
representatives from two classes with either 0 or 1 
of the inter-connecting bonds between naphthalene 
units always being double. (A third class with two 
such bonds always double contains but a single 
Kekule structure.) Further, these different classes 
with associated ground states lead to a considera¬ 
tion of the possibility of solitonic excitations, as 
discussed elsewhere [48]. 

Finally, it may be mentioned that there are a 
number of rigorously proved [49] theorems con¬ 
cerning ground-state symmetries for the nearest- 
neighbor Heisenberg for a bipartite graph. Also, 
for this case, there are theorems [50] concerning 
long-range antiferromagnetic ordering at finite 
(nonzero) temperatures. 


5. Example Quantitative Results 

Results for the various cluster expansion Ansiitze 
of earlier sections might be illustrated for a few 
representative cases. When possible, comparison of 
different approximation schemes might be made. 
The first example cases here though will be for the 
conjugated-circuits model which is believed to be 
quite accurately solved by the uniformly weighted 
RVB cluster Ansatz. The second and third example 
cases return to the consideration of the nearest- 
neighbor Heisenberg model. 



FIGURE 6. Two Kekule structures for a doubly adjoined 
ladderlike chain of naphthalene units. Note the requisite 
long-range ordering manifested in the number of double 
bonds that occur between successive naphthalene units. 
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A general class of interesting molecules is that 
of the fullerenes of icosahedral symmetry, the C 60 
member being the "uniquely elegant" truncated 
icosahedron structure [51], for whose discovery the 
1996 Nobel prize in chemistry was awarded. This 
high-symmetry class of fullerenes has now been 
studied for almost a decade [52, 53], and a great 
deal of effort has been directed [54] to solutions of 
the Hiickel model, although more complete quan¬ 
tum chemical models have also been utilized [55]. 
The study via resonance-theoretic models is rather 
meager, in part, because the phases of the Kekule 
structures are not so readily specified as for the 
bipartite case (with starred and unstarred sites). 
But the conjugated-circuits model can be applied 
in a rather empirical mode (where such phase 
problems are simply presumed not to be explicitly 
manifested), and the six structures with up through 
240 sites have been treated [52] by the conjugated- 
circuits model. But, in fact, the powerful computa¬ 
tional procedure of [56] is quite readily applicable 
to much larger systems (so long as one remains 
within the conjugated-circuit framework, with or¬ 
thogonal Kekule structures). Hence, in Table I, we 
here report the results for all the 20 icosahedral- 
symmetry fullerene structures of up to 980 sites 
(so long as they have isolated pentagons, which 
seems to be a reasonable criterion for stability of 
fullerenes [57]). Such icosahedral-symmetry 
fullerene cages are uniquely identified [52-54, 58] 
by a pair of numbers h and k, which characterize 
an equilateral-triangular-network fragment, 20 
copies of which when appropriately joined to¬ 
gether may be viewed to compose the whole cage. 
Such a triangular fragment may be obtained by 
cutting from the graphite honeycomb network an 
equiliateral triangle with apices in the centers of 
three hexagons—the numbers h and k represent 
the coordinates of one apex relative to another in a 
skew coordinate system as indicated in Figure 7, 
where also the unit of distance is the (shortest) 
distance across a hexagon. Once one has 20 copies 
of one of these equilateral triangular fragments, 
"gluing" each to one of the same-sized faces of a 
regular icosahedron yields the fullerene cage, with 
one 5-membered ring appearing at each of the 
apices of the underlying icosahedron. The cages 
are characterized by 0 < k < h > 0; they have N 
= 20 (h 2 + hk + k 2 ) sites, and, evidently, the sym- 



FIGURE 7. An illustration for (h, k ) = (3,1) of the 
associated equilateral triangle to be cut from the graphite 
lattice graph, whereafter 20 copies of such triangles may 
be fused together on the surface of an icosahedron with 
faces of the same size so as to yield an icosahedral- 
symmetry fullerene. 


metry of the resulting graph is if h — k or 
k = 0, while, otherwise, it is of symmetry J^. One 
may observe that the results of the Table I reveal 
that the total Kekule-structure counts increase 
rather dramatically with cage size, but a suitable 
graph-theoretical computational method [57] is still 
able to handle the results for the simple wave- 
function cluster-expansion. Although the total 
Kekule-structure count evidently diverges expo¬ 
nentially fast with cage size, the Kekule structure 
counts per site ( defined as K 1/N , where K is the 
total count) rather rapidly approach a finite limit, 
which must be that for graphite. A similar slightly 
more slowly converging behavior is seen in Table I 
to apply for the conjugated-circuit resonance en- 
ergy per site. Notably, though, the cages with h-k 
not divisible by 3 exhibit [53, 54, 59] a 
HOMO-LUMO gap of 0, so that they should be 
reactive (perhaps near the 12 apices of the underly¬ 
ing icosahedron, where the structures are least like 
graphite). Notably, this partitioning of the cages in 
accordance with the divisibility of h-k by 3 is 
manifested in terms of the per-site conjugated-cir¬ 
cuit resonance energies and even (to a lesser extent 
though) in terms of the per-site Kekule-structure 
counts, i.e., if h-k is divisible by 3, then the reso¬ 
nance energy (per site) is somewhat less stabilizing 
than usual otherwise (and a related behavior can 
be discerned for K 1/N ). Finally, it may be noted 
that the N = 240 result in this table corrects slightly 
an earlier reported result [52]. 
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TABLE I_ 

Kekuld-structure counts and conjugated-circuit resonance energies for icosahedral fullerenes. 


N 

(h, k) 

K 

fcVN 

£ res /A/fli 

60 

(1,1) 

12,500 

1.17026 

0.14271 

80 

(2,0) 

140,625 

1.15971 

0.09717 

140 

(2,1) 

2,178,836,352 

1.16601 

0.15262 

180 

(3,0) 

1,389,029,765,625 

1.16804 

0.17977 

240 

(2,2) 

21,587,074,966,666,816 

1.16966 

0.18490 

260 

(3,1) 

249,276,672,312,858,144 

1.16656 

0.16632 

320 

(4,0) 

4,742,708,922,019,216,416,528 

1.16879 

0.17816 

380 

(3,2) 

63,052,896,815,923,948,794,000,000 

1.16921 

0.17976 

420 

(4,1) 

55,853,910,213,101,170,303,889,310,104 

1.17070 

0.19122 

500 

(5,0) 

1.05984557486917155901 X 10 34 

1.16964 

0.18198 

540 

(3,3) 

1.31669745812355346381 X 10 37 

1.17149 

0.19307 

560 

(4,2) 

1.65980752921912098393 X 10 38 

1.17017 

0.18432 

620 

(5,1) 

3.05480054527886621321 X 10 42 

1.17091 

0.18766 

720 

(6,0) 

4.24881220361973614246 X 10 49 

1.17200 

0.19477 

740 

(4,3) 

6.53828524614458400211 X 10 5 ° 

1.17130 

0.18886 

780 

(5,2) 

6.64523410371069994773 X 10 53 

1.17220 

0.19513 

860 

(6,1) 

1.30985649576953680262 X 10 59 

1.17150 

0.18940 

960 

(4,4) 

2.49942403779921924432 X 10 66 

1.17264 

0.19596 

980 

(5,3) 

3.29588295422837584208 X 10 67 

1.17191 

0.19093 

980 

(7,0) 

3.84734704509710763917 X 10 67 

1.17210 

0.19188 

oo 

(°°, /f) 

— 

1.17531 

0.19962 


5.2. THE HEISENBERG MODEL FOR AN 
INFINITE CHAIN 

One natural (and oft initially considered) sys¬ 
tem on which to make a comparison of various 
strong correlation approximation methods is the 
simple case of the 1-dimensional chain of sites. 
Also, such a comparison for the infinite-length 
chain is made in Table II, where energies per site 
are reported for the three types of cluster- 
expanded wave functions discussed in Sections 
4-6 along with results reported via other many- 
body methods and the exact result [31], known for 
cyclic chain systems, via a (Beth e-An$atz) method 
which does not seem to be very readily extendable 
to other structures. It may be noted that the so- 
called Jordan-Wigner-transformed RHF and UHF 
results are restricted [60] and unrestricted [61] 
Hartree-Fock solutions for the Hamiltonian after a 
transformation of the Pauli operators in • s i + 1 to 
a Fermionic form, and, again, this approach seems 
not too readily extendable to other structures. The 
spin-wave result involves [62] a Hartree-Fock so¬ 
lution after transformation to Bosonic operators, 
and is applicable to all kinds of structures, but the 
bosonic operators are defined on an enlarged space 


of which the physically realizable space is only a 
small part, while, however, the approximate 
Hartree-Fock solution admixes in this nonphysical 
space, so that it is not evident that one obtains a 
variational bound to the physical ground state. 
This Bosonic method seems not to be ordinarily 
applied to other than Heisenberg models, although 
there is some possibility [63]. These various ap¬ 
proximation schemes which seem to us not so 
amenable to other structures or to other models 
are indicated in Table II by placing their energy 
values in parentheses. Now, in light of the discus¬ 
sion in connection with Figure 4, this system is not 
expected to be exceptionally well described via 
either the Neel-state-based or resonance-VB-theo- 
retic approaches. But, nevertheless, rather reason¬ 
able results are obtained at least when any of these 
types of cluster-expanded wave functions are taken 
beyond the lowest order. Indeed, the results are 
quite good in comparison with a variety of stan¬ 
dard many-body methods, e.g., as compared to the 
unrestricted Hartree-Fock limit (at — 0.5000) which 
is one of the poorer results in the table, i.e., sizable 
portions of the so-called “correlation energy/' are 
obtained. The Monte Carlo methods of which there 
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TABLE II_ 

Energies per site for various methods as applied to 
the nearest-neighbor Heisenberg Model for an 
infinite ladder. 


Method 

{Energy} IJN 

Ref. 

Neel-state energy 

Ordinary UHF limit of the 

-0.5000 


Hubbard model 

-0.5000 


Single Kekule structure 

3 -* 1 first-order real- 

-0.7500 


space renormalization 

-0.7826 

[64] 

Hulthen-Kasteleyn Ansatz 
Jordan-Wigner- 

-0.8156 

[31,32] 

transformed RHF 

9 1 first-order real- 

(-0.8383) 

[60] 

space renormalization 
Cluster-expansion of 

-0.8423 

[65] 

cluster expansion 

Two-site Neel-state-based 

-0.8469 

To appear 

cluster expansion Ansatz 
(Non-variational) antiferro¬ 

-0.8558 

[26] 

magnetic spin-wave 
Jordan-Wigner- 

(-0.8634) 

[62] 

transformed UHF 

19 1 first-order real- 

(-0.8646) 

[61] 

space renormalization 
Two-bond-range 

-0.8650 

[66] 

resonating VB Ansatz 

First sequential cluster 

-0.8716 

[35] 

expansion of Eq. (3.4.2) 
Four-site Neel-state-based 

-0.8737 

To appear 

cluster-expansion Ansatz 
Three-bond-range 

-0.8758 

[27] 

resonating VB Ansatz 

-0.8790 

Present 

Exact 

(-0.886294) 

[31] 

Monte Carlo N = 48 ring 

3 -» 1 additive-cluster 

-0.887 

[67] 

renormalization 
Neel-state-based second- 

-0.8969 

[68] 

order perturbation theory 
Limit to Hubbard Green’s 

-1.0000 

[69] 

function approximant 

-1.2188 

[70] 


are different sorts exhibit what seems the highest 
accuracy for a variety of systems and models. 

5.3. THE HEISENBERG MODEL FOR THE 

INFINITE LADDER 

Another example system is that of a ladder, the 
structure of which pictured in Figure 8 can be 
viewed as a strip cut from the square-planar lat¬ 
tice. Here, various results are reported in Table III. 
This system satisfies conditions enunciated in con- 



FIGURE 8. A square-planar lattice strip. 


nection with Figure 4 so as to make a resonating 
VB picture more favorable: This system is of a 
fairly low mean coordination number z — 3, and it 
has many Kekule structures. Indeed, for a chain of 
n-fused squares, the number of Kekule structures 
is known to be given by the (n + l)th Fibonacci 
number F n+1 , which, in turn, can be expressed in 
very many ways, e.g., 

F m = {(2 + + (2 + <M4>-}/5, (5.3.1) 

where (fr + = (1 + /5)/2 ~ 1.61803399 is the 
"golden mean" and <£_= 1 — cf) + is some sort of 
complement. In concert with the expectations, the 
resonating VB-theoretic results seem to be quite 
good. 

But perhaps a few words should be said as to 
just what systems these results might apply. Basi¬ 
cally, in application to conjugated 7r-networks with 
4-membered rings, the Heisenberg model should 
have important higher-order terms, which, if not 
included, predict too large a resonance stabiliza¬ 
tion, e.g., for cyclobutadiene. Thus, the quoted 
results for the current nearest-neighbor model do 
not apply to such conjugated 7r-networks. 

But the current nearest-neighbor model is be¬ 
lieved to be well applicable to ladderlike struc¬ 
tures of Cu 2+ -containing perovskites (of the gen¬ 
eral structural class of high-temperature supercon- 

TABLE III_ 

Energies per site for various methods as applied to 
the nearest-neighbor Heisenberg Model for an 
infinite ladder. 


Method 

(Energy) IJN 

Ref. 

Neel-state energy 

-0.7500 


Single Kekule structure 

-0.7500 


Two-site Neel-state-based 

Ansatz 

-1.1014 

Present 

One-bond-range RVB 

Ansatz 

-1.1138 

[68] 

Neel-state 2nof-order 

perturbation theory 

-1.1250 

[69] 

Two-bond-range RVB Ansatz 

-1.1462 

Present 

Numerical extrapolation 

(from exact diagonalizations) 

-1.156 

[68] 
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ductors), and, indeed, these ladder structures have 
been much studied [71-73] in this context in the 
last few years. Interestingly, there is an important 
difference in relative applicability of Neel-state- 
based and resonating-VB-theoretic wave functions 
for the current case of ladders with two legs in 
comparison with a ladder with three legs. This 
difference can be readily understood in terms of 
the relative abundances of Kekule structures as 
mediated by the long-range spin-pairing orders 
discussed in Section 4 here, although these earlier 
understood [68] consequences were termed [71] 
"surprising" when first studied in connection with 
the experimentally realized perovskites. Basically, 
if the long-range spin-pairing order deviates much 
from the optimal resonance-theoretic circumstance 
for which spin-pairing from one site to every 
neighbor occurs with equal probability, then reso¬ 
nance makes a lesser contribution and the alterna¬ 
tive Neel-state description is relatively more pre¬ 
ferred. 


6. Prospects 

Many-body VB theory is seen to be imple- 
mentable in several different manners, even upon 
focus on the circumstance of variationally treated 
cluster-expanded wave functions. Much has been 
done with the simplest Ansatze, but a number of 
computations with higher-order Ansatze indicate 
even further promise. Such results at least in some 
cases seem to be of quite high accuracy. Within a 
variational format, it is a challenge to apply the 
methods to extended systems of 2- or 3-dimen¬ 
sional systems, although for the Heisenberg model 
on the square-planar lattice, there has been a cer¬ 
tain degree of work (in connection with high-tem- 
perature superconductivity, and some further indi¬ 
cation of this work is given in the reviews of [12]). 

There appear some qualitative differences be¬ 
tween many-body MO-based and VB-based wave 
functions. Also, it may be emphasized that, in 
practice, these differences can persist to somewhat 
high orders, most especially for extended many- 
body systems. Such a point has occasionally been 
made previously [74]. But, here, with the recogni¬ 
tion of the correspondence to statistical-mechanical 
quantities, as in Figure 3, two different types of 
wave functions might be imagined to fall into 
what in statistical mechanics are termed different 
"universality classes," e.g., exhibiting different 


types of long-range order and exhibiting different 
critical point behavior (independent of perturba¬ 
tive corrections to a zero-picture). Thus, many- 
body VB-theoretic cluster-expanded wave func¬ 
tions can be imagined to be much more than just 
another way to do a computation equivalent to 
that for an MO-based many-body wave 
function—the two descriptions need not in any 
achievable high order be equivalent. The question 
then devolves to just which systems each of the 
two types of descriptions apply. 

Thus, there appears a certain need for VB-theo¬ 
retic descriptions. Although the computations de¬ 
scribed here have focused on Heisenberg and con- 
jugated-circuit models, there is also some (lesser 
amount of) work on Hubbard or PPP semiempiri- 
cal models. It is hoped that the methods will be 
applied in an ab initio framework. 
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ABSTRACT: We outline the CASVB strategy, which may be used either to generate 
very compact modern valence bond representations of CASSCF wave functions or to 
optimize general types of modern VB wave function. A simple algorithm is presented for 
the elimination of redundant or constrained parameters from the appropriate second-order 
optimization problems. Selected results for three systems—benzene, the allyl radical, and 
LiH—are used to illustrate various ways in which CASVB procedures can be used to 
perform fully variational optimizations of nonorthogonal orbitals and structure 
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Introduction 

T he initial aim of developing our CASVB pro¬ 
cedures [1-4] was to exploit the invariance of 
wave functions of “full configuration interaction 
(Cl)" form to general nonunitary linear transfor- 
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mation of the active orbitals, so as to generate 
alternative representations of CASSCF wave func¬ 
tions in which a dominant component adopts 
modern VB form. By a wave function of “modern 
VB“ form, we mean here one or more spatial 
configurations constructed from general, nonortho¬ 
gonal orbitals, and combined with all allowed ways 
of coupling together the electron spins so as to 
obtain the required resultant. The prototype sin- 
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gle-configuration wave function of this type is the 
spin-coupled wave function [5]: 

*SC = <Pl<Pl ■■■Vl @pp 4>l 4>2 ■ ■ ■ <I>N ®SM ) OQ 

in which the singly occupied nonorthogonal or¬ 
bitals 4 V accommodate the active electrons, and 
the doubly occupied orthogonal orbitals <p t accom¬ 
modate the inactive ones. The spin eigenfunction 
®sm is fully optimized in the full spin space for N 
electrons with total spin S and projection M, and 
@pp represents perfect pairing of the spins of the 
2 n inactive electrons. 

It was anticipated from the outset that it would 
be desirable to incorporate our CASVB procedures 
into molecular orbital (MO) Cl packages that fea¬ 
ture powerful CASSCF codes. This could make it 
possible to use such "standard" packages to carry 
out the fully variational optimization of quite gen¬ 
eral types of modern VB wave function, by alter¬ 
nating optimization of the active and external pa¬ 
rameter sets in an iterative two-step procedure. 
The CASVB procedures have already been made 
available as part of the molpro package [6], in 
which they are interfaced to a very efficient 
CASSCF program [7], and we hope to see them 
incorporated into many others in due course. 

Since almost all of the technical details under¬ 
pinning our approach are available elsewhere 
[1-4], we present in the next section only a brief 
overview of the methodology. An important as¬ 
pect of actual calculations is our scheme for elimi¬ 
nating redundant or constrained parameters from 
the second-order optimization procedures; this is 
described here for the first time. We then use 
selected results for three systems—benzene, the 
allyl radical, and LiH—to illustrate some of the 
ways in which the CASVB procedures can be uti¬ 
lized to perform fully variational optimizations of 
nonorthogonal orbitals and structure coefficients in 
modern VB wave functions. 


CASVB Method 

Consider a normalized CASSCF wave function 
^ CAS consisting of N CI structures built from m 
active orbitals, c/> z . A general, nonunitary, nonsin¬ 
gular nn X m linear transformation of the orbitals 

{(/>'} = {( f )}0 ( 2 ) 


induces a corresponding N cl X N cl transformation 
of the structure space 

{<£'} = {O)T(O). (3) 

Without explicitly constructing T(O), its effect on 
the complete structure space can be realized in a 
very efficient manner by writing O as a product of 
m 2 simple updates and utilizing the general prop¬ 
erty of full Cl spaces that 

O - 0 1 0 2 ~ T(O) - TCO^TCO,). (4) 

The simple updates take the form 

O m „(A): (5) 

for which the corresponding transformation of the 
structure space can be achieved by application of 
the operator 

/ + AEW + A 2 E$, (6) 

in which E ( *J and Effl generate spin-averaged 
single and double excitations. This highly efficient 
strategy [1-4, 8] for realizing exactly the effects of 
the structure transformation in Eq. (3) is central to 
the viability of the CASVB approach. We have also 
presented expressions for the first and second 
derivatives of T(O) with respect to the A parame¬ 
ters [2], 

Consider now a modern VB wave function 
consisting of a linear combination of a subset of 
the N cl structures in the CASSCF expansion, but 
constructed instead from m nonorthogonal orbitals. 
An appealing criterion for choosing the orbital 
transformation O in Eq. (2) is to maximize the 
overlap S VB between ^ VB and <P CAS , 

_ <*CA S I*VB> , . 

VB “ ■ ( ’ 

An efficient second-order Newton-Raphson strat¬ 
egy for maximizing S VB was presented in Ref. [2]. 
This approach provides a relatively inexpensive 
scheme for representing CASSCF wave functions 
in modern VB form. Very high values of S VB , on 
the order of 0.99, may typically be achieved with a 
single spin-coupled-like configuration [Eq. (1)]. 

An alternative to maximization of S VB is to 
choose O by minimizing the expectation of the 
energy, E VB , defined according to 

r <^ vb |H|^ vb > 

VB (^vbI^vb) ’ } 
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The nonorthogonality problem implicit in evaluat¬ 
ing the matrix elements in this expression may be 
circumvented by transforming back to the original 
basis of orthogonal molecular orbitals, using the 
identities 


H' = T f (0)HT(0), 

(9) 

S' = T t (0)ST(0), 

(10) 

= T -1 (0)c = T(0 _1 )c, 

(ID 


in which H, S, and c are the Hamiltonian matrix, 
overlap matrix, and Cl vectors appropriate to {<£}, 
whereas the primed quantities relate to {O'}. An 
efficient second-order Newton-Raphson strategy 
for minimizing E VB was presented in Ref. [2]. The 
resulting ^P VB is optimal for the given choice of 
active space and corresponds to a representation of 
the full CASSCF wave function in which a small 
number of modern VB structures dominate the 
total energy. It is our experience that analogous 
overlap- and energy-optimized wave functions are 
always in very good qualitative agreement. The 
viability of the energy optimization method de¬ 
pends on the availability of highly efficient algo¬ 
rithms for realizing the action of the Hamiltonian 
operator on a full-CI vector [9, 10]. 

Furthermore, optimization of the energy expec¬ 
tation value E vb with respect to core-active, core- 
virtual, and active-virtual orbital rotations can be 
achieved using standard CASSCF procedures. As a 
consequence, fully variational modern VB calcula¬ 
tions can be performed by alternating optimization 
of the active and external parameter sets in an 
iterative two-step procedure. This has been imple¬ 
mented as part of the molpro package [6]. We 
have also presented practical approaches [3] for 
the proper utilization of molecular point group 
symmetry in modern VB approaches that involve 
the optimization of nonorthogonal orbitals. 


Elimination of Redundant or 
Constrained Parameters 

We consider in this section the optimization of a 
general multiconfiguration modern VB wave func¬ 
tion of the form 

1 v struc 

</> act = £ c,^ (12) 

1—1 


in which the structures ipj are defined in terms of 
a set of m orbitals, {c^y}, with varying occupations 
and spin functions. We have used the notation \p aci 
to remind ourselves that the total wave function 
can, without loss of generality, also include or¬ 
bitals that are doubly occupied in all structures. 
The active orbitals are allowed to be nonorthogo¬ 
nal, and are expanded in a set of basis functions 
{ Xj) according to 

*«bas 

</v = Ew- (13) 

i=l 

It is convenient to collect the " primitive" varia¬ 
tional parameters from Eqs. (12) and (13) into a 
vector x, such that x(l:m bas ) contains the expansion 
coefficients for orbital x(m bas +l:2m bas ) con¬ 
tains the expansion coefficients for orbital </> 2 , etc., 
and x(m Xm bas + l:m X m bas + N struc ) holds the 
structure coefficients. Our procedure seeks to min¬ 
imize the energy E VB (x) based on a second-order 
model of the form 

E vb (x + <$x) « E vb (x) + g T 8x + 1/2 8x t G 8x , 

(14) 

in which g is the (column) vector of first deriva¬ 
tives of E with respect to the variational parame¬ 
ters, and G is the corresponding matrix of second 
derivatives (Hessian). There are many variants of 
such procedures, such as, for example, stabilized 
Newton-Raphson schemes or augmented Hessian 
approaches [11-13]. 

Methods for eliminating redundant or con¬ 
strained parameters from such second-order opti¬ 
mization problems are well known in the literature 
(see, e.g.. Ref. [11]). A common way of taking 
constraints into account is to include Lagrange 
multipliers in the objective function. It is well 
known, however, that such approaches can ad¬ 
versely affect the convergence characteristics. An 
attractive alternative is the use of nonlinear elimi¬ 
nation in a constrained minimization scheme that 
involves first and second derivatives of the con¬ 
straints with respect to the variational parameters 
[13]. The second-order nonlinear elimination con¬ 
strained minimization algorithm described in Ref. 
[13] has proved invaluable in the direct calculation 
of core-optimized spin-coupled wave functions. 

We have strived here for a computationally 
somewhat simpler approach, bearing in mind that 
the number of variational parameters can be quite 
considerable in practical applications. Some of the 
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methodology described here has now also been 
implemented in our determinant-based spin-cou¬ 
pled code [14]. Direct comparisons with the non¬ 
linear elimination approach show no appreciable 
degradation in convergence characteristics, but a 
significant reduction in the computational over¬ 
head. Analogous considerations to those described 
here apply also to CASVB calculations that involve 
maximization of S VB (x) rather than minimization 
of Eyg(x), 

One can of course always define nonsingular 
linear transformations of the parameters 



y = Lx, 

(15) 

and rewrite the second-order model in the form 


E VB (y + Sy) * E VB (y) + (g y ) T Sy 



+ 1/2 8y T G !/ 8y, 

(16) 

with 

g^a-Tg 

(17) 

and 

G y = (L" 1 ) T GL” 1 . 

(18) 


It is useful to point out here the practical advan¬ 
tage of choosing L to be an orthogonal transforma¬ 
tion, so that L -1 = L t . 

At the start of each iteration, x is modified (if 
necessary) so that ' v h VB obeys all of the constraints. 
Our primary concern in the optimization proce¬ 
dure is then to define a rectangular transformation 
matrix L such that our constraints are necessarily 
satisfied in the updated vector, x + 8x. In the 
CASVB strategy, we can distinguish three classes 
of constraint: 

1. Simple constraints of the form c T x = 0 

2. Normalization conditions for orbital and 
structure coefficients 

3. Orthogonality constraints 

These are in order of increasing complexity but, in 
practice, the elimination of parameters associated 
with normalizations must always be carried out 
last. 

SIMPLE CONSTRAINTS 

Constraints that may be expressed in the form 

£c(0x(0 = 0 (19) 


include various simple cases such as, for example, 
forcing a single expansion coefficient to be zero (in 
which case c has only one nonzero coefficient) or 
setting pairs of coefficients equal or opposite. 
However, much more complicated conditions, such 
as symmetry requirements for orbitals, are also 
covered here. The symmetry condition on a single 
orbital 

6\ = H,, (20) 

can be satisfied by diagonalizing the matrix repre¬ 
sentation of the symmetry operation O s in the 
basis of {Xiir and then expanding </> in the (right 
hand) eigenvectors corresponding to the eigen¬ 
value A [3]. If n x such eigenvectors exist, then the 
symmetry condition embodied in Eq. (20) gener¬ 
ates ra bas — n x simple constraints, where the con¬ 
straints vectors will span the orthogonal comple¬ 
ment to the eigenspace associated with A. When 
two orbitals are related by symmetry, as in 

6% = 4> v , ( 21 ) 

then the set of allowed parameter vectors can be 
found by the matrix multiply 

c(i + (v - 1 )m bas ) 

= E(O s ) y c(; + (fi- l)m bas ), 

/ 

i = l,...,m bas , (22) 

for each of the m bas linearly independent vectors 
describing updates to </y. As before, the con¬ 
straints may be defined as the m bas parameter 
vectors spanning the orthogonal complement. Cal¬ 
culations employing symmetry conditions on the 
orbitals are described later for the allyl radical and 
for LiH. 

NORMALIZATION CONDITIONS 

In order for an orbital normalization to be sta¬ 
tionary, i.e., 

<^ + S^|^ + 8^>-<^|^> = 0, (23) 

we require (to first order) that {<f> \8</> ) = 0. This 
condition can be expressed as 

(0 (#l) ) T S = 0 (24) 
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in which is the fith column of the orbital 
transformation matrix, and S is the matrix of over¬ 
laps between basis functions. In terms of the con¬ 
straints vector c, this then becomes 

c(i + (/t-DwO = ((0^>) T S) ;/ 

i = l,..., m bas . (25) 

The elimination of the normalization parameters 
may thus be carried out in a very similar fashion 
to the elimination of the simple constraints de¬ 
scribed above. The slight difference in this case is 
that the condition is now on the update vector 
only, i.e., of the form c T 8x = 0. The normalization 
condition associated with the structure coefficients 
can be eliminated in an exactly analogous manner. 

ORTHOGONALITY CONSTRAINTS 

Although the CASVB procedures are primarily 
intended for the determination of intrinsically 
nonorthogonal orbitals, there are various situations 
in modern VB calculations in which it may be 
advantageous to enforce orthogonality between 
some of the orbitals. Such an example is given 
later for LiH. 

For the elimination of the constraint associated 
with (<l> \<f> v ) = 0 we write the update of orbital 
0^ in the form 8<f> = diafa + b<t> v L ), where 
(0J0/ ) = 0. Similarly, S0„ = <5(c0 M + d0^). The 
condition 


tion. This must be achieved without affecting the 
simple orbital constraints outlined above and so 
we have adopted what might best be termed a 
pairwise symmetric orthogonalization. For each or- 
thogonalization pair in turn, we solve 

+ A0J0„ + A0^> = 0 (28) 

for the two update vectors A 0^ and A 4> p . We have 
used the notation A 0„ to avoid confusion with the 
update predicted by the second-order model. 
The updates must be "allowed." For example, A 0^ 
must be orthogonal to constraints vectors involv¬ 
ing 0^ as well as to any orbitals 0^ for which 
(0^|0 ff ) = 0 has previously been enforced. Fur¬ 
thermore, it is useful to require that A 0 is orthog¬ 
onal to 0 . The natural choice for this update 
vector is clearly the projection of <f) v onto the 
allowed space, because this will require a minimal 
change to both orbitals (0 and 0 y ) to achieve 
orthogonality. If both orbitals have nonzero projec¬ 
tions on the opposite allowed space, we can solve 
for the value of c in 



(A0^ and A cj> v are here the projections of the other 
orbital). The special cases where the projection of 
either of 0^ or 0 t , is null are treated separately in a 
similar manner. 


+ + (26) 

requires, to first order in the orbital changes, that 
a = — c. In practical terms, this leads to a con¬ 
straints vector that is very similar to the normal¬ 
ization case: 

c(i + (ix- l)m bas ) = ((O'^/s),., 

c(z + (v - l)m bas ) = ((0 (ft) ) T S) ;/ 

i = l,...,m bas . (27) 

While this is not exact, because 0 ■** and 0/ will 
not in general be orthogonal, it does become a very 
good approximation close to convergence. The 
main effects of the approximation are likely to be 
on the global convergence characteristics, but in 
our applications so far this has not been a problem. 

Since the elimination of orthogonality con¬ 
straints is only approximate, the orthogonality 
must be reimposed after each update in an itera¬ 


IMPLEMENTATION 

We can see that all three types of constraint can 
be expressed as conditions on the update, taking 
the form 

cj 8x = 0, i = 1,..., N con . (30) 

The set of free variables of the problem can there¬ 
fore be found as the orthogonal complement of 
this set of N con vectors. We find it by a series of 
standard Gram-Schmidt orthogonalizations, as 
follows: 

1. The N con constraints vectors cj are first or- 
thogonalized, discarding any with a norm 
below an appropriate threshold. 

2. The full set of unit vectors is orthogonalized 
to the surviving constraints. 

3. The resulting vectors are finally orthogonal¬ 
ized, discarding any with a norm below the 
threshold. 
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The first step not only facilitates the evaluation of 
the orthogonal complement, but it also has the 
effect of eliminating any linear dependencies 
among the constraints vectors. The final orthogo- 
nalization not only establishes the minimum lin¬ 
early independent set of free parameters, but it 
also ensures that the transformation matrix L be¬ 
comes orthogonal. 

The orthogonal complement could of course be 
found in various other ways, but we have found 
that the above procedure, starting from a full set of 
unit vectors, tends to give a transformation matrix 
I of maximum sparseness. This can significantly 
influence the efficiency of the transformation of the 
Hessian embodied in Eq. (18). 


Results 

We have chosen three sets of examples to indi¬ 
cate some of the various ways in which the CASVB 
code can be used to carry out fully variational 
optimizations of nonorthogonal orbitals in modern 
VB wave functions: benzene, the ally! radical, and 
LiH. All of the calculations described here were 
carried out using the appropriate modules now 
available in the molpro [6] suite of programs. 

BENZENE 

For benzene (D 6h ) we adopted bond lengths 
r cc = 139.64 pm and r CH = 108.31 pm. The s/p 
basis functions for carbon and the s basis functions 
for hydrogen were taken from correlation consis¬ 
tent pVTZ basis sets [15]. These were augmented 
with polarization functions with exponents d c = 
0.8 and p H = 1.0, so that the basis sets for C/H 
consist of (10 s5 pi d/5slp) Cartesian GTOs con¬ 
tracted to [4s3pld/3sl p], A 6-in-6 CASSCF calcu¬ 
lation was performed for the tt electrons, keeping 
the 36 a electrons in an optimized closed-shell 
core. The nondynamical correlation energy incor¬ 
porated in this wave function is ~ 73 millihartrees 
(see Table I). By definition, the RHF wave function 
[core](a 2li ) 2 (c 1 ^) 4 accounts for none of this corre¬ 
lation energy. The weight of the configuration 
[core](fl 2w ) 2 (^i^) 4 i n the full CASSCF wave func¬ 
tion is 88.4%. 

Spin-coupled calculations, with an active space 
corresponding to the six tt electrons, were carried 
out (using the CASVB code) with a frozen core, 
taken directly from the CASSCF calculation. This 


TABLE I_ 

Calculated energies for benzene. 


Calculation 

hartrees 

E ^CAS’ 

millihartrees 

Proportion of 

^CAS “ ^RHF 

CASSCF 

-230.8368216 

0 

100% 

SC 

-230.8293315 

7.49 

89.7% 

SC (f.c.) 

-230.8293314 

7.49 

89.7% 

RHF 

-230.7640556 

72.77 

0% 


calculation, labeled SC (f.c.) in Table I, recovers 
nearly 90% of the nondynamical energy and the 
overlap with the CASSCF wave function exceeds 
0.995. The further energy lowering and change in 
S VB on optimizing the a "core" is negligible 
(calculation SC in Table I). 

Although no such constraints were imposed, 
and the calculations were actually performed in 
C 2v symmetry, we find six symmetry-equivalent 
spin-coupled tt orbitals that may be transformed 

A 

into one another by successive C 6 rotations. Num¬ 
bering the orbitals sequentially around the ring, 
the symmetry-unique overlap integrals are 
(4>i\4> 2 > = 0.524, = 0-029, and <^|0 4 > = 

— 0.157. Each orbital consists of a deformed C(2p 77 .) 
function associated with a given carbon atom, but 
it exhibits some deformation toward the neighbor¬ 
ing atoms in the ring (see Fig. 1). This outcome is, 
of course, entirely consistent with all of the previ¬ 
ous spin-coupled studies of this molecule [3, 
16-22]. The deformations of the orbitals play a 
crucial role; without them, it would be essential to 
incorporate also the plethora of ionic structures 
with double occupancy of one or more orbitals. 

Expressed in the Rumer basis [23], often em¬ 
ployed in classical VB calculations, the total spin 
function corresponds to weights of 40.6% each for 
the two Kekule structures and of 6.3% each for the 
three para-bonded ("Dewar") structures. Using the 
same set of orbitals, the energy change on restrict¬ 
ing the spin space to just one of the Kekule struc¬ 
tures corresponds to a resonance energy of 83.5 kj 
mol -1 . 

Hirao et al. [24] have adopted the same name, 
i.e., CASVB, for calculations in which they trans¬ 
formed the CASSCF wave function to a representa¬ 
tion based on orthogonal localized molecular or¬ 
bitals. In the case of benzene, they report weights 
of 7.8% each for the Kekule structures and of 2.6% 
each for the para-bonded structures. In their model, 
76.6% of the benzene ground-state wave function 
is ionic. Analogous conclusions, but by no means 
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FIGURE 1 . Various representations of a symmetry-unique spin-coupled tt orbital for benzene. (Left) Contours in the 
horizontal plane 1 bohr above the molecular plane. (Center) Contours in a vertical mirror plane. (Right) A representative 
three-dimensional contour (isosurface). 


quite so extreme, arise in classical VB descriptions 
based on strictly localized orbitals [25]. It should 
be clear why the present authors prefer modern 
VB calculations with fully-optimized nonorthogo- 
nal orbitals. 

The Chirgwin-Coulson weight [26] of in 
^cas 99.1%, so that spin-coupled ionic configu¬ 
rations amount to just 0.9% of the total wave 
function. Nonetheless, this very small component, 
with 1 — S VB < 0.005, accounts for some 10% of 
the nondynamical correlation energy, and so it is 
natural to analyze it further. We find that doubly 
ionic configurations are the more important con¬ 
tributors, with an accumulated Chirgwin-Coulson 
weight of 0.55%. This is in line with the expected 
diminished importance of singly ionic configura¬ 
tions in wave functions based on fully optimized 
nonorthogonal orbitals. Furthermore, from the 
dominance in the total wave function of the two 
Kekule-like configurations, we can expect singly 
ionic configurations with charges in ortho posi¬ 
tions to be the least important. This is borne out by 
the total weights of the various singly ionic config¬ 
urations, which decrease in the order meta > para 
> ortho. We find that charges in meta positions 
give a total weight of 0.32%, whereas para and 
ortho contributions are orders of magnitude 
smaller. 

ALLYL RADICAL 

As a test of the convergence characteristics of 
the CASVB code, we have reinvestigated the 2 A 2 
ground state of the allyl radical, C 3 H 5 , treating as 
active the three tt electrons. The spin-coupled VB 
approach (see also the discussion of LiH) has re¬ 


cently been employed by Oliva et al. [27] to study 
the doublet valence states and the n = 3 Rydberg 
excited states of the allyl radical below the first 
ionization potential (8.13 eV). In general, the exci¬ 
tation energies compare favorably with those ob¬ 
tained in analogous three-electron full Cl calcula¬ 
tions. 

The calculations in the present work were per¬ 
formed in C 2v symmetry, using the same geome¬ 
try as Oliva et al. [27] and the basis set they 
denoted basis II, except that the most diffuse d 
function (exponent 0.015) was placed on all three 
carbon atoms, rather than just on the central one. 
The total number of contracted Cartesian GTOs is 
120 (52A l7 39B x , 17B 2 ,12A 2 ). A 3-in-3 CASSCF cal¬ 
culation was performed for the tt electrons, with 
the 20 cr electrons accommodated in an optimized 
closed-shell core. Our various total energies are 
collected in Table II. 

As was also found in previous work [28], a fully 
variational spin-coupled calculation with the same 
choice of active and inactive spaces generates a 
total wave function that is identical to that from 
the CASSCF, albeit in much more compact form. 
One of the spin-coupled orbitals, </> 3 , takes the 


TABLE II_ 

Calculated energies for the allyl radical ( 2 A 2 ). 


Calculation 

E, hartrees 

E ^CAS 
millihartrees 

3-in-3 CASSCF 

-116.522760 

0 

SC: antipair orbitals 

-116.522760 

0.00 

SC: localized orbitals 

-116.521438 

1.32 

RHF 

-116.483356 

39.40 
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form of a deformed C(2p 7r ) function associated 
with the central C atom, but with deformations 
toward the terminal C atoms. The remaining spin- 
coupled orbitals, < p x and < p 2 , adopt antipair form, 
as is shown in the top row of Figure 2: they 
resemble in- and out-of-phase combinations of de¬ 
formed C(2p 77 ) functions associated with the ter¬ 
minal C atoms. The corresponding electron spins 
are predominantly, but not exclusively, triplet cou¬ 
pled. Expressed in the Kotani or Serber bases [23], 
the mode with triplet coupling of these two spins 
accounts for 99.1% of the total spin function. The 
only nonzero overlap is ((/> 1 l^ 3 ) = 0.744. 

It proved possible in a previous study [28] to 
locate also a rival spin-coupled solution in which 
4> 1 and </> 2 adopt more localized forms, in which 
they are equivalent with respect to reflection in the 
<j v mirror plane perpendicular to the molecular 
plane. In a somewhat smaller basis set, and with¬ 
out full optimization of the a core, the energy of a 
solution based on localized orbitals was found to 
be inferior to that of the antipair solution by 1.5 
millihartrees [28]. 

In the present work, we used the CASVB code 
to optimize a constrained spin-coupled wave func¬ 
tion in which <p 1 and </> 2 are permuted by reflec¬ 
tion in the cr v mirror plane, but <p 3 is invariant to 


this operation. The doubly occupied core orbitals 
were fully optimized simultaneously with the 
spin-coupled tt orbitals. The convergence charac¬ 
teristics of the constrained optimization procedure 
were found to be excellent. The resulting spin-cou¬ 
pled 7 t orbitals are shown in the bottom row of 
Figure 2. No constraints were applied to the spin 
space, but it turns out that in order to achieve the 
correct overall 2 A 2 symmetry the spins associated 
with orbitals </> x and <fi 2 are required to be strictly 
triplet coupled. The nonzero overlaps are 3 ) 
= <</> 2 |</>3> = 0.501 and <<hltf> 2 > = -0.184. The 
difference in energy from the antipair solution 
(1.32 millihartrees, see Table II) is similar to the 
value found previously [28] and arises from the 
removal of all the spin degrees of variational free¬ 
dom. Convergence to this localized orbital solution 
is only possible with the appropriate symmetry 
constraints; without them, this solution corre¬ 
sponds only to a saddle point in the full spin-cou¬ 
pled variational space. 

LiH 

It is useful to describe first the key features of a 
previous spin-coupled VB study of the X*E + 
ground state of LiH [29]. The spin-coupled wave 




FIGURE 2. Spin-coupled tt orbitals for the 2 A 2 ground state of the allyl radical, shown as contours in the horizontal 
plane 1 bohr above the molecular plane. 
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function may be written in the form 


Vsc 


0102 03 04 52 C 0k®00;k ~ {0102 03 04 ^ 

k=l 


(31) 


with four singly occupied nonorthogonal orbitals. 
For four electrons with a net spin of zero, the full 
spin space [23] consists of just two linearly inde¬ 
pendent modes of spin coupling. The two spin¬ 
coupling coefficients (c ok ) were optimized simulta¬ 
neously with the orbitals, without any constraints 
that would alter the total wave function. 

The converged spin-coupled orbitals all possess 
a symmetry. At large nuclear separations, they 
correspond to Li(lsls'2s) and H(ls). We found 
that the two "core" orbitals, and the overlap be¬ 
tween them (~ 0.93), remain essentially un¬ 
changed as the atoms approach. The H(ls) func¬ 
tion also changes relatively little. On the other 
hand, dramatic changes occur in the orbital ini¬ 
tially associated with Li(2s). As the atoms ap¬ 
proach, electron density is transferred from lithium 
to the hydrogen atom, in the form of a rather 
diffuse deformed H(ls) function. For nuclear sepa¬ 
rations around R e , the form of the two valence 
orbitals implies considerable Li + Fi _ character [29]. 
Perfect pairing remains the dominant mode of spin 
coupling for all values of R. 

At convergence, each occupied orbital gives rise 
to a "stack" of orthonormal virtual orbitals, deter¬ 
mined in an effective field which excludes the 
occupied orbital. Virtual orbitals in different stacks 
are not in general orthogonal to one another. Fur¬ 
ther spatial configurations were generated by 
means of excitations from valence spin-coupled 
orbitals (cr 3 and cr 4 ) into corresponding virtual 
orbitals, and the corresponding wave functions. 


total energies and dipole moments were obtained 
in very compact nonorthogonal Cl calculations. A 
suitably constructed series of such spin-coupled 
VB expansions [29] exhibited very rapid conver¬ 
gence onto a dipole moment function previously 
calculated by means of accurate MO Cl calcula¬ 
tions. 

The calculations in the present work were per¬ 
formed in C 2v symmetry using the density-matrix 
averaged atomic natural orbital basis sets of Wid- 
mark et al. [30]. For Li/FI, these sets consist of 
(14s9p4d3//8s4p3d) spherical GTOs contracted to 
[6s5p3d2//4s3p2d], generating a total of 73 GTOs 
(32A 1 ,17B 1 ,17B 2 ,7A 2 ). A full-valence CASSCF 
calculation for this molecule involves just two ac¬ 
tive electrons, with the orbitals partitioned: 

■ Optimized core (double occupancy): 1 a x (i.e., 
lcr) 

■ Active (variable occupancy): 2a x -4a v 1 b v 
and lb 2 (i.e., 2o—4a and 17r) 

■ Virtual (unoccupied): 5fl 1 -32fl 1 , 2b 1 -17b 1/ 
2b 2 -17b 2 , 1 a 2 -7a 2 

A two-electron full-CI calculation, with the doubly 
occupied 1 a x orbital taken from the full-valence 
CASSCF wave function, requires fairly trivial com¬ 
puting resources, and gives a clear indication of 
the best that we can expect to achieve in frozen-core 
calculations with this basis set. 

Computed equilibrium geometries, together 
with the corresponding total energies and dipole 
moments, are collected in Table III. Starting from 
the RHF wave function, the single largest energy 
improvement corresponds to the radial correlation 
introduced by opening up the 3a 1 orbital (i.e., 3<r). 
Almost as large is the angular correlation intro¬ 
duced by opening up the degenerate 1 b^ and 1 b 2 


TABLE III 


RHF, CASSCF, and full-CI equilibrium geometries for LiH. 


Active orbitals 

Inactive space 

«e. A 

E, hartrees 

/x, Debyes 

(RHF) 

— 

1.606 

-7.987257 

6.02 

2a 1 ,3a 1 

(la.,) 2 

1.634 

-8.003784 

5.74 

2a 1 ,3a 1 ,4a 1 

(la,) 2 

1.633 

-8.009548 

5.88 

2a 1s 1 b v 1 b 2 

(la,) 2 

1.589 

-8.001320 

5.98 

2a 1 ,3a 1 ,1 b v ^b 2 

(la,) 2 

1.611 

-8.015914 

5.78 

2a 1 ,3a 1 ,4a 1 , ^b^,^b 2 

(la,) 2 

1.611 

-8.021112 

5.89 

(full Cl) a 

(la,) 2 

1.608 

-8.023861 

— 

a Doubly occupied ^a■^ 

core orbital taken from the corresponding full-valence CASSCF. 
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orbitals (i.e., I 77 ). The full-valence CASSCF calcu¬ 
lation gives a value of R e that is very close to that 
from the two-electron full Cl, and the minimum 
energies are within 3 millihartrees of one another. 
This full-CI calculation gives an equilibrium bond 
length of 1.6076 A, and all subsequent calculations 
were carried out at this geometry. The experimen¬ 
tally determined value of R e is 1.595 A [31]. 

Energies from various four-electron CASSCF 
calculations are collected in Table IV, for a nuclear 

0 

separation of 1.6076 A. Comparing with the corre¬ 
sponding two-electron CASSCF calculations at the 
same geometry, it is clear that opening up the la 1 
core orbital results in a fairly dramatic energy 
lowering. There are of course further energy lower¬ 
ings on opening up also the 5a v 6 a v 2 b v and 2 b 2 
orbitals, and it is clear that the all-electron full-va¬ 
lence CASSCF wave function is still far from the 
corresponding full Cl. Note that three of the 
CASSCF calculations have been labeled A, B, or C 
in Table IV in order to facilitate the subsequent 
discussion. 

As we have indicated, computationally the most 
inexpensive use of the CASVB code is to generate 
modern VB representations of CASSCF wave func¬ 
tions by means of the overlap criterion [Eq. (7)]. 
Applying this criterion to CASSCF calculation A 
leads to a representation in which the total wave 
function is overwhelmingly dominated by a spin- 


coupled-like component {<^> 1 c/> 2 <^> 3 <^> 4 }. The overlap 
between this dominant component and the full 
wave function (S VB , see Table V) is very close to 
unity. The energy E VB corresponding to the spin- 
coupled-like component accounts for all but ~ 7.3 
microhartrees of the total CASSCF energy. This 
small difference could of course be reduced if the 
orbital transformation were chosen instead using 
the energy criterion [Eq. (8)]. A fully variational 
calculation for the spin-coupled configuration 
{(pi 4*2 $ 3 provides an energy within 7.3 micro¬ 
hartrees of that provided by the CASSCF calcula¬ 
tion. All four spin-coupled orbitals were found to 
be of pure cr symmetry (i.e., a a ). 

For calculation B, we considered the three-con¬ 
figuration modern VB wave function 





(/>l<^ 2 03 04 Yj 
k= 1 


c F ) 4 , 


Q^( < ^l ( / ) 2 < / ) 5 ( / ) 5®pp) 

+ C 2 s/[^4> 2 <t> b <f) b (dl p ) 


(32) 


in which © 4 p is the four-electron perfect-pairing 
spin function. There can, of course, only be one 
mode of spin coupling for the configurations with 
a doubly occupied orbital (<p 5 or <£ 6 ), on account of 
the Pauli principle. Orbitals <Pi~<p A were con¬ 
strained to be of cr symmetry (i.e., a^), and orbitals 


TABLE IV_ 

RHF CASSCF, and full-CI results for LiH at 1.6076 A. 


Active orbitals 

Inactive space 

E, hartrees 

Debyes 

(RHF) 

— 

-7.987257 

6.03 

Two active electrons: 




2a 1 ,3a 1 

(la,) 2 

-8.003710 

5.70 

2a 1 ,3a 1 ,4a 1 

(la,) 2 

-8.009480 

5.84 

2a 1s 1b l5 1 b 2 

(la,) 2 

-8.001275 

6.02 

2a 1 ,3a 1 , 1b.,, 1 b 2 

(la,) 2 

-8.015913 

5.77 

2a 1 ,3a 1 4a 1 1b 1f 1 b 2 

(la,) 2 

-8.021111 

5.88 

(full Cl) a 

(la,) 2 

-8.023861 

— 

Four active electrons: 




A (1-4)a 1 

— 

-8.016846 

5.71 

B (1-4)a 1s 1b.,, 1b 2 

— 

-8.029067 

5.78 

(1-4)a 1s J \b v 2b v 1b 2 , 2b 2 

— 

-8.035753 

5.76 

(1-5)a l5 1b l5 1b 2 

— 

-8.034309 

5.88 

(1 —5)a-,, 1b.,, 2b l5 1b 2 , 2b 2 

— 

-8.035003 

5.88 

C (1 —6)a n , lb-,, 1b 2 

— 

-8.038302 

5.88 

(1-6)3^ 1b 1; 2b.,, 1b 2 ,2b 2 

— 

-8.044885 

5.87 


a Doubly occupied la-, core orbital taken from the corresponding full-valence CASSCF. 
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TABLE V_ 

Various CASSCF and CASVB results for LiH, as described in the text. 3 



CASSCF 


Overlap criterion 


Variational MCSC 

£ cas , hartrees 

1 s VB 

£ vb , hartrees 

A£, ^hartrees 

£mcsc< hartrees 

mcsc» Debyes 

A 

-8.016846 

5.3 X 10~ 7 

-8.016839 

7.3 

-8.016839 

5.71 

B 

-8.029067 

2.3 X 10“ 6 

-8.029048 

19.2 

-8.029048 

5.78 

C 

-8.038302 




-8.033347 

5.86 


a The labels in the first column correspond to those in Table IV. 


0 5 and 0 6 to be a tt pair (i.e., b 1 and b 2 ). Using 
the overlap criterion [Eq. (7)], we may generate an 
alternative representation of the CASSCF wave 
function in which a component of the form of Eq. 
(32) is overwhelmingly dominant. The optimal 
value of S VB (see Table V) is remarkably close to 
unity, and the corresponding expectation value of 
the energy is only 19.2 microhartrees inferior to 
that from the full CASSCF calculation. A fully 
variational optimization of the multiconfiguration 
spin-coupled (MCSC) wave function ^P B leads to 
an energy that is 19.0 microhartrees inferior to 
CASSCF B. 

In the case of CASSCF calculation C, we pro¬ 
ceeded directly to the fully variational optimiza¬ 
tion of a four-configuration MCSC wave function 
of the form 



C x sf 


2 ' 

< p l < p 2 < p 3 < p 4 52 C 0k®0Q;k 

k= 1 / 


+ 


C 2 ^l 


^ 1^2 ^5 ^6 12 C 0k®Q0;k 

k = 1 


T C 3 Ssf(^(f) 1 (f) 2 (/>7<^ 7 ©pp) 
+ 0102 08 ) * 


(33) 


Orbitals were constrained to be of a x sym¬ 

metry, and orbitals 0 7 and 0 8 to be the b x and b 2 
pair. In order to maintain some distinction be¬ 
tween {0 1 0 2 0 3 0 4 } anc * (0102 05 06)/ we COn " 
strained 0 5 and 0 6 , to be orthogonal to both of 0 2 
and 0 2 . Such a calculation, which utilizes numer¬ 
ous features of our constrained optimization pro¬ 
cedure, converges rapidly to a total energy within 
about 5 millihartrees (see Table V) of that from the 
full CASSCF wave function. 

We find that the converged orbitals 0^04, and 
the overlaps between them (see Table VI), are very 
similar to those obtained from the single-config¬ 
uration spin-coupled calculation [Eq. (31)] and the 
three-configuration MCSC calculation [Eq. (32)]. 
The overlaps of 0 3 and 0 4 with 0 5 and 0 6 , turn 
out to be rather small (see Table VI), and this 
suggests that it could be a reasonable approxima¬ 
tion in future work to restrict orbitals 0 5 and 0 6 to 
lie in the orthogonal complement of orbitals 0 a -0 4 . 
As might be anticipated, the dominant component 
of all three modern VB wave functions, ^sc 
and T r c , is the combination of the perfect-pairing 
spin function with the orbital string 0102 03 04 - 

Multiconfiguration spin-coupled calculations for 
LiH were first performed by Pyper and Gerratt 
[32]. The Pyper-Gerratt (PG) wave function for 


TABLE VI_ 

Overlaps between the nonorthogonal orbitals in the MCSC wave function 0 C [Eq. (33)]. 



01 

02 

03 

04 

05 

<t>6 

07 

08 

4> i 

1 

0.926 

0.123 

0.064 

0 

0 

0 

0 


0.926 

1 

0.197 

0.096 

0 

0 

0 

0 


0.123 

0.197 

1 

0.774 

0.047 

-0.028 

0 

0 


0.064 

0.096 

0.774 

1 

0.066 

0.006 

0 

0 


0 

0 

0.047 

0.066 

1 

0.585 

0 

0 


0 

0 

-0.028 

0.006 

0.585 

1 

0 

0 


0 

0 

0 

0 

0 

0 

1 

0 


0 

0 

0 

0 

0 

0 

0 

1 
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this molecule takes the form 

%G = Cj{ + C 2 £ {< T l°2 77 a 7T a}- ( 34 ) 

a = x, y 

The orbitals were fully optimized according to 
their designated symmetry properties simultane¬ 
ously with the C f coefficients, but the mode of spin 
coupling was restricted to perfect pairing. It should 
be clear that the appropriate all-electron CASSCF 
wave function with which to compare is the one 
with an active space consisting of (1-4)^, (1-2 )b 1 , 
and (1-2 )b 2 . According to the energies listed in 
Table IV, the energy improvement of the PG wave 
function relative to the spin-coupled configuration 
should be considerable. Penotti [33] has recently 
developed a full generalization of the PG method 
in which the orbitals, spin-coupling coefficients, 
and structure coefficients are fully optimized si¬ 
multaneously. The computational demands turn 
out to be very much larger than those of the 
approach we have presented here. Penotti has also 
considered simultaneous optimization of the basis 
set [33]. 

An attractive alternative to the exact optimiza¬ 
tion of MCSC wave functions has recently been 
explored by Clarke et al. [34], who examined a 
four-configuration description of the form 

''I'scvrs* = C 1 {a 1 a 2 cr 3 a i } + 

+ C 3 £ {o'lO^cX)- (35) 

«“*/ y 

In the spirit of a spin-coupled VB expansion [29], 
orbitals were taken from a standard spin- 

coupled calculation, without further optimization. 
In order to overcome convergence problems, spin- 
coupled orbitals cq-cq were projected out of the 
ra bas basis functions, and Gram-Schmidt orthogo- 
nalization was used to extract a set of m bas - 4 
linearly independent functions. Orbital a 5 was 
then expressed as a linear combination of <r v cr 2 , 
<t 4 , and the linearly independent projected func¬ 
tions. Similarly, orbital cr 6 was expressed as a 
linear combination of a v cr 2 , cr 3 , and the linearly 
independent projected functions. In this form, or¬ 
bitals cr 5 and cr 6 were optimized simultaneously 
with 7 T a and Tr' a (a = x, y) and the Q coefficients, 
but the spin-coupling coefficients in each configu¬ 
ration were fixed to be the same as those in the 
spin-coupled wave function {a-^c^cq}. The main 
simplifications, which should make the approach 
viable also for systems with more active electrons. 


are the use of a second-order perturbation theory 
expression for the energy and an approximation to 
the required elements of the Hessian. Of course, 
once the orbitals have been (approximately) opti¬ 
mized in this way, they may be used to construct a 
variational spin-coupled VB wave function, relax¬ 
ing also the constraints on the spin-coupling coeffi¬ 
cients, but without further orbital optimization. 
The various CASSCF and fully variational MCSC 
calculations described in the present work should 
prove to be useful benchmarks for assessing the 
accuracy and computational demands of this type 
of approach, and further results will be published 
in due course. The most appropriate all-electron 
CASSCF wave function with which to compare 
^scvb* is, °f course, the one with an active space 
consisting of (1-6 )a v (1-2 )b u and (1-2) b 2 . 


Summary and Conclusions 

We have outlined the CASVB approach for the 
fully variational optimization of general types of 
modern valence bond wave function, and we have 
presented a simple algorithm for the elimination of 
redundant or constrained parameters from the ap¬ 
propriate second-order optimization problems. The 
CASVB code may also be used to generate very 
compact modern VB representations of CASSCF 
wave functions, with the overlap criterion [Eq. (7)] 
being relatively inexpensive. 

Various ways in which the CASVB procedures 
may be used were illustrated by means of selected 
results for benzene, the allyl radical, and LiH, 
including the incorporation of symmetry condi¬ 
tions and the generation of fully variational multi¬ 
configuration spin-coupled (MCSC) wave func¬ 
tions. We have also established useful benchmarks 
for assessing the accuracy and computational de¬ 
mands of various other MCSC approaches that are 
currently being pursued [33, 34]. 

Since the modern VB wave function in the 
CASVB approach can be viewed as a constrained 
form of CASSCF wave function, a multitude of 
techniques immediately become available for 
modern VB methods, including the incorporation 
of dynamical correlation and the direct, on-the-fly 
evaluation of integrals over basis functions. In the 
case of the implementation of CASVB in molpro 
[6], the CASSCF part of the code provides for the 
efficient evaluation of energy gradients, enabling, 
for example, geometry optimizations with fully 
variational, core-optimized MCSC wave functions. 
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ABSTRACT: We have implemented a linear scaling, fully self-consistent density- 
functional method for performing first-principles calculations on systems with a large 
number of atoms, using standard norm-conserving pseudopotentials and flexible linear 
combinations of atomic orbitals (LCAO) basis sets. Exchange and correlation are treated 
within the local-spin-density or gradient-corrected approximations. The basis functions 
and the electron density are projected on a real-space grid in order to calculate the 
Hartree and exchange-correlation potentials and matrix elements. We substitute the 
customary diagonalization procedure by the minimization of a modified energy 
functional, which gives orthogonal wave functions and the same energy and density as 
the Kohn-Sham energy functional, without the need of an explicit orthogonalization. The 
additional restriction to a finite range for the electron wave functions allows the 
computational effort (time and memory) to increase only linearly with the size of the 
system. Forces and stresses are also calculated efficiently and accurately, allowing 
structural relaxation and molecular dynamics simulations. We present test calculations 
beginning with small molecules and ending with a piece of DNA. Using double-z, 
polarized bases, geometries within 1 % of experiments are obtained. © 1997 John Wiley & 
Sons, Inc. Int J Quant Chem 65: 453-461, 1997 


Introduction 

T he ability to perform reliable simulations of 
molecules and solid systems with thousands 
of atoms has enormous payoffs in many fields, 
especially for the materials and pharmaceutical 

Correspondence to: J. M. Soler. 


industries. With the rapid increase of computa¬ 
tional power, the emphasis is changing from 
semiempirical potentials and force fields to ab 
initio quantum mechanical methods [1]. However, 
a major problem of these methods is that, in gen¬ 
eral, their computational cost scales as N 3 , with 
the number of atoms N. In the last few years, 
much effort has been devoted to overcome this 
problem, and a number of methods have been 
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developed with "order-N" [O(N)] scaling, i.e., 
whose computational cost scales only linearly with 
the number of atoms [2-7]. Most of these efforts 
and methods are based on simplified, non-self- 
consistent, tight-binding or Harris-functional mod¬ 
els, although the first steps toward self-consistent 
methods have also been made, using Gaussian 
orbitals [8] or a real-space grid to define the elec¬ 
tron wave functions [9]. These last methods are 
very demanding in computer time and memory, 
requiring large computers and becoming competi¬ 
tive only for extremely large systems. 

We have developed a fully self-consistent den¬ 
sity-functional method, based on flexible linear 
combination of atomic orbitals (LCAO) basis sets, 
with actual and efficient O(N) scaling. It allows 
from extremely fast simulations, on the one hand, 
using minimal basis sets, to very accurate calcula¬ 
tions, on the other hand, using expanded bases 
(multiple-z, polarization, off-site orbitals), the 
choice depending on the size of the system, re¬ 
quired accuracy, and available computational 
power. In previous studies [10, 11] we have de¬ 
scribed preliminary versions of this method, where 
the essentials of the linear scaling were already put 
forward (see Fig. 2 in Ref. [10]). The main im¬ 
provements that we describe in this work are: (i) 
the flexibilization of the basis sets to include essen¬ 
tially any desired functions and (ii) the imple¬ 
mentation of new features, like spin polarization, 
generalized gradient corrections (GGA) ex¬ 
change-correlation, or the calculation of the stress 
tensor. 


Basic Approximations 

Apart from that of Born and Oppenheimer, the 
most basic approximations used here concern the 
treatment of exchange and correlation, and the 
elimination of core electrons, which are replaced 
by pseudopotentials. Exchange-correlation is 
treated within the framework of the Kohn-Sham 
[12] version of density-functional theory. We allow 
for both the local (spin) density approximation 
(LDA/ LSD) [13], with several possible parametri- 
zations, or the generalized-gradient-corrections 
(GGA) approximation, in the fully ab initio (non- 
empirical) version of Perdew, Burke, and Ernzer- 
hof [14]. 

Since we calculate on a real-space grid (see 
below), we use standard norm-conserving pseu¬ 


dopotentials [15] to avoid the computations for 
core electrons and to smooth the valence charge 
density. These pseudopotentials are transformed 
to their Kleinman-Bylander form [16] for im¬ 
proved simplicity and efficiency. This implies that 
only two-center integrals are required to calculate 
the nonlocal part of the pseudopotential matrix 
elements. The nonlinear partial-core correction, for 
the treatment of exchange and correlation in the 
core region [17], is also allowed to treat special 
cases, like alkaline atoms. 


Basis Sets 

In order to have a finite range for the matrix 
elements, one can either neglect them beyond a 
given distance or use confined basis orbitals, i.e., 
orbitals that are strictly zero beyond a certain 
radius [18]. We have adopted this last approach 
because it keeps the energy strictly variational, 
thus facilitating the test of the convergence with 
respect to the radius confinement. 

The algorithms used in our method (explained 
in the following) perform all the required calcula¬ 
tions numerically. No analytical expression of any 
kind is needed. We are, therefore, not bound to 
Gaussians, and adjusting the radial wave function 
to the core region (pseudopotential) or to other 
features is easy and inexpensive. Multiple-z bases 
can be introduced, and also polarization and off¬ 
site orbitals (orbitals centered anywhere). Some 
procedures are implemented for the generation of 
the basis starting from the solution of the separate 
atomic problems with the corresponding pseu¬ 
dopotentials, but the user can introduce his/her 
favorite basis, too. 

In the case of a minimal basis set, we have 
found convenient and efficient the method of 
Sankey and Niklewski [18]. Their basis functions 
are the (angular-momentum-dependent) numerical 
eigenfunctions of the atomic pseudopotential, for 
an energy chosen so that the first node occurs at 
the desired cutoff radius r c (which may depend on 
the atomic species and angular momentum). Be¬ 
yond that radius, the basis functions are zero. The 
effect of the confinement is shown below. 

One obvious possibility for multiple-z bases is 
obtained by using the set of pseudopotential 
eigenfunctions with an increasing number of nodes. 
This set has the virtue of being asymptotically 
complete. The efficiency, however, of this kind of 
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bases depends on the radii of confinement of the 
different orbitals, since the excited states of the 
pseudopotential problem are usually unbound. In 
the following some examples are shown of the 
results on molecular geometries when using this 
kind of bases, changing the confinement radii vari- 
ationally. 

Split valence bases are also easily generated by 
incorporating suited Gaussian orbitals [19] as extra 
basis functions, on top of the minimal basis de¬ 
fined above. In general, we find that double-z plus 
single-z-polarization bases reproduce the molecu¬ 
lar geometries obtained with a planewave basis set 
within 1% in the cases that we have studied. 
Nevertheless, it is well known that the election of 
good basis sets is a delicate task, and lots of efforts 
have been devoted to it. It must be considered, 
however, that the use of pseudopotentials and 
numerical (but confined) functions makes previous 
experience on basis sets not directly applicable. 


Matrix Elements 

Within the nonlocal-pseudopotential approxi¬ 
mation, the standard Kohn-Sham Hamiltonian 
may be written as 

H = T + £ [ V f local (r) + V, nl ] + V H (r) + V xc (r), 

I 

( 1 ) 

where I is an atom index, Im are angular momen¬ 
tum quantum numbers, T = p 1 /2m is the kinetic 
energy operator, V H (r) and V xc (r) are the Hartree 
and exchange-correlation potentials, and 

v? = L\4>ZK B «l\ ( 2 ) 

Im 

is the nonlocal (nl) part of the pseudopotential, in 
its Kleinman-Bylander (KB) form. <f )are the 
so-called Kleinman-Bylander projectors, which 
obey 

= [V n (r) - V, local (r)](3) 
V; toCaI |^ ln >- 1 / (4) 

where ijs Um (r) are the atomic pseudoorbitals, i.e., 
the solutions to the radial Schrodinger equation 
with potentials V u (r) (without any cutoff radius). 
Since V n - V/ ocal is nonzero only inside a small 


pseudopotential core radius rj core , the functions 
4 >fjf n are also zero outside this radius. 

In order to eliminate the long range of V local , 
we screen it with a sum p atom (r) of atomic electron 
densities pf om (|r - r 7 |), constructed by populating 
the basis functions with appropriate valence atomic 
charges. Notice that, since the atomic basis orbitals 
are zero beyond r c , the screened "neutral-atom" 
potentials V™ are also zero beyond this radius 
[18]. Let now 8 p( r) be the difference between the 
self-consistent electron density p(r) and the sum of 
atomic densities p atom (r), and let <5V H (r) be the 
Hartree potential generated by 8 p( r). Then the 
total Hamiltonian may be rewritten as 

H = T + £ V z nl + £ V, na (r) + 8 V H ( r) + V xc (r). 

i i 

(5) 

The matrix elements of the first two terms involve 
only two-center integrals which are easily calcu¬ 
lated in reciprocal space and tabulated as a func¬ 
tion of interatomic distance [18]. This is done at 
the beginning of the calculation with great accu¬ 
racy, representing a very small computational ef¬ 
fort. 

The remaining terms involve potentials which 
are calculated in a real-space grid. The third term 
is a sum of short-range screened pseudopotentials 
which are tabulated as a function of the distance to 
the atoms and easily interpolated at any desired 
grid point. The last two terms require the calcula¬ 
tion of the electron density at the grid. To do this, 
we first find the atomic basis orbitals at the grid 
points, also by interpolation from numerical tables. 
Then the electron density is given by 

P( r) = £p^^W^(r), (6) 

(JLV 

where (i , v run over all the atomic basis orbitals 
<^, and p^ v is the one-electron density matrix in 
this basis, obtained as explained below. Since the 
method is aimed at molecules and large solid 
supercells, Brillouin zone integrals for repeated 
structures are approximated by the T point (k = 0), 
so that all the wave functions can be made real. 
Notice that only overlapping orbital pairs con¬ 
tribute to the sum at their intersection, so that the 
calculation of the density can be performed in 
CXN) operations using sparse-matrix multiplica¬ 
tion techniques. Once the density is available in 
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the grid, we solve Poisson's equation with a fast 
Fourier transform to obtain 8 V h (t), adding it to 
V na (r) and V xc (r). The matrix elements of these 
three terms are then computed by integration in 
the same real space grid, also in O(N) operations. 


Total Energy and Forces 


overlap [20], the last term in (9) may be written as 

1 _ e 2 Z,Z r 1 , i 

- E 7— = ~ E f Vi >ocal (r)pj ocal (r) d 3 r 

l y |r j r j\ l jj J 

-Ei// local , (id 

I 

where 


Let us forget for the moment that we want an 
order-N method. Then, once the Kohn-Sham 
Hamiltonian has been obtained, we can solve the 
eigenvalue problem to find the electron eigenfunc¬ 
tions in the LCAO basis 


LT/ocai = j y/ ocaI ( r ) P/ local (r)47rr 2 dr. (12) 

Taking into account that p na = P local + patom^ 

(9) can be transformed, after some manipulations, 
into 


!'/';> = Ec,J^> (7) 

fX 

and the density matrix 

Ppv = E^,c, m c, v , (8) 

i 


£ tot = E/v( t m.' + O 

jXV 

1 „ t 

+ -II V, na (r) P/ na (r) d 3 r - E !i/ ocal (13) 

1 ii J i 

+ f V na (i ) 8p(i)d 3 t 


where are occupation weights which, at T = 0, 
will be either zero or two (spin polarization is not 
considered in this discussion for simplicity). The 
electron density can be obtained from Eq. (6) and 
the Kohn-Sham total energy can then be written 
as 


£ tot = - -1 / V H (r)p(r) d 3 x 

\xv 

+ j (e xc (r) - v xc (r))p(r) d 3 r 

1 e 2 Z,Z, 

+ 7^r~ LJ ~r 

2 !} Ir, - fj| 


(9) 


where Zj are the valence pseudoatom charges and 
€ xc (r) P (r) is the exchange-correlation energy den¬ 
sity. In order to avoid the long-term interactions of 
the last term, it is convenient to define the "local- 
pseudopotential charge density" P/ local as that 
which produces an electrostatic potential equal to 


P / oca (r) —-V 2 y/ ocal (r). 

477 6 


( 10 ) 


This positive charge density is confined within 
r/ ore and produces a potential eZj/r outside. 
Therefore, assuming that the core radian do not 


+ — j 8V H ( r) 8 p( r) d 3 r + J e xc (r)p( r) d 3 r. 

(14) 

The first two terms involve two-center integrals 
which are interpolated from initially calculated 
tables, as described previously, and the third term 
is found trivially from (12). The last three terms 
are calculated using the real space grid. 

Apart from getting rid of all long-range poten¬ 
tials [except that implicit in <5V H (r)], the advan¬ 
tage of (13) is that, except for the relatively slowly 
varying exchange-correlation energy density, the 
grid integrals involve 8 p( r), which is generally 
much smaller than P (r). Thus, the errors associated 
with the finite grid spacing are drastically re¬ 
duced. Also critical is that the kinetic energy ma¬ 
trix elements can be calculated almost exactly, 
without any grid integrations. Figure 1 presents 
the convergence of the total energy for a diamond 
supercell of 64 atoms as a function of the plane- 
wave cutoff associated to the grid [ E cut (Ry) = (77/ 
a) 2 , with a being the separation between grid points 
in atomic units], used to represent the electron 
density [21]. This convergence is considerably faster 
than that typical of planewave calculations, which 
require a larger cutoff to represent accurately the 
kinetic energy. 

Atomic forces and stresses are obtained by di¬ 
rect derivation of (13) with respect to atomic posi- 
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FIGURE 1 . Convergence of the total energy versus 
grid cutoff for a supercell of diamond with 64 C atoms. 

tions and obtained simultaneously with E tot . Com¬ 
plete formulas will be given elsewhere and here 
we provide only a brief sketch. Thus, dT /dx l can 
be obtained by straightforward derivation of the 
interpolating functions which represent T^, and 

d r or dpi*) „ 

- fe xc (r)p(r) d 3 r = fv xc ( r)- d 3 r 

dx l J J dr l 

= -2Ep / ,,,XVc/ v |y«|^>, 

fXiV 

(15) 


Order-iV Functional 

Once the Kohn-Sham Hamiltonian matrix in 
the LCAO basis has been computed for a given 
charge density, we seek to solve it in order-N 
operations. For that purpose, we must avoid the 
N 3 scaling of the eigenvalue problem, which ap¬ 
pears if either direct diagonalization or standard 
iterative solution are performed (in the latter case 
because of the explicit orthogonalization step nec¬ 
essary during the energy minimization). To avoid 
this problem, several methods have been proposed 
in the last years. Here we use the approach pro¬ 
posed by Mauri et al. [3] and by Ordejon et al. [4, 
5], although any other linear scaling procedure 
could in principle be used. The method of Mauri 
and Ordejon contains two basic ingredients. The 
first is the calculation of the wave functions ijy 
describing the ground state of the system by the 
minimization of a modified energy functional, 
which does not require an explicit orthogonaliza¬ 
tion step, but for which the wave functions be¬ 
come automatically orthonormal at the minimum. 
More generally, we use a modification of the 
Mauri-Ordejon functional, which was proposed 
by Kim it et al. [6]. This functional takes the form: 

M 

E = 2 E (Hij - tjS ;/ )(25, 7 - S,,) + tjN, (17) 
*/ J 


where index runs only through the basis or¬ 
bitals of atom J. Similar expressions are obtained 
for the derivatives of the other integrals in (13). 
Except for the gradient in front of <f> , the matrix 
elements involved in the contribution to the forces 
of the last three terms of (13) are the same as those 
of the Hamiltonian, and are calculated by the same 
techniques. The calculation of the forces as deriva¬ 
tives of the energy assures that Pulay corrections 
are automatically included. The stress tensor can 
also be calculated simultaneously with the forces, 
since it involves the product of each force contri¬ 
bution times a distance. For example. 







(16) 


where e xy is the xy component of the strain tensor 
and x LJ and y Y] are the x and y components of 
the vector between atoms I and /. 


where the two in front of the sum is for spins, N is 
the number of electrons, and H iy = (fa\H\ipj), S /; - 
= (fa | ipj ) are the Hamiltonian and overlap ele¬ 
ments between the wave functions. The sum is 
done over a set of functions equal or larger than 
the number of occupied states (N/2), and the total 
charge is controlled by the Fermi level parameter 
7 ]. The minimization of this functional with respect 
to variations of the wave functions fa (in terms of 
their expansion in the LCAO basis) provides the 
exact ground state of the given H (in the space 
spanned by the basis set) and orthonormal wave 
functions [6]. 

The second ingredient of the order-N solution is 
the restriction of the minimization of the func¬ 
tional in Eq. (17) to wave functions which are 
localized in space. These are Wannier-like wave 
functions centered on different positions of the 
system, which are truncated beyond a given cutoff 
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radius R c (different from the cutoff of the LCAO 
basis orbitals r c ). This R c is a variational parame¬ 
ter which controls the accuracy of the calculation, 
and it is the only approximation involved in the 
O(N) solution of the Hamiltonian. A detailed de¬ 
scription of the accuracy of that truncation can be 
found in Ref. [5]. 

Once the wave functions solution of H have 
been obtained with the order-N functional, a new 
Kohn-Sham Hamiltonian is built from the new 
charge density, and the procedure is iterated until 
self-consistency is achieved. At this point, the en¬ 
ergy and atomic forces are evaluated by means of 
the expressions given in the previous section. 


Accuracy and Efficiency Tests 

SMALL MOLECULES 

Several tests on the performance of the approxi¬ 
mations used in our method are presented here, 
especially for the bases and their confinement. 
Table I shows the results for the interatomic dis¬ 
tance in diatomic molecules as compared with 
experiment and with other LDA calculations per¬ 
formed with converged augmented plane waves. 
They are all calculated within LDA. Several bases 
are compared. The single-z basis is as explained 
above, using a sufficiently long radius of confine¬ 
ment. Its performance is improved by scaling the 
radial dependence with appropriate scale factors, 
which are obtained variationally. Double-z bases 
are used in the split-valence scheme, and the effect 
of confinement on them is also shown. 


Table II displays results of polarized bases. The 
accuracy obtained is within 1% of the experimen¬ 
tal numbers (except for hydrogen, since its small 
size makes the percentual error larger). This shows 
the high quality attainable with short-range basis 
functions. The transferability of the previous bases 
is tried on CO and NO (within Table I) and also 
for HCN (Table III), which also performs very well 
with double-z bases (both split-valence and short- 
range). 

More delicate is the description of subtle bond¬ 
ings like the hydrogen bonds. A couple of water 
molecules is studied, and the results for the geom¬ 
etry are compared with experiments and with other 
calculations. Table IV displays the results using 
LDA exchange-correlation. Table V does the same 
for the results of GGA calculations. It can be con¬ 
cluded that: (i) with a double-z, polarized basis the 
results obtained for the geometry are perfectly 
comparable in accuracy with converged plane 
waves results, (ii) a good comparison with experi¬ 
ment requires the use of GGA, and (iii) the bind¬ 
ing energies are in general difficult to describe 
with our methods, requiring both long r c and 
ample bases. 

DIVA 

We present here first ab initio results for the 
structure of deoxyribonucleic acid (DNA). It is 
performed on a turn of the double helix with ten 
guanine-cytosine base pairs, which is repeated pe¬ 
riodically making an infinite structure with a unit 
cell containing 650 atoms. It is relaxed starting 


TABLE I o _ 

Bond lengths (in A) calculated for some diatomic molecules using different bases. 3 



Experiment [22] 

APW [22] 

SZ 

SZ(scaled) 

SZ(SR) 

DZ(SV) 

DZ(SR) 

H 2 

0.741 

0.767 

0.928 

0.772 

0.789 

0.773 

0.727 

c 2 

1.244 

1.249 

1.425 

1.348 

1.332 

1.279 

1.262 

n 2 

1.098 

1.098 

1.283 

1.255 

1.194 

1.123 

1.125 

o 2 (lda) 

1.207 

1.212 

1.487 

1.445 

1.296 

1.366 

1.259 

o 2 (lsd) 

1.207 

— 

1.406 

1.388 

1.284 

1.293 

1.241 

CO 

1.127 

1.132 

1.309 

1.276 

1.206 

1.172 

1.156 

NO 

1.148 

1.148 

1.409 

1.376 

1.252 

1.233 

1.234 


a SZ: single-z pseudoatomic orbitals with large cutoff radius (r c = 6.2 a.u.); SZ(SR) same with short r c (3.3 a.u. for H, 4.1 a.u. for C, 
3.6 a.u. for N, and 3.2 a.u. for O); SZ (scaled) scales the SZ bases with scale factors determined variationally: 0.950 for C, 0.975 for 
N, 0.980 for 0, and 0.800 for H. For CO and NO the scaling factors are found to be the same as for the dimer of each element; 
DZ(SV): split-valence double-z for long r c orbitals; DZ(SR): short r c double-z bases. 
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TABLE II 


Calculated bond lengths (in A) for some 
diatomic molecules adding polarization orbitals 
to double-z (DZP) short range (SR) basis sets. 3 



Exp. 

100 Ry 

DZP (SR) 

200 Ry 
DZP(SR) 

h 2 

0.741 

0.729 (1.49%) 

0.726 (2.01%) 

C 2 

1.244 

1.253 (0.74%) 

1.250 (0.46%) 

n 2 

1.098 

1.103 (0.45%) 

1.092 (0.55%) 

o 2 (lda) 

1.207 

1.215 ( 0 . 66 %) 

1.206 (0.08%) 

o 2 (lsd) 

1.207 

1.203 (0.32%) 

1.197 (0.83%) 

CO 

1.127 

1.117 (0.89%) 

1.118 (0.75%) 

NO 

1.148 

1.144 (0.31%) 

1.142 (0.55%) 


a r c for the polarization orbitals minimize the total energy. In 
parenthesis, the relative error with respect to experiment. 


from the experimental geometry [29], which was 
obtained from X-ray diffraction experiments on 
microcrystalline fibers, refined with a least-squares 
method using "standard" bond lengths. 


TABLE III_ 

Bond lengths (in A) for HCN using different 
basis sets. 



Exp. 

SZ(SR) 

DZ(SR) 

DZ(SV) 

c/(HC) 

1.066 

1.136 

1.070 

1.083 

d(CN) 

1.153 

1.211 

1.159 

1.174 


This calculation is, to our knowledge, the first 
self-consistent ab initio calculation of such a struc¬ 
ture [30] and represents quite a challenge because 
of both the complexity and the importance of the 
system. In a first step presented here, the simplest 
scheme has been used, namely, LDA for exchange 
and correlation, minimal basis set, and quite short 
confinement radii [30] (3.3 a.u. for H, 4.1 a.u. for C, 
3.6 a.u. for N, 3.2 a.u. for O, and 4.7 a.u. for P). The 
grid cutoff was 113 Ry. The approximations are 
substantial but constitute a beginning toward finer 
calculations (higher cutoff, double-z, somewhat 
longer r/s, GGA instead of LDA). 


TABLE IV 


LDA results for the H 2 0 dimer using different basis sets. 3 



SZ(SR) 

DZ(SR) 

DZP(SR) 

SZ 

DZ 

DZP 

PLW-VWN [23] 

LCAO-VWN [24] 

Exp. [23] 

r(O-O) 

2.628 

2.575 

2.687 

2.723 

2.590 

2.747 

2.70 

2.71 

2.98 ± 0.01 

r(0,-H) 

1.012 

0.971 

0.976 

1.049 

0.999 

0.986 

0.961 

0.980 

— 


105.6 

110.1 

106.1 

108.7 

112.2 

106.7 

106.2 

104.7 

— 

r(O d -H,) 

1.042 

0.999 

0.988 

1.069 

1.026 

1.002 

0.980 

0.997 

— 

r(O d -H 2 ) 

1.007 

0.972 

0.971 

1.041 

1.000 

0.981 

0.961 

0.977 

— 


103.0 

106.9 

105.6 

107.5 

113.3 

107.3 

106.1 

105.4 


o d 

1.2 

0.4 

6.3 

6.7 

7.1 

7.5 

4.84 

9.0 

6 ± 20 

BE 

19.39 

20.39 

19.37 

14.21 

15.54 

11.01 

9.06 

9.16 

5.44 + 0.7 


a For notation see Ref. [233. Lengths in A and angles in degrees. PLW: plane-wave calculation. VWN: Vosko, Wilk, and Nusair [25] 
functional for correlation. BE: Binding energy (kcal/mol). 


TABLE V_ 

GGA 14 results for the H 2 0 dimer using different basis sets. 



SZ(SR) 

DZ(SR) 

DZP(SR) 

SZ 

DZ 

DZP 

PLW-B [23] 

LCAO-B [24] 

LCAO-PW [24] MP 2 [28] 

r(O-O) 

2.663 

2.641 

2.715 

2.837 

2.752 

2.902 

2.98 

2.886 

2.877 

2.911 

r(O a -H) 

1.013 

0.967 

0.971 

1.051 

0.996 

0.981 

— 

0.979 

0.981 

0.957 

<t>a 

105.2 

109.2 

105.8 

107.2 

111.2 

106.2 

— 

106.2 

104.4 

— 

r{O d -H,) 

1.039 

0.989 

0.987 

1.066 

1.015 

0.988 

— 

0.990 

0.990 

0.964 

r(CVH 2 ) 

1.008 

0.967 

0.967 

1.045 

0.997 

0.980 

— 

0.977 

0.979 

— 

<Pd 

103.2 

106.6 

105.3 

107.1 

110.8 

104.7 

— 

106.2 

106.0 

— 


- 0.8 

0.1 

4.9 

6.9 

2.6 

4.7 

— 

7.0 

15.1 

4.5 

BE 

15.17 

15.92 

15.57 

10.16 

11.76 

7.36 

4.90 

4.51 

5.993 

5.44 


a For notation see Ref. [23]. Lengths in A and angles in degrees. BE: Binding energy (kcal/mol). B: Becke [26] functional for GGA. 
PW: Perdew-Wang91 [27] functional for GGA. MP2: Second-order Moller-Plesset with a 6-311 ++ G(2d, 2p) basis [28]. 
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Let us show first results for an isolated gua¬ 
nine-cytosine pair. Figure 2 shows the structure of 
the pair, with its three hydrogen bonds. Table VI 
shows the distances obtained for them. The bridge 
distances (N-N or O-N) are 10-15% shorter than 
that obtained from other more elaborate calcula¬ 
tions [31, 32]. This may be due to the short r c 
employed in the calculations. The position of the 
hydrogen atoms in between tends too much to¬ 
ward the middle of the bridge. It has been argued 
that this is a pathology of LDA that can be cor¬ 
rected using GGA instead, but it could also be due 
(or enhanced) by the fact of using nonscaled sin- 
gle-z basis for H, which gives too long O-H and 
N-H bonds. 

The DNA calculation required about 500 Mb of 
computer memory. For the order-N solution of the 
electronic problem, a cutoff of 4.0 A was used for 
the electronic wave functions, which were reduced 
to only occupied states (i.e., M = N/ 2). The en¬ 
ergy functional [Eq. (17)] therefore reduces to the 
original Mauri-Ordejon functional [3, 4], The full 
relaxation of the structure needed over 200 mini¬ 


mization steps, which amounted to around 5 days 
of CPU time in a HP Cl 10 workstation. The devia¬ 
tion from the experimental geometry of Ref. [29] is 

/ E(r th - r ) 2 o . v 

a=\ -= 0.23 A. (18) 

V N 

The hydrogen bonds between guanine and cyto¬ 
sine display similar pathologies as the ones showed 
by the isolated pair of bases, as can be seen in 
Table VII. It is interesting to note, however, that 
the skeleton shows much smaller deviations, as, 

TABLE VII_ 


Distances (in A) for the H bonds in a guanine 
(Gu) - cytosine (Cy) pair within the DNA structure. 


Gu-Cy 

c/(Gu-Cy) th 

d(Gu-Cy) exp 

N-N 

2.58 

2.85 

N-0 

2.63 

2.78 

O-N 

2.50 

2.80 


TABLE VI_ 

Distances (in A) for the H-bonds in the guanine (Gu) - cytosine (Cy) isolated pair. 3 


Gu-Cy 

cf(Gu-H) 

d(H-Cy) 

of (Gu-Cy) 

d(Gu Cy) MP2 

Exp. 

N-H-N 

1.24 

1.37 

2.61 

3.05 

2.95 

N-H-0 

1.17 

1.36 

2.53 

3.01 

2.86 

O-H-N 

1.23 

1.28 

2.51 

2.93 

2.91 


a MP2 refers to the Moller-Plesset calculation of Ref. [32] using a 6-31G* basis. Experimental result are taken from Saenger [33]. 
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for example, can be seen in a characteristic dis¬ 
tance between the C atoms in the sugar molecules 
supporting a basis pair at opposite sides, R, char¬ 
acteristic of the interhelix distance: 

R exp - 10.70 A, R th - 10.61 A. (19) 

The structure is not relaxed along the helix axis. 
However, the calculated stress along that direction 
is quite small (11 kbars) allowing us to expect a 
small relaxation in that direction. A more detailed 
account of the results will be presented elsewhere. 


Conclusions 

An efficient method for ab initio calculations 
has been presented. It is based on the density-func¬ 
tional theory (DFT) and utilizes LCAO and stan¬ 
dard norm-conserving pseudopotentials. It has also 
been shown how the computational effort scales 
linearly with the system size. After a description of 
its different aspects, calculations have been pre¬ 
sented for testing its capabilities, beginning with 
small molecules and finishing with DNA. The 
quality of the results has been shown to be mostly 
related to the quality of the LCAO basis set. 
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ABSTRACT: Electron propagator methods for calculating vertical electron detachment 
energies of closed-shell anions are compared with respect to accuracy and efficiency. 
Examination of self-energy terms and numerical tests on typical anions show that the 
partial third-order (P3) quasi-particle approximation is superior to the third-order and 
outer valence Green's function methods in both respects. © 1997 John Wiley & Sons, Inc. Int 
J Quant Chem 65: 463-469, 1997 


Introduction 

T he chemistry and physics of gas-phase anions 
exhibit several lively trends where theory and 
experiment interact. Double Rydberg anions 
demonstrate the importance of diffuse basis func¬ 
tions and electron correlation in describing 
electron pairs bound to a molecular cation [1]. 
Thorough calculations continue to identify super¬ 
halogens: molecules with electron affinities ex¬ 
ceeding those of the halogen atoms [2]. Doubly 
charged and multiply charged metal fluoride com¬ 
plexes have been studied computationally [3]. Car- 
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bon cluster dianions [4] have been examined with 
several theoretical techniques [5]. Advances in 
preparative and computational capabilities por¬ 
tend an expansion of interest in larger species such 
as fullerenes [6] and organic poly anions [7]. A 
need therefore exists for correlated methods that 
are accurate and feasible for systems that require 
hundreds of basis functions. 

Among the most important properties of an 
anion are its electron detachment energies, both 
vertical and adiabatic. Because of the variety and 
unprecedented character of bonding patterns found 
in anions, a theory of electron detachment energies 
should be free of model potentials, should be sys¬ 
tematically improvable with respect to electron 
correlation, and should include no arbitrary pa¬ 
rameters. Some ab initio techniques satisfy these 
requirements. Accuracy and feasibility criteria, es¬ 
pecially the scaling of arithmetic operations and 
storage requirements, often determine which of 
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these techniques will be most advantageous for a 
given system. Electron propagator methods [8-15] 
excel in calculations of many types of electron 
binding energies and have proven to be useful 
tools in the study of anions [16]. 

A recent propagator approximation is compared 
here with methods that have been extensively ap¬ 
plied. Closed-shell anions with subtle correlation 
effects and extensive orbital relaxation in the corre¬ 
sponding neutral radicals are considered. Calcula¬ 
tions are performed with various basis sets in 
order to estimate the behavior of the propagator 
approximations in the limit of a complete basis set. 


Theory 


The r, s element of the electron propagator [8] is 

G rs (E) = {{a\;a s )) 

I (N\a\\N-l,nXN-l,n\a s \N) 
"J™ „ E + E„(N-1) — E 0 (N) —irj 


+ £ 


m 


(N\a s \N+l,mXN+l,m\al\N) 
E — E (N + 1) +E 0 (N) + 7tT~ 


( 1 ) 


Through introduction of superoperators and a cor¬ 
responding metric, the propagator may be repre¬ 
sented more compactly [9, 17]. Superoperators act 
on field operator products, X, where the number 
of annihilators exceeds the number of creators by 
one. The identity superoperator, I, and the Hamil¬ 
tonian superoperator, H, are defined by 

IX = X (2) 

and 

HX = [ X, H]-, (3) 

respectively. The superoperator metric, defined by 

(fi\v) = (N\[^,p] + m, ( 4 ) 

depends on the choice of the N-electron reference 
state, |N). In this notation, 

«4;fl s » = G rs (E) = (fl r |(Ef-H) _1 fl s ). (5) 

A matrix generalization of Eq. (5) is 

_| 

G(E) = (a|(E f-H) a). (6) 


After the inner projection, 

G(E) = (a|u)(u|( El- H)u) _1 (u|a), (7) 

where u is the vector of all X field operator 
products. If u is partitioned into the primary space, 
a, and an orthogonal space, f, then the partitioned 
form of the propagator matrix reduces to 

-l 

1 

0 .' 

( 8 ) 

After a few elementary matrix manipulations, a 
convenient form of the inverse propagator matrix 
emerges, where 

G -1 (E) = El - (a|Ha) 

-(a|Hf)[El-(f|Hf)] _ \f|Ha). (9) 

Elements of the zeroth-order inverse propagator 
matrix are 


G(E) =[1 0] 


El — (a Ha) 

— (a|Hf) 

— (f |Ha) 

El-(f|Hf) 


[G^(E)] r ,-(E-e r )5 rs . (10) 

(The poles equal Koopmans' theorem results.) The 
inverse propagator matrix and its zeroth-order 
counterpart therefore are related through 

G-HE) = GoHE) - 2(“) - £'(E), (ID 

where 

2 (°o) rs = (a r \Ha s ) correhtjon = E(«IU«) P ( c „ (12) 

til 

and 


V(E) = (a|H£)[El - (f|Hf)] _1 (f|Ha). (13) 

Corrections to the zeroth-order propagator in Eq. 
(11) are gathered together in a term known as the 
self-energy matrix X(E). The Dyson equation may 
be written as 

G _1 (E) = Go’(E) - 2(E). (14) 

In the self-energy matrix, there are energy-inde¬ 
pendent terms and energy-dependent terms: 

2(E) = 2(oo) + S'(E). (15) 

In the limit of |E| -> 2(E) approaches its 

energy-independent component, 2(°°). At a pole 
energy, 

det{G(E)} -»o°. (16) 
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Poles may be found by requiring that 

det{G -1 (E)} =0. (17) 

Previous Approximations 

The second-order self-energy matrix, 

X (2) (E) = (a|Hf 3 ) (1) El — (f 3 |Hf 3 ) (0) \f 3 \Haf\ 

(18) 

follows from zeroth- and first-order choices for 
superoperator blocks defined in terms of the a and 
f 3 (2 h-p and 2 p-h ) operator manifolds [9]. (Super¬ 
scripts in parentheses denote the order through 
which a superoperator matrix element is evalu¬ 
ated.) Upon retention of first-order terms in 
(f 3 |Hf 3 ), third- and higher-order contributions to 
the self-energy matrix are generated. Third-order 
terms also are created by perturbative improve¬ 
ments in the ground-state averages [18], The 
third-order self-energy matrix contains contribu¬ 
tions of both types and may be expressed as 

2 (3) (E) 

= X (2) (E) + (a|ya) <3> 

/ , ~ A2)\ / , ~ \(0)l -1 /„ , * \(1) 

+ (a|Eff 3 ) El — (f 3 lEf 0 f 3 ) (f 3 |Ha) 

/ , A ,(1)[ , , A n( 0)1 -1 / , A s (2) 

+ (a|Hf 3 ) El — (f 3 |Ef 0 f 3 ) J (f 3 |Ha) 


+ (a|Hf 3 ) (1) [El - (f 3 lH 0 f 3 ) (0) ] \h\Vhf 
X El - (f 3 |H 0 f 3 ) <0) 1 (f 3 |Ha) (1) , (19) 

A 

where V is the superoperator for the fluctuation 
potential. 

Results on valence ionization energies and elec¬ 
tron affinities of closed-shell molecules generally 
indicate that off-diagonal elements of the self¬ 
energy matrix in the canonical basis are small and 
have a negligible effect on poles. Quasi-particle 
approximations explicitly neglect these matrix ele¬ 
ments. The associated pole search becomes espe¬ 
cially easy, for the zeroes of the diagonal elements 
of the Dyson equation can be found by solving 

E - e p + X pp (E). (20) 

Simple measures for estimating fourth- and 
higher-order terms have been applied widely. Once 
a third-order pole is found, various terms are rear¬ 
ranged to make these estimates. The A version of 
the outer valence Green's function (OVGF) meth¬ 
ods [14], e.g., corrects canonical orbital energies 
with 

V?' 0VGF) (E) = X <2 >(E) + (1 + X r )" 1 X' 3 r ) (E), 

( 21 ) 

where E is chosen to be the third-order pole and 


((a|Hf 3 ) <2) {El - (f 3 |Hf 3 ) <0) } \i 3 \H* f) 

X'= -2- _-_-_ill, (22) 

|(a|Hf 3 ) (1) {El-(f 3 |Hf 3 ) (0) } 1 (£ 3 |Ha) (1> |^ 

The B approximation is similar, but separates the energy-dependent 2 p-h and 2h-p contributions through 

X (B ’ 0VCF >(E) = X (2) (E) + (a|y a )fr + (1 + X r 2 "-py 1 Xf/ 2 "-f’KE) + (1 + X r 2 ’ , - h )~ 1 Xf r '^- h KE), (23) 
where 

((a|Hf 2;i , p ) <2> {El - (t 2h - p \Hf 2h ^f) 

X 2 h ~ v = —2—-—rj-(24) 

{(a|Hf 2 „- P ) a) {El - (f2^!-pI 2 /:- P ) <0> } (f 2 ,- P lHa) (1) )^ 
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and 


((a|Hf 2p _,) (2> {El - (f 2 p_/,|Hf 2 ft ) <0> | 1 (f 2p -fc|Ha) (1) | 

X 2p-h = _ 2 \ _i_ L _ tlL' (25) 

|(a|f?f 2p _ ;i ) (1> {El - (f 2p _,|Hf 2p _„) (0) } \{ 2p _ h \Ha) m Y r 


Note that the constant terms are not scaled in this 
approximation. These schemes function best when 
the third-order numerators in Eqs. (22), (24), and 
(25) are small compared to the second-order de¬ 
nominators. For other cases, the C recipe often is 
preferable. The self-energy correction is 

Xf r ’ OVGF KE) = S®(£) + (1 + X r c ) _1 X®(E), 

(26) 

where 

vC X 2, ' _p 2f/ 2,1_p) (E) + X, 2p -' , Xf/ 2p_ '' ) (E) 

Xr “ Xf^PKE) + 2f/ 2p_,!) (E) ‘ 

(27) 

The utility of quasi-particle approximations was 
tested recently through a comparison of third-order 
and OVGF results for 25 ionization energies of 10 
closed-shell molecules with H, C, N, O, and F 
nuclei [19]. Calculations were performed with a 
correlation-consistent, triple £ basis [20]. Root 
mean-square errors with respect to experiment in 
eV were 1.25 for Koopmans's theorem, 0.67 for 
third order, 0.27 for OVGF-A, 0.33 for OVGF-B, 
and 0.32 for OVGF-C. After employment of numer¬ 
ical criteria [19] for choosing a recommended 
OVGF value (i.e., a selection from one of the three 
versions), the root mean-square error is 0.25 eV. 
Once third-order results are available, OVGF esti¬ 
mates are very efficient tools for making assign¬ 
ments and predicting ionization energies. 


Partial Third-order Theory 

The usual choice of superoperator metric starts 
from a Hartree-Fock wavefunction plus perturba¬ 
tive corrections: 

(Y|Z) = <HE|(1 + T t )[ Y t , Z] + (1 + T)|HF>, 

(28) 


where 

T = T 2 (1) + T[ 2) + T 2 (2) + TP + TP + •••. (29) 

The level of excitation in Tp is indicated by the 
subscript, e, and the order is defined by the super¬ 
script, /. For example, second-order triple excita¬ 
tions are represented by T 3 (2) . Coupled-cluster 
parametrizations of this metric suggest an alterna¬ 
tive form: 

(Y|Z) - < HF\e~ T [Y\Z] + e T \HF >. (30) 

This choice produces asymmetric superoperator 
matrices. A simplified final form for the self-energy 
matrix that does not require optimization of clus¬ 
ter amplitudes is sought presently; the approxima¬ 
tion 

e T * 1 + r 2 (1) (31) 

therefore is made. 

With this choice, several third-order terms that 
appeared with the usual metric are eliminated. 
The new self-energy matrix in third order is asym¬ 
metric and is expressed by 

2(E) 

= (a|Hf 3 ) (,) {El-(f 3 |Hf 3 ) (m } 1 (f 3 |Ha)° ) 

+ (a|Hf 3 ) (1) {El - (f 3 |Hf 3 ) <0> } \f 3 \Ha f 

+ (a|Hf 3 ) (1) {El - (f 3 lHf 3 ) (0) } 1 (£ 3 |V£ 3 ) (1> 

X |e 1 - (f 3 |Hf 3 ) (0) j 1 (f 3 |Ha) <1) . (32) 

Note that energy-independent terms in the third- 
order self-energy matrix are not retained. 

Two observations suggest additional economies: 
First, numerical results for ionization energies 
show that third-order, 2 p-h terms in Eq. (32) are 
small relative to their 2 h-p counterparts [21]. 
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Terms arising from these operators are important 
in second order, however. Second, evaluation of 
one of the third-order 2 p-h terms requires electron 
repulsion integrals with four virtual indices, ac¬ 
cording to 


znr 


W e) = -E 


< pi\\ab){ab\\cd) 


2 7b E + e i 


(33) 


^a ^b 


where i, ], k,... are occupied indices, a,b,c ,... 
are virtual indices, and p, q, r, are general in¬ 
dices. Storage of these integrals is often avoided 
through semidirect algorithms [22]. Contractions 
involving integrals with four virtual indices re¬ 
main the arithmetic bottleneck in third-order 
quasi-particle theory, for the scaling factor for a 
given final state with index p is OV 4 , where O is 
the number of occupied orbitals and V is the 
number of virtual orbitals. 

Neglect of third-order, 2 p-h terms produces 
this self-energy matrix: 


1 < pi\\ab)(ab\\qi) 

Z{E)pq = ~ 

6 — 6 , 

a l 


2“E + e 


y < 

+ 2 % E + t n -*i- e i 

1 U pai j(EKij\\qa) 

+ ~L "FT- 

2 E + € n — 6: - €: 

cnj a 1 J 


(34) 


where 




(qa\\bc)(bc\\ij) 


^ be ^b ^c 


+ (1 ~ Pj j) 


(qk\\bi)(ba\\ jk) 


} bk € j + € k ~ e a € b 


(35) 


and 




< pa\\kl){kl\\ij) 


2 k i E + e a e k 6 } 


-(l-Pn)i: 


< pb\\ jk)(ak\\bi) 


} bk E + e b 6 j e k 


(36) 


This partial third-order expression is designated 
by the abbreviation P3. 

Comparison of the self-energy matrix elements 
of Eq. (34) with older, related methods [14, 23] 
reveals the advantages of the P3 approximation. 
OVGF results [14] and the third-order quasi-par¬ 


ticle calculations on which they are based require 
the evaluation of the diagonal elements of the 
self-energy matrices given in Eq. (18) (second-order 
terms) and Eq. (19) (third-order terms). The second 
term (the third-order, energy-independent dia¬ 
grams) and the third term in Eq. (19) are not 
present in the P3 self-energy. Furthermore, the 
2 p-h operators included in the fourth and fifth 
terms of Eq. (19) are dropped in P3 theory as well. 
Another difference between P3 and OVGF is the 
absence of scaling factors, such as those in Eqs. 
(21M27), in P3. 

In the P3 quasi-particle propagator, only diago¬ 
nal elements of the self-energy matrix are retained. 
The first contraction in Eq. (35) is the most de¬ 
manding, for it has an arithmetic scaling factor of 
O 2 V 3 for a given q. This step also requires electron 
repulsion integrals with one occupied and three 
virtual indices. The intermediate is energy- 

independent and must be evaluated once only for 
each final state. 

A quasi-particle implementation of the P3 
method was applied to 19 ionization energies of 
six closed-shell molecules with a correlation-con¬ 
sistent, triple £ basis [21]. Comparisons with re¬ 
sults of more costly propagator techniques were 
made by applying the same basis sets. The average 
absolute errors in eV are 1.34 for Koopmans's 
theorem, 0.25 for OVGF-B, and 0.19 for P3. The P3 
procedure exhibits accuracy at least as good as 
that of other methods, superior arithmetic scaling, 
and no need for electron repulsion integrals with 
four virtual indices. 


Results 

Electron propagator calculations were per¬ 
formed on F“, OH - , NH 2 , Cl - , SH - , and PH 2 
with a modified version of Gaussian 94 [24]. The 
geometries of the anions were optimized with 
MBPT(2) [25] and the 6-31 + G(d,p) basis [26]. 
Propagator calculations employ three basis sets: 
6-311 + +G(2d/,2pd) [27], aug-cc-pVDZ, and aug- 
cc-pVTZ [20]. Core orbitals were dropped from all 
summations pertaining to the propagator self-en¬ 
ergies. Several diagonal self-energy approxima¬ 
tions were examined: second order, third order, 
OVGF-A, OVGF-B, OVGF-C, the recommended 
OVGF (OVGF-R) value arising from a selection 
procedure [19], and P3. 
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TABLE I_ 

Electron detachment energies (eV). 


Anion 

Basis 

KT 

2 

3 

A 

B 

C 

R 

P3 

Expt. [28] 

F” 

6-311 ++G(2df,2pd) 

4.83 

1.23 

5.69 

3.83 

4.39 

4.17 

4.39 

3.72 



aug-cc-pVDZ 

4.93 

1.03 

6.61 

4.32 

5.05 

4.62 

5.05 

3.79 



aug-cc-pVTZ 

4.92 

1.21 

6.75 

4.24 

5.00 

4.66 

5.00 

4.15 

3.399 ± 0.003 

OH” 

6-311 + + G(2df,2pd) 

2.88 

-0,01 

3.59 

1.92 

2.38 

2.28 

2.38 

2.12 



aug-cc-pVDZ 

2.95 

-0.17 

4.22 

2.26 

2.86 

2.58 

2.86 

2.11 



aug-cc-pVTZ 

2.97 

-0.02 

4.43 

2.24 

2.86 

2.67 

2.86 

2.48 

1.828 

NH 2 - 

6-311 + + G(2df,2pd) 

1.30 

-0.38 

1.36 

0.52 

0.77 

0.81 

0.77 

0.77 



aug-cc-pVDZ 

1.31 

-0.51 

1.53 

0.64 

0.97 

0.91 

0.97 

0.64 



aug-cc-pVTZ 

1.34 

-0.30 

1.65 

0.68 

1.01 

1.02 

1.01 

0.96 

0.779 ± 0.037 

Cl” 

6-311 ++G(2df,2pd) 

4.08 

3.04 

3.68 

3.39 

3.54 

3.57 

3.54 

3.41 



aug-cc-pVDZ 

4.10 

2.91 

3.85 

3.49 

3.70 

3.68 

3.70 

3.35 



aug-cc-pVTZ 

4.09 

3.08 

3.88 

3.51 

3.73 

3.74 

3.73 

3.50 

3.615 

SH” 

6-311 + + G{2df,2pd) 

2.56 

1.80 

2.27 

2.03 

2.18 

2.20 

2.18 

2.08 



aug-cc-pVDZ 

2.58 

1.70 

2.39 

2.09 

2.29 

2.28 

2.29 

2.01 



aug-cc-pVTZ 

2.58 

1.88 

2.48 

2.16 

2.38 

2.39 

2.38 

2.19 

2.319 ± 0.010 

PH 2 ” 

6-311 + +G(2df,2pd) 

1.20 

0.89 

1.06 

0.95 

1.06 

1.05 

1.06 

0.98 



aug-cc-pVDZ 

1.21 

0.81 

1.10 

0.94 

1.10 

1.09 

1.09 

0.88 



aug-cc-pVTZ 

1.23 

0.99 

1.22 

1.06 

1.22 

1.21 

1.21 

1.09 

1.271 ± 0.010 


F”, OH", and NH 2 were challenging tests for 
perturbative quasi-particle methods. Table I sum¬ 
marizes these calculations and shows that third- 
order results (see the column labeled 3) may be 
worse than the predictions of Koopmans's theo¬ 
rem (see the KT column). Such behavior is rare for 
ionization energies of closed-shell molecules. 
Second-order results are relatively unreliable and 
may even produce the wrong sign. It is also worth 
noting that third-order pole strengths often exceed 
unity for OH - and NH 2 . (The analytic behavior of 
the self-energy matrix elements does not assure 
the correct sign of the derivatives with respect to 
E.) OVGF scaling procedures reduce the errors, but 
discrepancies remain large. The A version of OVGF 
behaves best for F”, OH”, and NH^ but OVGF-R 
(see the column labeled R) generally favors the B 
version. As basis sets are improved, electron de¬ 
tachment energy results generally increase. The 
overestimation of these quantities by third-order 
and OVGF methods implies that discrepancies will 
grow in calculations with basis sets of higher qual¬ 
ity. It also is possible for fortuitous cancellations of 
basis-set and correlation errors to produce accurate 
predictions. 

The situation is markedly different for Cl”, SH”, 
and PH 2 . Here, agreement with experiment is 
good for third-order and OVGF methods, even 
though an overestimate of the electron detachment 
energies occurs for certains of pairs of methods 


and basis sets. For Cl” and SH”, better basis sets 
will produce poorer results with third-order and 
the B and C versions of OVGF. A reexamination of 
the OVGF selection procedure and its parameters 
may be advisable for anions. 

P3 results for F”, OH”, and NH 2 indicate that 
substantial errors have been overcome with this 
approximation, but that more terms must be added 
to obtain close agreement with experiment. Fortu¬ 
itous agreement with experiment is clearly a dan¬ 
ger with the 6-31 + + G(2df,2pd) basis, especially 
for NH^. Basis sets larger than aug-cc-pVTZ are 
likely to produce greater errors. This method must 
be used with caution when orbital relaxation is 
important in the final, neutral state. 

P3 behaves better with respect to basis-set satu¬ 
ration for Cl”, SH”, and PH 2 . Use of aug-cc-pVQZ 
basis sets will probably produce very close agree¬ 
ment with experiment, for the aug-cc-pVTZ results 
are still below the experimental values. In these 
three cases, relaxation and correlation corrections 
to Koopmans theorem results are relatively small 
and perturbative arguments can be expected to be 
more successful. 


Conclusions 

The P3 quasi-particle approximation is more 
computationally efficient than are the third-order 
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and OVGF methods. The bottleneck contraction of 
P3 requires 0 2 V 3 operations, whereas the most 
difficult contraction of third order (and therefore 
OVGF) scales as OV 4 . The latter step must be 
repeated for each energy iteration, but the 0 2 V 3 
step present in P3 must be performed only once 
per final state. No electron repulsion integrals with 
four virtual indices are needed in a P3 calculation. 

Results on the electron detachment energies of 
anions suggest that P3 is a more reliable method 
than are the third-order and OVGF approxima¬ 
tions in the limit of a complete basis set. For F", 
OH“, and NH^, large discrepancies with experi¬ 
ment remain for all of the methods, although the 
errors are not as large for P3. For Cl", SH“, and 
PH 2 , where relaxation and correlation effects are 
not as large, P3 results produce close agreement 
with experiment and are likely to be superior to 
third order and OVGF for saturated basis sets. 
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ABSTRACT: We develop the path integral method for quantum chemistry, apply the 
Monte Carlo method to an evaluation of the path integral, and calculate the ensemble 
average of the energy. For finite temperature, a simple Monte Carlo evaluation of the 
path integral brings out the negative-sign problem. In this work, to avoid this problem in 
the numerical evaluation, we apply the new reweighting method to the Monte Carlo 
integration, and calculate effectively the ensemble average of the energy for finite 
temperature. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 471-476, 1997 


Introduction 

The path integral introduced by Feynman [1] 
has been applied to many problems in quantum 
mechanics. In the path integral, the physical quan¬ 
tity can be handled as a commuting variable like a 
classical one. Thus the physical processes can be 
understood more clearly than in ordinary quan¬ 
tum mechanics. When we apply the path integral 
method directly to the electron field described by 
the anticommuting field operator, we must deal 
with the corresponding anticommuting c-num- 
bers, the Grassmann numbers. However, the prop¬ 
erties of the Grassmann numbers are, in general, 
not so well known [2, 3], so that, in the field of 
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quantum chemistry, the path integral method has 
not been widely applied to the many-electron 
problems. 

Recently the path integral method has been ap¬ 
plied to the Huckel model [4] and the many-elec¬ 
tron problem [5]. We have also developed the path 
integral method without using the Grassmann 
numbers to estimate the electron correlation in 
molecules [6], In that work, we derived the new 
closure relation in terms of the Linear Combina¬ 
tion of Atomic Orbital (LCAO) coefficients of the 
molecular orbital, and have constructed a path 
integral form of some physical quantities using the 
LCAO closure relation. In the previous work [7], 
we presented the new closure relations that could 
be derived from an arbitrary complete set of state 
vectors, applied the closure relation to construct 
the partition function in the path integral form, 
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and evaluated the ensemble average of the energy. 
In these treatments, there appear no Grassmann 
numbers corresponding to the electron field opera¬ 
tor but the c-numbers corresponding to LCAO 
coefficients or state amplitudes. 

The essence of the path integral is a sum of 
histories, and the path integral is usually ex¬ 
pressed as a multidimensional integral with a huge 
number of variables. Therefore, we cannot directly 
carry out this integration by ordinary ways and 
will be forced to use statistical methods, such as 
the Monte Carlo method. Therefore, we have ap¬ 
plied the Monte Carlo method to evaluate the path 
integral [7]. 

The ordinary quantum Monte Carlo method is a 
very powerful tool in order to estimate thermal 
and quantal fluctuations of interacting quantum 
spin systems and strongly correlated electron sys¬ 
tems [8]. This method is based on the generalized 
Trotter formula, and the quantum fluctuation is 
expressed in the ordinary path integral form. In 
simulations of quantum systems, however, some 
of the local Boltzmann factors become negative, 
which leads to deteriorated statistics [9]. This diffi¬ 
culty is called the negative-sign problem. The neg¬ 
ative-sign problem is particularly serious in low- 
temperature regions of frustrated quantum spin 
systems and fermion systems [10, 11]. 

To reduce the difficulties of the negative-sign 
problem, the reweighting method has been pro¬ 
posed [12]. In this method, the Monte Carlo sam¬ 
pling is performed under the virtual Boltzmann 
weight instead of the original one, which is posi¬ 
tive definite and describes well the original sys¬ 
tem, and a value of a physical quantity is 
reweighted by the correction factor, the ratio of the 
original and virtual Boltzmann factor. 

Conventionally, one has been simulating the 
system by the absolute value of the Boltzmann 
factor of the original system as the virtual Boltz¬ 
mann factor. However, in the quantum Monte 
Carlo calculation, it has been difficult to get a 
quick convergence in numerical integrations using 
the absolute value weight function. 

In this work, we apply the new reweighting 
method to the Monte Carlo integration, and calcu¬ 
late effectively the ensemble average of the energy 
for finite temperature. In the second section, we 
briefly summarize the path integral formulation 
for the ensemble average of the energy. In the 
third section, the negative-sign problem in the 


Monte Carlo method is discussed and a new 
reweighting function, the midpoint procedure, is 
proposed to avoid this problem in the numerical 
evaluation. Finally, we apply this method to the 
hydrogen molecule, evaluate the ensemble average 
of the energy, and compare a quickness of conver¬ 
gence in numerical integrations between the con¬ 
ventional weight function and ours. 


Path Integral Formulation 

In this section, we briefly summarize our clo¬ 
sure relations and the path integral formulation of 
the ensemble average of energy [7], 

Now, we assume the orthonormal set, {| k)}, 
spans the Hilbert space as 

M 

1 = El k)(kl (1) 

k= 1 

<klk') = 8 kk ,, (2) 

where M is a dimension of the vector space {|fc)}. 
The state |fc) could be either the single-particle 
state or the many-electron configurational one. 
Then we could construct a generalized state vector 
by the linear combination of the above state vec¬ 
tors, as 


|C> = 


M 

E Qlfc>. 


k= 1 


( 3 ) 


here, the coefficient C k is an arbitrary complex 
number. If we choose the atomic orbital as \k), 
then | C) corresponds to the molecular orbital, and 
if | k) is the many-electron configuration, it be¬ 
comes the configuration interaction (Cl) state. 

Here, let us assume that C k are variables, and 
change these parameters arbitrarily. Then resulting 
|C) is transformed into an arbitrary state vector. It 
is important to notice that the space spanned by 
the state vector |C), generated by changing C k 
arbitrarily, could include every state in the Hilbert 
space, namely the set {|C>} is an overcomplete set. 
Thus, we could construct the closure relations. 


-/n 

J k= 1 


dC* dC k 


2 rri 


exp 


\ 


M 

- E IQ I 

1 


ICXCI, (4) 
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and 


= mi[ n 

J k— 1 


dCt dC k 


2ui 


j M 

E IQI 2 -1 

\k= 1 


|C)<C|. (5) 


The delta function ensures the normalization con¬ 
dition of the state vectors. Thus, in this case, each 
variable varies only on the surface of the unit 
sphere in the complex space. 

Next, we restrict C k in Eq. (3) to be real. Then, 
we could get the real number closure relations. 


, ^ dC k 

= 2 n^exp 

J k= 1 V 7T 


M 


E CjMlCXCl, (6) 

k -1 / 


and 


1 = IT 


IM\ ,2* 

T/n 


M dC k < M 
8 


4tt 


EQ 2 

u=i 


|C)<C|. (7) 


Consider a partition function, Z = Tr 
where Tr implies a trace operation, and fi = 
(/: b T) - 1 with the Boltzmann constant fc B and a 
temperature T. Let us divide the temperature in¬ 
terval into L infinitesimal slices, i.e., ft = Lr, 
and inserting the closure relation among segments, 
then Z becomes 


1 r 

Z = lim — / d/x,[C*,C]Z[C*,C], (8) 

l _, —» oo J 


where 


A ** dC*(l) dC k (l) 

d/x[c* ,c] = nn —— 

/= 1 k= 1 2tti 

L 

X n W[C*(Z),C(/)], ( 9 ) 

Z = 1 


ZtC*,C] = n<c(/)k-^ /L lc(/ - 1)>, (10) 
1=1 


and 


E =xrk (li) 

W[C*(/), C(0] is the Gaussian weight function or 
the delta function corresponding to Eq. (4) or (5), 
respectively, and is the normalization factor 
appearing in the denominator of the closure rela¬ 
tion. The trace operation in the partition function 
imposes a periodic boundary condition upon |C(/)) 
as 


|C(0)> = |C(L)>. (12) 


The ensemble average of the Hamiltonian can 
be calculated as 


<H> = 


Tr (e~P H H) 
Tr e~$ H 


— lim 

L-> 00 


/d/t[C*,C]Z[C*,C]- 
w i; (C(l)\H\C(l — 1)> 

X ; ^ <c(/)icq-i)> 

J d/r[C*,C]Z[C*,C] 


Note that, in the above expression, the normaliza¬ 
tion factor E has disappeared. It is impossible to 
evaluate Eq. (13) by direct numerical integration 
because of its huge amount of integration variables 
generated by slicing ft and the presence of the 
two-body interactions. Therefore, we should ap¬ 
proximate the above integral or should adopt other 
integration methods such as the Monte Carlo 
method. 


Negative-Sign Problem and 
Reweighting Method 


An estimation of Eq. (13) requires the numerical 
calculation of two multidimensional integrals over 
M X L variables with large enough L, but we 
cannot directly carry out this integration by ordi¬ 
nary ways. Therefore we apply the Monte Carlo 
method to do it. 

If we carry out the Monte Carlo integration, we 
would encounter the negative-sign problem. The 
negative-sign problem has its origin in the fact that 
some of the off-diagonal elements of the local 
Boltzmann factors, (C(l)\e" ISH/L \C(l — 1)) become 
negative. 

If the factor becomes negative, we cannot carry 
out the Monte Carlo integration. To tackle the 
problem, we apply the reweighting method to 
Monte Carlo integration [12]. In this method, a 
virtual Boltmann factor, B(/, / - 1) is introduced, 
which is a positive definite weight function. 


<co)k^ H/L ica 


= B(u - 1) 


-1)> 

<C(Z)|e-^ H/t |C(/ - 1)> 

BO, l- 1) 


. (14) 
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We regard the correction factor, which is the sec¬ 
ond factor in the above, as a part of the integrand 
and perform the importance sampling through the 
stochastic process defined by the virtual Boltz¬ 
mann weight. 

When the virtual system, which is described by 
the virtual Boltzmann weight, serves as a good 
approximation of the original system, we could 
obtain correct values for a smaller number of the 
Monte Carlo steps. One condition of a useful vir¬ 
tual system is that the ground-state energy of the 
virtual system is close to that of the original sys¬ 
tem. From this point of view, various virtual Boltz¬ 
mann factors and effective Hamiltonians were pro¬ 
posed as candidates for the better virtual system 
[12]. In the conventional approach [9], the absolute 
value of the Boltzmann factor is used as a weight 
function, i.e., 

B(l,l - 1) = |<C(/)|e”^ H/L |C(Z - 1))|. (15) 


However, in the quantum Monte Carlo calculation, 
it is difficult to get a quick convergence in numeri¬ 
cal integrations using the absolute value weight 
function. 

For the path integral formulation, the local 
Boltzmann factor is approximated as 


(C(/)|e“^ H/L |C(Z - 1» 

« <C(Z)|CO - 1)> 

P (C(l)\H\C(l - 1)> 


X exp 


L <C(/)|C(/-1)> 


(16) 


Therefore, the origin of the problem is reduced to 
the overlap (C(Z)|C(J — D). In our Monte Carlo 
algorithm [7], because we randomly generate the 
state |C(Z)> from \C(l — 1)), it is possible that the 
overlap becomes negative. Considering this fact, 
we propose a new weight function as 

Bill- 1) = <C(Z)k^ H / (2L) |f> 

X (t\e-P H/( ~ 2L) \C(l - 1)>, (17) 


where 



10 



state, |f> between |C(/ - 1)) and |C(/)>. 

Here we introduced the midpoint state, |0 be¬ 
tween |C(Z - 1)) and |C(J)>. (See Fig. 1.) In this 
case, B(Z, l - 1) is approximated as 

1 ) 

- — 1 )) 


p 

(< C(l)\H\t > 

(t\H\C(l - 1 » ) 

1 


2 L 

< cum 

<fica-i» J 



(19) 

By using Eq. (18), the preexponential factor be¬ 
comes 

<c(Olf><f|c(J- i)> 

(1 + <C(Z)|C(Z - 1)»(1 + <Cil - 1)1 C(Z)» 

“ 2 + <C(Z)[C(Z - 1)> + (Cil - 1)|C(Z)> 

( 20 ) 

Then the weight function Bill — 1) is always pos¬ 
itive and the negative-sign problem is absent. 

Our new weight function, the midpoint proce¬ 
dure, is interpreted as follows. Under the original 
local Boltzmann factor, the system freely evolves 
from |C(Z - 1)) to |C(Z)>, as in Figure 2(a), and we 
must consider every path between them. On the 



|A __ ]C(Z - 1)) + 1C(Z)> _ 

t ” yjl + (Cil)\Cil - 1)> + (Cil - 1)| Cil)) ' 

(18) 


FIGURE 2. Paths between | C(l - 1)) and |C(/)>. (a) 
Paths under the original Boltzmann factor. We must 
consider every path between them, (b) Paths under the 
midpoint procedure. Every path must passes |f > at the 
middle of the time interval /3/L. 
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other hand, under the midpoint procedure, the 
system evolves from | C(l — 1)) to |C(Z)) but be¬ 
comes 10 at the middle of the time interval, /3/L, 
as in Figure 2(b). For large enough L or small 
enough time interval, we could expect that the 
system behaved as if it were a classical one and 
reached the midpoint at the middle of the time 
interval. Thus, our midpoint procedure is a good 
approximation of the original Boltzmann factor 
and does not bring the negative-sign problem. 

We will show the effectiveness of the midpoint 
procedure by evaluating the ensemble average of 
the energy in the next section. 


Numerical Examples: 

Hydrogen Molecule 

To demonstrate that our new reweighting func¬ 
tion works, we calculate here the average energy 
(H) of the hydrogen molecule. 

The calculation is carried out with the mini¬ 
mum STO (£ = 1.0) and 4-31G basis sets at the 
internuclear distance of 1.4 a.u. The Monte Carlo 
integrations were done changing the virtual Boltz¬ 
mann factor, the absolute value weight, and the 
midpoint procedure. The number of samplings is 
100,000 and the temperature is 10 K. Calculations 
are executed on the S-4/1000 (FUJITSU Co. Ltd.). 
The convergence of (H) are presented in Figures 
3(a) and 3(b) corresponding to the minimum STO 
basis set and the 4-31G one, respectively. In the 
figures, the solid curves mean the energies calcu¬ 
lated by the midpoint procedure, and the dashed 
ones that by the absolute value weight. We omit¬ 
ted error bars in the figures, because the width of 
the error bar depends upon the sampling period in 
the Monte Carlo integration. Note that the full Cl 
energies are —1.821140 and —1.867025 a.u. for 
those basis sets, respectively. For both of calcula¬ 
tions, the calculated energies monotonously ap¬ 
proach to the full Cl energies, but the calculation 
by the midpoint procedure converges faster than 
that by the absolute value weight. 


Concluding Remarks 

We have introduced the midpoint procedure in 
the reweighting method. We could imagine that in 



Monte Carlo Steps 
a) 



Monte Carlo Steps 
b) 

FIGURE 3. Convergence of the ensemble average of 
energy of the hydrogen molecule. The solid curve means 
the energy calculated by the midpoint procedure and the 
dashed one that by the absolute value weight, (a) 
Calculations with the minimum STO basis set. (b) 
Calculations with the 4-31G basis set. 


short enough time the system behaved as if it were 
a classical one and reached the midpoint at the 
middle of the time interval. Therefore, the physical 
meaning of the procedure is clear and reasonable. 
We have calculated the ensemble average of the 
energy and have shown that calculations with the 
midpoint procedure converged faster than that 
with the conventional absolute value weight. 

The reweighting method is a useful tool to eval¬ 
uate integrals by the Monte Carlo method, espe¬ 
cially those with the negative-sign problem. The 
reweighting method has a freedom of the choice of 
the reweighting function, and it should be chosen 
by the physical insights. To choose the better 
reweighting function, it is necessary that the func¬ 
tion is hard to be negative and the virtual system 
specified by the function has close properties to 
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the original one. If the properties of the virtual 
system are very different from the original one, we 
could not expect a quick convergence of calcula¬ 
tions. The fast convergence of calculations and 
clearness of the physical meaning shows that our 
midpoint procedure is a good approximation to 
the original system. 

In the Monte Carlo method for the Fermi sys¬ 
tem, the negative-sign problem always occurs 
whatever closure relations one uses; however, we 
can expect that the midpoint procedure is effective 
on the Monte Carlo integration with any closure 
relation because of the above arguments. 

Using the midpoint procedure, we can easily 
and quickly evaluate physical quantities by the 
path integral Monte Carlo method not only for the 
zero temperature but also for the finite tempera¬ 
ture. In this work, we have used the <5 function 
closure relation, Eq. (4), and have calculated the 
energy with the midpoint procedure only for the 
hydrogen molecule. In the future, we will calculate 
other quantities as well as the energy at the finite 
temperature for many species with various closure 
relations. 
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ABSTRACT: A multigrid method for numerically solving electrostatics and quantum 
chemical problems in real space is discussed. Multigrid techniques are used to solve both 
the linear Poisson equation and the nonlinear Kohn-Sham and Poisson-Boltzmann 
equations. The electrostatic potential, Laplacian, charge densities (electrons and nuclei), 
Kohn-Sham DFT orbitals, and the self-consistent field potential are all represented 
discretely on the Cartesian grid. High-order finite differences are utilized to obtain 
physically reasonable results on modestly sized grids. The method is summarized and 
numerical results for all-electron atomic and molecular structure are presented. The 
strengths and weaknesses of the method are discussed with suggested directions for 
future developments, including a new high-order conservative differencing scheme for 
accurate composite grid computations which preserves the linear scaling property of the 
multigrid method. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 477-486, 1997 


Introduction 


umerical grid-based approaches offer an al¬ 
ternative to traditional basis-set methods for 
electronic structure. Several groups have recently 
begun to develop real-space grid techniques, 
including Becke [1], Bernholc et al. [2—4] 
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Chelikowsky et al. [5], Gygi and Galli [6], Kaxiras 
et al. [7], and Beck et al. [8-11]. In addition to 
these methods, there have been other develop¬ 
ments which point to the possibility of linear scal¬ 
ing electronic structure. The various linear scaling 
ideas (besides multigrid) are not discussed here 
but the reader is referred to the growing literature 
[12-23], 

The distinct advantages of the grid methods are 
(1) the convergence of the "basis set free" method 
is controlled only by the grid spacing and the 
order of the difference equations (or accuracy of 
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the finite element approximation), (2) the methods 
are flexible in that solution effort and accuracy can 
be focused in specific regions of space (all iteration 
steps are local), (3) finite or periodic systems are 
treated with equal difficulty; there is no need for 
supercells as with plane waves, and (4) linear 
scaling is possible with multigrid acceleration or 
related techniques. The principal disadvantage is 
that large grids may be required to obtain the 
desired accuracy so the methods can be memory¬ 
intensive. However, recent pseudopotential calcu¬ 
lations [3-5] have shown that storage require¬ 
ments are comparable to the corresponding 
plane-wave methods. In fact, it is somewhat sur¬ 
prising that results of decent accuracy can be ob¬ 
tained at all when the grids (if Cartesian) do not 
respect any physical atomic or molecular symme¬ 
try. It will be shown below that reasonable accu¬ 
racy can indeed be obtained in all-electron calcula¬ 
tions even on uniform grids where the core states 
are relatively poorly represented. Grid refinement 
techniques will allow improvement in those re¬ 
gions without a substantial increase in computa¬ 
tional cost while still maintaining the favorable 
linear scaling property of the multigrid. 

In previous work, we first developed a multi¬ 
grid method for solving the three-dimensional 
Poisson equation [10]. The charges were repre¬ 
sented directly on a Cartesian grid. The technique 
was tested on several finite and periodic problems. 
Using high-order finite difference representations, 
it yielded accurate results for analytic atomic mod¬ 
els and for periodic lattice problems which tested 
the method's ability to handle long-ranged interac¬ 
tions. Systems as large as a 4096 atom ionic crystal 
lattice were examined, and the known linear scal¬ 
ing of the multigrid method was observed [11]. It 
is interesting to note that for the infinite lattice 
problem the (local) iterations in real space are 
performed within the calculation box with neigh¬ 
boring wraparound interactions only as far as the 
finite difference Laplacian extends in space. The 
periodicity is enforced simply by making the po¬ 
tential equal on the opposite boundaries. The re¬ 
sulting solution gives the same physical result as 
the higher-scaling Ewald method. 

The Poisson solver was then used in the multi¬ 
grid solution of the Kohn-Sham equation [8, 9]. 
The entire problem was represented discretely on 
the grid, including the nuclear charge density. In 
this way, the full electrostatic problem for all elec¬ 
trons and nuclei is solved in a single linear scaling 


step. The time spent in solving the Coulomb prob¬ 
lem composes only a small portion of the overall 
computational cost. The evolving self-consistent 
effective potential was generated by a combination 
of the electrostatic potential and the VWN ex¬ 
change correlation term [24]. The accuracy of the 
method was tested in several model atomic and 
molecular systems. While accurate results were 
obtained for small Z atoms, the core states of 
atoms such as Ne are not accurately represented 
on a uniform grid large enough to handle the 2 s 
and 2 p orbitals. 

The purpose of the present article is to present 
further numerical tests of the real-space multigrid 
methods. I first give a brief review of our multi¬ 
grid method and then summarize the equations 
which are solved numerically, namely, the Pois¬ 
son, Kohn-Sham, and Possion-Boltzmann equa¬ 
tions. Then results are presented for atomic struc¬ 
ture and it is shown that the full approximation 
scheme (FAS) multigrid method of Brandt et al. 
[25] works well for small Z atoms on uniform 
grids, but fails for larger Z due to poor representa¬ 
tion of the Is core and the 2s state. Nonetheless, 
converged numerical results can be obtained by a 
simple nested iteration procedure. Results are also 
presented for the CO molecule which provides a 
good test since the dipole moment is a difficult 
quantity to obtain accurately. New methods for 
solving the nonlinear Poisson-Boltzmann equation 
in ab initio colloid simulations (colloids treated 
explicitly, ions as a field) and for carrying out 
high-order conservative electrostatic computations 
are also described. 

Our work is related to recent work of the groups 
of Chelikowsky et al. [5], Bernholc et al. [4], and 
Gygi and Galli [6] However, each of these methods 
has employed pseudopotentials for the core states, 
while in our work we have treated the nucleus and 
core electrons also explicitly on the numerical grid. 
Part of the advantage of the real-space grids is that 
varying resolution can be employed in different 
parts of space without huge computational over¬ 
heads, and we seek to exploit this feature in the 
all-electron calculations. Gygi and Galli [6] utilized 
curvilinear coordinate calculations for variable res¬ 
olution, but care must be taken to ensure that the 
correct order of approximation is obtained during 
the multigrid coarse grid corrections or else the 
accuracy of the solution can be sacrificed [26]. Our 
high-order conservative equations method main¬ 
tains the correct order over the whole domain [27]. 
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Multigrid Description 

The multigrid technique was developed in the 
1970s to overcome the critical slowing down (CSD) 
problem which occurs in real-space relaxation 
methods for solving partial differential equations 
[28]. In short, multigrid combines iterations on the 
finest scale with correction cycles which pass to a 
sequence of coarser scales to smooth modes of the 
error on a large range of length scales. In this way, 
the solution is obtained to within the errors caused 
by placing the problem on the grid in roughly 
10-20 fine-scale iterations. The methods were first 
developed to solve linear elliptic problems like the 
Poisson equation. Subsequently, more general 
multigrid algorithms were developed to handle 
nonlinear problems including eigenvalue problems 
[25]. Here, I describe our multigrid method; an¬ 
other detailed discussion was given in a recent 
article [8]. 

The technique employed here is the FAS nonlin¬ 
ear multigrid method. The method is equivalent to 
the standard full multigrid method for linear 
problems, but can treat nonlinear problems such 
as the Kohn-Sham equations and the Poisson- 
Boltzmann equation as well. Consider the Poisson 
equation represented in real-space matrix notation: 

LU —/, (1) 

where L is the finite difference Laplacian, U is the 
potential (u denotes the current approximation); 
and / is —47rp (where p is the charge density). 
This matrix equation can be solved by one of the 
standard iterative relaxation techniques such as 
the Jacobi method or successive overrelaxation 
(SOR, used here for the multigrid smoothing steps 
on each scale). However, for a given grid spacing, 
the solution stalls after initial rapid reduction of 
the errors. 

This is the point at which the multigrid coarse 
grid correction cycle is performed to decimate er¬ 
rors at longer wavelengths. The problem is passed 
to the next coarser grid (H = 2 h, where H and h 
are the coarse and fine grid spacings, respectively), 
and a modified equation is solved (consider a 
two-level problem here where level one is the 
coarse scale, and level two, the fine): 

L^U 1 = l\f 2 + r 1 . (2) 


The superscript labels indicate the coarser level; l\ 
is an operator which locally averages (or restricts) 
the charge density; and t 1 is the defect correction 
and is given by 

r 1 = L l l\u 2 - lll 2 u 2 . (3) 

This function causes the coarse grid problem to 
"optimally mimic" the fine grid problem; put an¬ 
other way, if the exact solution were passed from 
the fine grid, no correction would occur. 

Iterations are performed on Eq. (2), and then the 
fine grid solution is corrected via 

u 2 <- u 2 + Iiiu 1 — l\u 2 ). (4) 

Here, 1 2 is the interpolation operator which takes 
the correction back to the fine scale, level 2. The 
FAS approach can be used for nonlinear problems 
since the iterations are performed on the function 
itself on all scales, and thus no linearity of the 
equations is assumed. A clear and detailed de¬ 
scription of the FAS method is given in the eigen¬ 
value paper of Brandt et al. [25]. 

We used a high-order finite difference represen¬ 
tation for the Laplacian [8, 29]. Along one of the 
three orthogonal axes, the eighth-order Laplacian 
is given by 

0 1 

V 2 ti(0 = 9 u(i - 4) + 128 u(i - 3) 

5040 h 2 

-1008 lid - 2) + 8064«(z - 1) 

-14350 u(i) + 8064(z + 1) 

-100 Su(i + 2) + 128 u(i + 3) 

— 9 u(i + 4)] + 0(h 8 ). (5) 

The three-dimensional version is just the sum of 
the three orthogonal components. Hence, there are 
3x9 — 2 = 25 terms in the Laplacian, as opposed 
to 7 for the second-order case. There is thus a 
three- to fourfold increase in the time of each 
iteration; however, a significant gain in accuracy 
results which allows a much larger grid spacing h, 
and this benefit far outweighs the increase in time 
of each iteration. The present calculations on a 
uniform grid would not be feasible with just the 
second-order form. The charge density is repre¬ 
sented on the grid, with the nuclei occupying a 
single grid point with a charge density of Z/h 3 . 
For the Poisson-Boltzmann equation, the charge 
density is replaced by the nonlinear exponential 
terms (below). The orbitals in the Kohn-Sham 
equations are represented similarly as a 3 d array 
on the grid. 
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In solving the Kohn-Sham self-consistent eigen¬ 
value equations, we closely followed the FAS pre¬ 
scription given by Brandt et aL [25] with the addi¬ 
tion of the self-consistent loop as the effective 
potential changes during the solution process. The 
same FAS equations apply, except the Laplacian L 
is replaced by the Hamiltonian in coordinate space 
and eigenvalue and orthogonalization steps must 
be added. In the Brandt method, they proposed 
that these steps need only be performed on the 
coarse scales since they are global operations. This 
dramatically reduces the computational cost since 
a Gram-Schmidt process scales as Cj 2 N gri(J/ where 
cj is the number of orbitals and N grid is the total 
number of grid points. Therefore, this step scales 
as the third power of the number of electrons. By 
carrying out the orthogonalization on the next 
coarser scale, the overall time is reduced by a 
factor of 8. Also, it has recently been shown that 
accurate results can be obtained with localized 
orbitals [16], so the orthogolization can, in princi¬ 
ple, be reduced effectively to order N. 

We compute the eigenvalues on the next coarser 
scale to the finest scale and perform the orthogo¬ 
nalization there also. The eigenvalues from the 
coarse scale are then used in the propagation of 
the orbitals on the fine scale. This method requires 
that an accurate representation of the orbitals is 
possible on the coarser scale to the point of yield¬ 
ing eigenvalues which correctly mimic those on 
the finer scale. This condition is not satisfied for 
the higher Z atoms, especially for the 2 s state 
which has a cusp and an oscillation not too distant 
from the nucleus. The method works well for mul¬ 
tiple orbitals of the hydrogen atom and the ground 
state of the helium atom on a uniform grid, but 
not for the C, O, and Ne atoms examined here. 
However, a simple nested procedure where the 
solution just passes successively from coarse to 
fine scales (without coarse grid corrections) con¬ 
verges and we have used this method to obtain the 
numerical results for those cases. It is clear that the 
grid refinements will allow a faithful representa¬ 
tion in the more strongly varying regions. These 
difficulties are not encountered in the electrostatics 
computations. At the end of the self-consistency 
steps, we perform one final Gram-Schmidt pro¬ 
cess on the fine scale followed by a Ritz projection 
which generates the optimal occupied subspace 
(i.e., all errors are orthogonal to the occupied sub¬ 
space). 


Theory 

For the quantum chemical calculations, the 
Kohn-Sham equations of the density functional 
theory [30] were solved using the VWN [24] 
parametrization of the exchange correlation poten¬ 
tial. All calculations were carried out at the spin- 
restricted level. The Kohn-Sham equations are as 
follows: 

[-^V 2 + v e/f \ *A/(r) “ e;^/(r), (6) 

with 

r p(r') 

v eff (i) = v ion (i) + f --- dr' + v xc ( p( r)). (7) 

jj j | j. _ r | 

The electron density is obtained from the orbitals: 

N/2 

p(r) = 2 ^ ip 2 . (8) 

i= l 

The evolving electrostatic potential is obtained by 
multigrid solution of the Poisson equation: 

V 2 <£( r) = -4rrp tot (r). (9) 

As mentioned above, the nucleus is represented 
discretely on the grid and the Poisson equation is 
thus solved simultaneously for the potential due 
to the nuclei and electrons in a single linear scaling 
step. This yields the first two terms on the rhs of 
Eq. (7). Note that a self-energy for the nuclear 
distribution must be subtracted in computing total 
energies. This is a one-time calculation on a finite 
grid for a given order of the Laplacian, since the 
self-energy scales as Z 2 /h. 

In related work, we have begun to examine 
efficient numerical techniques for simulating col¬ 
loid systems at the ab initio level where the point 
ions of the solution are represented as a field and 
the charged colloids are discrete charges on the 
lattice [31]. These methods stem from the lattice 
field theory derived by Coalson and Duncan [32, 
33]. This is an exact statistical mechanical theory 
of the ion gas in the field of the colloids, and 
the Poisson-Boltzmann equation emerges as the 
saddle point (mean field) in their functional inte¬ 
gral representation. Corrections to the Poisson- 
Boltzmann equation can be made but are very 
costly computationally. The Poisson-Boltzmann 
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equation is known to be quite accurate for mono¬ 
valent ions at low to moderate concentrations in 
solution. Typically, with colloid particles of charge, 
say 600 c, something on the order of 10 5 ions would 
be required, which is far beyond the capabilities of 
current simulation methods for charged systems. 
Thus, the Poisson-Boltzmann equation makes a 
bridge from a continuum description of the ions to 
a discrete representation of the colloids: 


V- (e(?)W>(r)) 



p s (r) + + 


-qn_e (i ^ r) ~ v{r) ]. (10) 


In this nonlinear equation, e(r) is the (perhaps) 
spatially varying dielectric constant, p s (r ) is the 
colloid source charge, n + is the equilibrium charge 
density at infinity for the positive ions (similarly 
for the negative ions), and v(?) is an excluded 
volume potential preventing the ions from pene¬ 
trating into the colloid. In our preliminary calcula¬ 
tions, we have held e(r) constant in space. The 
Helmholtz free energy of the ion gas (potential of 
mean force for the colloids) can be obtained from 
the potential 4>(r) [32]. The FAS multigrid tech¬ 
nique was used to solve this equation for a pair of 
colloid particles over a range of separations to 
generate the interaction potential in aqueous solu¬ 
tion. The same FAS method as used for solving the 
Poisson equation applies as long as the charge 
density is replaced by the full nonlinear expression 
on the rhs of Eq. (10). 


Numerical Results 

The hydrogen atom presents a prototype prob¬ 
lem with a fixed potential to examine the perfor¬ 
mance of the eigenvalue solver by itself. The elec¬ 
trostatic potential was generated numerically on 
the grid using the Poisson solver with the nuclear 
charge at the center of the calculation box. An 
eighth-order finite difference representation was 
used for the Laplacian, both for the Poisson and 
Kohn-Sham equations. All the computations dis¬ 
cussed here were carried out on a 33 3 grid except 
the Poisson-Boltzmann calculations (below) which 
were on a 65 3 grid. The first five eigenfunctions 
and eigenvalues were computed, corresponding to 
the Is, 2s, and 2 p orbitals. 

The numerical data are presented in Table I. 
The eigenvalues are most accurate for the 2 p states, 
which do not have a density at the nucleus. The 1 s 


TABLE I_ 

Multigrid data for hydrogen atom. 



H atom multigrid data 



Exact 

MG 

CG 

A(1s) 

-0.50000 

-0.51873 

-0.51874 

A(2s) 

-0.12500 

-0.12727 

-0.12889 

A(2p.1) 

-0.12500 

-0.12477 

-0.12472 

A(2p.2) 

-0.12500 

-0.12477 

-0.12472 

A(2p.3) 

-0.12500 

-0.12477 

-0.12473 

<r)(1s) 

1.5 

1.348 


< r){2s) 

6.0 

5.859 


<r >(2p.1) 

5.0 

5.006 


<r >(2p.2) 

5.0 

5.006 


(r)(2p.3) 

5.0 

5.013 



The fine grid spacing was h = 1.0 au. Ail data are given in 
au 


and 2 s eigenvalues are in error by 4 and 2%, 
respectively, while the 2 p eigenvalues are only 
0.02% in error. The three 2 p orbitals are degener¬ 
ate to 5 decimal places, which is some indication 
of the accuracy of the method. The difference in 
accuracy between ls/2s and 2 p states reflects the 
impact of the distributed nucleus and the grid 
representation on the orbitals with a cusp. Note 
that the coarse grid (CG) eigenvalues are very 
close to the final fine grid results, which shows 
that a relatively good representation is possible 
there. The first orbital moments are also given and 
these also show that the 1 s and 2 s states have the 
largest errors. Cuts through orbitals are shown in 
Figure 1; the errors in the Is and 2s states concen¬ 
trate near the nucleus. The FAS multigrid data 
were obtained with 20 fine-scale iterations. While 
the multigrid cycles converge to results very close 
to the fully converged grid results, the least con¬ 
verged orbital (as monitored by the residual) is the 
2 s state, which, again, provides some indication of 
the difficulty this orbital causes during the multi¬ 
grid corrections. 

The Kohn-Sham DFT self-consistency step was 
then introduced into the multigrid solver. Here, I 
present numerical results for the He and Ne atoms, 
since our previous results contained a minor error 
in the VWN potential. This only slightly changed 
the quantitative results for total energies but did 
alter the individual eigenvalues. The multigrid 
solver located the ground state of the He atom 
with 15 fine-scale iterations; the computed energy 
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r[au] 

FIGURE 1. One-dimensional cuts through the five hydrogen orbitals. The analytic curves are also presented for the Is 
and 2s states (dotted curves with larger magnitude at the nucleus). One of the p orbitals is nearly a node along this 
direction. 


was —2.832 au vs. the exact result of —2.835 au.* 
The first moment of the ground state orbital ((r)) 
was 0.926 au as compared to the exact Hartree- 
Fock value of 0.927 au [34]. 

The all-electron Ne atom provides a more se¬ 
vere challenge, and the multigrid solver did not 
converge on the uniform grid due to the difficul¬ 
ties discussed above. The coarse grid eigenvalues 
were substantially different from the converged 
fine grid results (especially the 2 s orbital), so the 
coarse grid corrections did not further reduce the 
errors. In this case, a simple nested procedure was 
used, passing successively from coarse to fine 
scales. In previous multigrid work [8], we com¬ 
puted eigenvalues and orthogonalized on all scales, 
and this technique converged (at more computa¬ 
tional expense). The computed total energy of the 
Ne atom was —128.417 au, which compares with 
the exact numerical result of —128.233 au. The 

*We obtained numerically exact VWN results from Dr. 
Svetlana Kotochigova in the atomic physics research group 
directed by Dr. Charles Clark at NIST. 


eigenvalues were Is (-29.253 vs. -30.306 exact), 
2s (—1.690 vs. —1.323 exact), and 2 p (—0.438 vs. 
-0.498 exact). The orbital moments were very close 
to our previous results. In agreement with the 
findings for the hydrogen atom, the 2 p orbitals are 
better represented on the grid than are the 1 s and 
2 s states. 

An important question for molecular systems is 
to what extent errors near the nucleus might affect 
electron densities for valence electrons and, thus, 
chemical binding. The CO molecule was chosen 
since both the atoms have an even number of 
electrons, and the molecule has a relatively simple 
"closed-shell" electronic structure (as compared 
say to C 2 ). The results were obtained by nested 
iterations as discussed above (Table II). The sepa¬ 
rated atom energies obtained in the spin-restricted 
approximation were adjusted according to the re¬ 
cent study of Baerends et al. [35] to obtain the 
correct spin-polarized separated atom limit. Spin 
restriction should have only a very minor effect on 
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TABLE II_ 

Data for the CO molecule calculation. 



CO molecular data 

Exact 

Grid 

A(1s(0 core)) 

—18.745(X a) 

-16.619 

A(1s(C core)) 

-9.912(X a) 

-9.376 

A(1cr) 

—1.045(Xa) 

-1.194 

A(2cr) 

—0.489(Xa) 

-0.562 

A(1ir X 2) 

-0.413(X a) 

-0.403 

A(3 a) 

—0.304(Xa) 

-0.319 

e b 

12.8 eV(LDA) 

13.3 eV 

E b 

11.2 eV(expt) 


E b 

7.9 eV(HF) 


E tot 

—112.79(HF) 

-109.54 

E t o, 

—113.377(expt) 


Dipole 

0.24 D(Xa)C _ 0 + 

0.266 DC“0 + 

Dipole 

0.112 D(expt) C“0 + 


Dipole 

0.274 D(HF) C + CT 



The fine grid spacing was h = 0.3560 au, which implies a grid cutoff of E cut = 39 Ry. The internuclear separation was 2.136 au. All 
energies are in au unless otherwise indicated. The individual eigenenergies are first given, followed by molecular binding energies 
and dipole moments in Debyes. HF refers to the Hartree-Fock limit. 


the CO molecule. In the molecular calculations, the 
boundary potential was obtained by a multipole 
expansion (to quadrupole order) of the charge 
density. 

The eigenvalues for comparison were obtained 
from the Xa calculations of te Velde and Baerends 
[36]. Thus, it is a close but not exact comparison. 
Generally, except for the core states, the agreement 
is reasonable. The binding energy is quite close to 
the exact LDA result [37] and is an improvement 
over the pseudopotential result (11.1 eV) of 
Chelikowsky et al. [5] The dipole moment is very 
close to the numerically exact Xa result [37], which 
should be quite close to the full LDA number. 
Pseudopotential calculations [5] gave a result (0.10 
D C~0 + ) which differs appreciably from the exact 
DFT-Xa number. The highest-lying a orbital is * 
shown in a contour plot in Figure 2. This compares 
closely to the corresponding Hartree-Fock orbital 
given by Huo [38]. 

The accuracy for the binding energy and dipole 
moment are surprising in view of the crudity of 
the treatment near the two atomic nuclei. The 
effective cutoff in the grid representation is only 39 
Ry, which is far smaller than would be required to 
obtain fully converged all-electron plane-wave re¬ 
sults. At any rate, the results are encouraging and 
show that viable numerical results for molecular 
binding are possible with the grid techniques. The 
next step is to improve the resolution in the core 


region via grid refinement techniques (below) to 
test how this affects the overall accuracy and 
multigrid efficiency of the method. 

Next, results are presented for the computation 
of interaction free energies between colloid parti¬ 
cles where the ion gas is modeled at the mean-field 
level. Here, a second-order Laplacian was em¬ 
ployed, and the Poisson-Boltzmann equation was 
solved with the FAS multigrid nonlinear tech¬ 
nique. Twenty iterations were used on the fine 
scale, and the free energy was converged to a 
small fraction of kT by this point. The interaction 
energy curve is presented in Figure 3. Consistent 
with previous studies, the pair interaction is purely 
repulsive in the solution. Recent experiments by 
Larson and Grier [39] suggested that potential 
minima may exist when the colloid particles are 
confined between plates or when in a crystal ge¬ 
ometry. We are beginning to examine the many- 
body colloid interactions to probe whether nonad¬ 
ditive effects occur for trimers and higher n- mers. 
In preliminary results, we see no indication 
of nonadditivity for the trimer [40]. While the 
Poisson-Boltzmann equation is inherently nonlin¬ 
ear and thus many-body, it neglects ion correla¬ 
tions, so the lack of nonadditivity in our calcula¬ 
tions may reflect this fact. 

Finally, I discuss a new high-order conservative 
equation scheme which should allow accurate 
computations on composite grids [27]. As dis- 
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FIGURE 2. Contour plot of the highest-lying a orbital of the CO molecule. The carbon atom is on the bottom. 



cussed above, these refinements will be crucial 
both for accuracy and multigrid convergence rea¬ 
sons, especially in the region of atomic nuclei in 
all-electron calculations. To motivate the need for 
the conservative equations, imagine a simple two- 
dimensional compositive grid with a patch in the 
center with grid spacing one-half the rest of the 
domain. When the multigrid correction cycle is 
carried out on the coarser scale, the defect correc¬ 
tion r is defined only over the region covered by 
the refinement patch. It is easy to show that the 
sum of r over the patch is not zero. Most of the 
terms actually do cancel in the interior, but some 
terms are left near the boundaries. By looking at 
Eq. (2), we can see this implies that the total source 
strength has been changed. Nonconservation of 
source strength can pollute the solution over the 
whole domain, especially in the case of the long- 
ranged Coulomb force. Bai and Brandt [26] 
developed a second-order scheme to balance the 
fluxes on the boundary and applied it to a two- 
dimensional problem. We extended their tech¬ 


nique to three dimensions (finite and periodic sys¬ 
tems) and have now generalized it for higher-order 
equations. The method also requires high-order 
interpolation near the patch boundary to ensure 
correct behavior over the whole domain. The re¬ 
sulting r boundary terms resemble higher-order 
flux operators, but do not correspond exactly. The 
net result is that the sum of r is zero to machine 
precision and high-order solutions are possible 
over the whole composite domain. Details of the 
method will be presented shortly [27]. Efforts are 
underway to incorporate the conservative equa¬ 
tions into the Poisson and Kohn-Sham solvers to 
improve the resolution near the nuclei. 

Summary 

A real-space multigrid method has been devel¬ 
oped for solving the linear Poisson and nonlinear 
Kohn-Sham and Poisson-Boltzmann equations. 
The advantages of the grid techniques include 
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Lattice spacings 

FIGURE 3. Interaction potential of mean force between two colloid particles as a function of lattice spacing. The 
computation was performed on a 65 3 grid. The lattice spacing was 1600 au and the charge on each colloid was 600e. 
The bulk ion concentration yielded a total number of 48,000 positive ions and 49,200 negative ions in the calculation 
box. The dielectric constant was taken to be that of bulk water at room temperature. The mean Debye length under 
these conditions is roughly 8000 au, or five lattice spacings. The colloids occupied one cubical lattice unit. The 
interaction is given in units of AT. 


adaptivity and linear scaling capabilities. For the 
Kohn-Sham case, tests have been carried out on 
atoms and small molecules. The results show 
promise that the grid techniques can generate rela¬ 
tively accurate all-electron results, even though the 
core region is treated crudely if the grid is uni¬ 
form. With the incorporation of composite grid 
techniques, greater accuracy can be located near 
the nuclei with only a small increase in computa¬ 
tional cost. Future work will focus on incorporat¬ 
ing our new high-order conservative equations 
scheme [27] into the quantum chemical grid 
method. Other areas of interest are the finite tem¬ 
perature DFT theory of Alavi et al. [41] and real- 
space density matrix methods [13, 18, 19]. In re¬ 
lated work, the Poisson-Boltzmann solver will be 
used to study colloid interactions in large systems 
and carry out Monte Carlo simulations of colloids 
and polyelectrolyte chains in solution. 
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ABSTRACT: A parabolic potential supports resonances. We examine how the 
resonance energies are transformed into Floquet quasi-energies when the particle is 
simultaneously subjected to a monochromatic electric field. © 1997 John Wiley & Sons, Inc. 
Int J Quant Chem 65: 487-497, 1997 
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resonances; complex scaling 


Introduction 

T he parabolic (or inverted harmonic) potential 
has been studied extensively for the same 
reason, which gives a special importance to the 
harmonic potential. The lowest energy levels in a 
well of arbitrary shape can often be calculated 
accurately from a local quadratic approximation of 
the potential around the equilibrium position. The 
diatomic molecule is the best example of this situ¬ 
ation [1]. For a polyatomic molecule, the normal 
mode approximation reduces complicated poten¬ 
tial energy surfaces to a sum of harmonic poten- 
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tials [2]. When a chemical reaction is considered, 
the potential energy surface in the region of the 
transition state is usually written as a sum of 
harmonic potentials plus a parabolic term ex¬ 
pressed with the reaction coordinate [3]. In a one¬ 
dimensional treatment of the reaction the trans¬ 
missivity through the parabolic barrier represents 
the cumulative reaction probability, which is the 
basic quantity to calculate the microcanonical and 
canonical reaction rate constants [4]. Bell [5] has 
made an extensive use of this approximation. An 
interesting application to heavy-atom tunneling as 
an explanation for the automerization of cyclobu¬ 
tadiene is given by Carpenter [6]. Use of a parabolic 
barrier has also been considered as a useful test for 
the elaboration of the method based on autocorre- 
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lation functions of the reactive flux operator to 
calculate thermal rate constants [7]. This approach 
avoids the preliminary evaluation of state-to-state 
reaction probabilities. 

Because the scattering of a particle by a parabolic 
barrier is one of the few exactly soluble problems 
of nonrelativistic quantum mechanics [8-10], this 
is the starting point of the mapping method ad¬ 
vanced by Miller and Good [11] to derive a semi- 
classical scattering method valid close to the top of 
an arbitrary barrier, both below and above the 
barrier maximum. It has also been possible to 
define solutions of the wave equation with outgo¬ 
ing boundary conditions, which are the analog of 
the resonant states (that is corresponding to poles 
of the scattering matrix) [10, 12]. The idea of asso¬ 
ciating resonances to barriers has been further ex¬ 
tended and applied to the identification of reaction 
thresholds by Friedman and Truhlar [13, 14]. As an 
example it has been shown [14] that the lifetimes 
of the transition states of the reactive collision 
H + H 2 -> H 2 + H are well accounted for by the 
resonances of smallest width of the effective 
parabolic barriers associated with the potential 
maxima along the vibrationnally adiabatic poten¬ 
tial curves. 

New perspectives have been opened recently in 
a quite different field, which could make the 
parabolic potential to correspond to a realistic situ¬ 
ation. It has been shown by Sundaram et al. [15] 
that by grading the composition x in superlattices 
of type GaAs/A^Ga^. it is possible to grow a 
sample presenting a so-called parabolic quantum 
well (in fact a harmonic well) between two barri¬ 
ers. This is useful to produce a resonant tunneling 
transistor with equally spaced peaks in the charac¬ 
teristics [16, 17]. This grading technique can, in 
principle, be used to produce a potential of arbi¬ 
trary shape, among which the parabolic barrier. 

In this study we consider the scattering of a 
particle by a parabolic barrier in the presence of a 
monochromatic field. The effect of a field on a 
particle trapped in a double-well potential has 
been very actively studied in recent years. The 
most spectacular result is that under certain condi¬ 
tions tunneling through the barrier separating the 
wells can be completely suppressed [18-20]. Our 
problem is somewhat different since the barrier 
separates two regions where free (rather than 
bound) motion is possible. We are thus close to the 
situations met when a rearrangement takes place, 
for instance electron or atom exchange. We will 
show that the Floquet quasi-energies and wave 


functions of this situation can be exactly deter¬ 
mined. These resonances should play a role in the 
reaction in the presence of the field since methods 
exist to reconstruct the cumulative reaction proba¬ 
bility from Siegert eigenvalues and eigenfunctions 
[4, 21] even when the resonances do not corre¬ 
spond to the intuitive idea of metastable states 
with long lifetimes. 

In the following section we recall that the reso¬ 
nance energies and eigenfunctions for a parabolic 
barrier can be derived in a very simple way with 
the help of the complex rotation method with a 
particular value given to the rotation angle. The 
third section is devoted to the derivation of the 
Floquet quasi-energies and eigenfunctions when 
the particle is interacting with a monochromatic 
field. The interaction is written in the length gauge. 
We show in the fourth section that the quasi-en¬ 
ergies can also be obtained from a limiting proce¬ 
dure applied to the eigenvalues of a time-indepen¬ 
dent Hamiltonian [22]. This procedure is valid for 
an arbitrary oscillator interacting with a one-mode 
field and yields analytical results in the present 
case. Finally the fifth section describes a numerical 
method applicable also to an arbitrary oscillator. 
This is based on a propagation plus matching 
technique [23] that yields directly the Fourier com¬ 
ponents of the Floquet wave function. Some exam¬ 
ples of multicomponent Floquet eigenfunctions for 
the parabolic barrier are presented. Complex rota- 
tion (or scaling) of the coordinate is required to 
implement this method. We study the transition 
from complete scaling to incomplete (or exterior) 
scaling. This allows for an investigation of the 
shape of the components of the wave function as 
they would result from a treatment without scal¬ 
ing since it is possible to extend arbitrarily far the 
point of junction between real and complex inte¬ 
gration paths. 


Resonances of a Parabolic Barrier 

For a Hamiltonian written in reduced units: 

H = -2 - *>o* 2 / (!) 

dX 

the elements of the scattering matrix have poles 
given by the formula 

E v = 2v + 1). (2) 
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This result can also be obtained in a very simple 
way with a complex rotation of the coordinate [4, 
12]. Because this technique will be useful when a 
time-periodic term is added to the Hamiltonian, 
we repeat here this derivation. After substitution 
of pe l6 to x in the wave equation, one looks for 
solutions vanishing for p -» ± °° of the equation: 



0 2iO, ,2 2 

- e (o 0 p 


x(p) = Ex(p). 


(3) 


With the choice 6 = 7 t/4 the wave equation is 
transformed into: 


d 


dp- 


• 2 2 
- ico^p 


x( p) = E x( p)- 


(4) 


After multiplication by i this becomes: 

^ 2 2 

This shows that p) is an eigenfunction of the 
harmonic oscillator x v ( p ) = xj< xe ~‘ (77/4) ) and iE v 
the corresponding eigenenergy co 0 (2v + 1). Equa¬ 
tion (2) for E v is recovered. In Ref. [12] it has been 
shown in detail how to recover the harmonic oscil¬ 
lator wave functions by the substitution x -» 
xe -*o/4) into the parabolic cylinder functions 
which are the solutions of the unsealed wave 
equation of the parabolic barrier. 


x( p) = iEx( p)- (5) 


Floquet Eigenfunctions and 
Quasi-energies for a Particle 
Scattered by a Parabolic Barrier and 
Interacting with an External Field 


The Hamiltonian is now: 


Hj =- 2 ~ u>\x 2 + Ax cos (cot). ( 6 ) 

dx l 


where xj < x ) tihe free oscillator wave function of 
quantum number v and x c (t) is a solution of the 
classical equation of motion: 

~~(t) + 2 o)lx c (t) + Acos(ad) = 0 . ( 8 ) 

L(t) is the Lagrangian: 

x 2 (t) 

L(t) = — - - Ax c (t)cos(wt). (9) 

In the spirit of the method giving the reso¬ 
nances of the field-free case, we scale x - x c (t) in 
the factor, which gives the spatial behavior of the 
wave function. This variable becomes u(t) = 77 x — 
r]X c (t). We also make the appropriate sign changes 
in the classical equation of motion and the La¬ 
grangian. The time-dependent wave function is 
written: 


T'Cw, 0 = £(u(f))exp(-/Ef) 


X exp 


f L(r)dr 


xit) 

H--— r\ 1 u(t) 


( 10 ) 


After introduction of this function in the time- 
dependent Schrodinger equation and some alge¬ 
bra, it is found that most terms compensate and 
that £(u) obeys the equation: 

d 2 £(u) 

E£(u) = -T} 2 - 2 - V (11) 

du 

Choosing 17 = e ~ l(7r/4) gives the time-indepen- 
dent equation of a harmonic oscillator already met 
in the field-free case [Eq. (4)]. This shows that £(w) 
is in fact xj^ x ” x c (t))e ~* (7r/4) ) and that E is 
— ico 0 (2v + 1). However to identify a Floquet 
quasi-energy, it is necessary to obtain a solution of 
the time-dependent Schrodinger equation of the 
form: 


We start from the general solution of the time- 
dependent Schrodinger equation given by Kerner 
[24] for a harmonic oscillator interacting with the 
same field [the Hamiltonian differs just by the sign 
given to the potential in Eq. ( 6 )]: 


^(x ,0 - XzX x ~ x c (t))exp(-i(2v + l)co 0 t) 


X exp 


L(r)dr 


+ 


x „(0 


(x - X r (t)) 


(7) 


^ F (x, t ) = exp[ -fE F f]<E F (x, t) (12) 

with <h F (x, 0 a periodic function of time. This 
defines the Floquet quasi-energy E F and the Flo¬ 
quet eigenfunction <h F (x, t). In order to achieve 
this goal we choose a periodic solution of the 
classical equation of motion for x c (f) and we sepa¬ 
rate out the nonperiodic time-dependent factor in 
the wave function (Breuer and Holthaus [25]). The 
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periodic solution of the classical equation is: 


xXt) 


2 A cos( cot) 
o ) 2 + 4wq 


(13) 


and the nonperiodic factor in the wave function is: 


exp[—ico Q (2v + l)£]exp — i 


Jo L(r)dr 


t . (14) 


Introduction of x c (t ) as given by Eq. (13) into 
the Lagrangian allows for the calculation of the 
integral in Eq. (14). The Floquet quasi-energies are 
finally: 


- -i(o 0 (2v + 1) + • 2 2 v • (15) 

2(a) + 4 o)q) 

We note the similarity of this result with that of 
the harmonic oscillator [22, 25]. The shift with 
respect to the field-free energy (purely imaginary 
in the present case) is independent of the quantum 
number and differs by a change in the sign of the 
contribution of the potential. 


Another Route to the Floquet 
Quasi-Energies of the Parabolic 
Barrier 

We show now that these quasi-energies can be 
obtained in an extremely simple way already used 
with success for the harmonic oscillator [22]. 

A Floquet solution of the time-dependent 
Schrodinger equation can be written: 

n = +oo 

V F (.x,t) = exp[-iE F t] £ U„(x)e inut . (16) 


tl = - co 


The U n (x)'s obey the coupled differential equa¬ 
tion 


The eigenfunctions of the y oscillator can be 
used to perform a diabatic expansion of an eigen¬ 
function of this Hamiltonian: 


©(*/ y) = Ew„(*)x„(y)- 


(19) 


The coupled equations for the W n (x) are found 
to be: 


- colx 2 VSfAx) 


~ f fi / n + 1 

+ Ax + i/-W„ +1 (x) 

\ (o V co 


= (E - nco)W(x). 


( 20 ) 


For very large values of n the couplings for the 
relevant W„(x)'s for a given A depend weakly on 
n. As shown by Shirley [26], in this limit the 
components of the wave function of the time-inde¬ 
pendent approach merge into the components of 
the Floquet eigenfunction. For this equivalence to 
hold use the correspondence rule 


— = lim 

2 A-> 0, n oo 


A {n 
yfd) 


( 21 ) 


The eigenfunctions and eigenenergies of H(x, y ) 
can be calculated exactly. The limit given by Eq. 
(21) is to be taken afterward. With two new vari¬ 
ables x r and y 1 written: 


x' = x cos(O) - y sin(0), 
y' = x sin(0) + y cos(0), 


( 22 ) 


the Hamiltonian becomes separate if d satisfies: 


- y +V(x) U„(x) + —[U n _ 1 (x) + U n+1 (x)] 

= (E f - nw)U„(x). (17) 

We consider also the time-independent Hamil¬ 
tonian: 


tan(20) = 


+ COi 


The new frequencies are given by: 


(23) 


H(x, y) = - 


W ° X ~Jy i + \2’J V +A ' ry ' 

(18) 


m \ 2 A 


fli = —«qCos 2 (0) + I — I sin 2 (0) - — sin(20), 

(24) 
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a 


= — WnSin 2 (0) 


the shift is: 


ca w A 


+ 1 — 1 cos 2 (0) + —sin(2 0). (25) 


A„ = 


2( (O ^ + 4 (On ) 


(33) 


The eigenenergies are the sums of the resonance 
energies of the new parabolic barrier and of the 
eigenenergies of the decoupled harmonic oscilla¬ 
tor: 



— + 1)0^ + 0?2 T ^)f^2' (26) 


in complete agreement with Eq. (15). 


Algorithm for Resonance Wave 
Functions 


The shift induced by the coupling is: 

A„i ,„ 2 = ”*’( 2n i + ~ w o) 

+ (n 2 d - ^)(fl 2 — Ct>). (27) 

As A -> 0, we have flj -> o) 0 and fl 2 w. 
However, the correspondence rule requires n 2 to 
go to infinity. We are left with: 

^n l ,n 2 = ^ 2(^2 _ (28) 

Since, as A -> 0, we have 6 -» 0, 6 can be given 
the value: 


0 = 


(29) 


(O 


+ (o; 


The frequency 11 2 can be obtained from: 


Il¬ 


ea 


2 I = ~ OJ o ° 2 + I y I (1 ~ ^ 2 ) + A0 


/ (X) s 

2 

\ 2 , 



1 + 


4 A' 


co 2 (a> 2 + 4o>o) 
A binomial expansion gives 

2 A 2 

fl 9 CO + — 7 —z -r— . 

o)( co 2 + 4o>q) 


(30) 


(31) 


The Floquet wave function given in Eq. (10) can 
be written in principle in a standard way [cf. Eq. 
(16)] showing the spatial function U n (x) associated 
to each "photon" state (i.e., is to each exponential 
e inwt ). However, the wave function is a product of 
five exponentials of periodic functions of time. The 
Fourier expansion of the Floquet function is there¬ 
fore a product of five Fourier series which could at 
least formally be written compactly with general¬ 
ized Bessel functions [27]. Rather than trying to 
analyze this rather complicated expression, we will 
show that the graphical display of these Fourier 
components can be easily obtained from a rather 
general numerical algorithm. 

We have developed previously [23] an efficient 
method to obtain the eigenenergy and the eigen¬ 
function of a resonance in a multichannel situa¬ 
tion. This search for a Floquet quasi-energy and its 
associated wave function belongs to this class of 
problem. We recall here the main steps in this 
approach. The Fox-Goodwin propagation method 
[28] is used. This consists first in the definition on 
a grid of the inward and outward matrices: 

K = u; + 1 (u;)"\ ( 34 ) 

K = Um-iCU,") -1 . (35) 

where the U m 's are matrices of independent vector 
solutions of the coupled equations at point m of 
the grid. 

The propagation is performed with: 


The shift is: 


” 2(^2 2M 2 w(w 2 + 4ft) 2 )• (32) 


K-i = [2/B(x) - a(x + h)Ti,]~ l a(x - h), (36) 
and 


After introduction of the correspondence rule 
[Eq. (21)], with n x identified with v and n 2 dropped 


P„“ +1 = [2j8(x) - a(x - h)P°]~\(x + h), (37) 
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where a(x) and j8(x) are the Numerov matrices: 


a 


(x) = h 


j8(x) = h 


h 2 

1 + —W(x) 
12 

5 h 2 

1-W(x) 

12 


(38) 


(39) 


with W(x) = El - V(x), V(x) being the potential 
matrix. The matching condition at point n is: 

det|P,' - (P„° +1 ) _1 | = 0. (40) 

The initial values for the U's can be taken as 
zero in classically forbidden regions, since there is 
spontaneous generation [29] of the appropriate 
boundary conditions when propagating from such 
a region toward a matching point in the classically 
allowed region. This spontaneous generation holds 
also when complex rotation is used, since this has 
the same effect on the wave function as propaga¬ 
tion into a classically forbidden region. 

The entire procedure leads to an estimate of the 
left-hand side of Eq. (40) and is repeated until this 
condition is satisfied. The Newton-Raphson proce¬ 
dure can be applied because we look for a zero of 
an analytic function of the complex energy. The 
matching relation is also the compatibility condi¬ 
tion for a set of homogeneous equations satisfied 
at the matching point by the components of the 
vector solution. It is possible to calculate them, 
except for a normalization factor, and then to 
propagate them away from the matching point in 
both directions by using the matrices PJ, and F° r 
The same prescription is used in Johnson's renor¬ 
malized Numerov method [30]. The storage of all 
these matrices guarantees [31] that no overflow in 
closed channels or loss of independence occurs 
when reaching a classically forbidden region (or 
equivalently a region with damping of the compo¬ 
nents when complex rotation is used). 


Examination of Some Floquet 
Eigenfunctions of the Parabolic 
Barrier 

The emphasis in the context of resonances asso¬ 
ciated to barriers has been so far on the associated 
complex energies [8-10]. Little has been said about 
the behavior of the resonance wave functions. In 


Ref. [12] some scattering wave functions calculated 
at an energy equal to the top barrier energy have 
been shown. This energy represents the real part of 
all resonance energies [cf. Eq. (2)] of the field-free 
case. The scattering wave functions show clearly a 
localization in the region of maximum potential 
energy. In Ref. [21] the resonance energies of the 
Eckart barrier are derived and a particle-in-a-box 
technique is used to calculate some resonance 
functions for both the symmetric and nonsymmet- 
ric cases. Localization within the potential region 
is again demonstrated. 

We will display now some resonance complex 
wave functions of both the field-free case and 
field-on case for a parabolic barrier. 


FIELD-FREE CASE 

We consider the following values for the param¬ 
eters: 

o) = 1.2, (o 0 = 0.8, A = 0.5 (arbitrary units) 

(41) 


The "zero-point" field-free (v = 0) resonance is 
studied first. Two options are examined 

Complete Scaling of the Coordinate 

Coordinate x is changed into xe ld everywhere. 
A grid of 2000 points with step 0.01 is used. 
Although the choice 6 = 7t/4 has been shown to 
transform a resonance wave function of the barrier 
into a bound-state wave function of the harmonic 
well (cf. discussion in second section), examination 
of the factor e (uo ° x )/2 which is common to all 
resonance wave functions shows that localization 
is obtained for a range of angles, with 0 < 6 < tt/2. 
We give in Figure 1 the field-free resonance wave 
function for the following values of the rotation 
angle: 6 — 7t/4, 0.5, 0.3, and 0.5 rad. For all values 
of 0 except the smallest one, the resonance energy 
has a seven-figure agreement with the expected 
result ( —z0.8). For 0 = 0.05 rad the resonance en¬ 
ergy is found to be —0.001658 — i0.7948. This poor 
estimate is due to the fact that vanishing of the 
wave function at the two edges of the grid begins 
to be a constraint, as shown on the figure (panel 
with 6 = 0.05). A wider grid would restore the 
accuracy. 
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-10.0 -5.0 0.0 5.0 10.0 -10.0 -5.0 0.0 5.0 


FIGURE 1 . Field-free resonance wave functions of a barrier with complete scaling and various rotation angles. 
Parameters in arbitrary units are: barrier half frequency co 0 - 0.8, field frequency co = 1.2, coupling constant A = 0.5. 
The grid extends from -10 to 10 (arbitrary units). The calculation with 0 = 0.05 rad does not localize strongly enough 
the wave function, and a poor estimate of the resonance energy is obtained (see text). For 0= tt/4, the wave function 
is real. For all other values of the angle the wave function is complex. The normalization in all cases is chosen so as to 
give unit value to the amplitude of the real part of the wave function at the matching point (x = 0). The panels give the 
real and the imaginary parts of the wave function, the imaginary parts being given by the thiner curves. 


Exterior Sealing [32-34] 

Coordinate x is changed into x 0 + (x — x 0 )e l6 
for x > x 0 and into — x 0 + (x + x Q )e 10 for x < 
— x 0 , x 0 being some positive abcissa. The real part 
of the integration range extends from — x 0 to + x 0 . 
This has the advantage of displaying on the real 
segment the wave function as it would look in a 
real coordinate treatment (with complex boundary 
conditions compatible with the asymptotic behav¬ 
ior). With this procedure the asymptotic conditions 
for starting the left to right and the right to left 
propagations are again the vanishing of the com¬ 
ponents of the resonance wave function. We take 
x 0 — 5. The coordinate axis is deformed in the 
following way: from —10 to —5 the integration 
axis makes an angle tt + 0.5 rad, with the real axis; 


between — 5 and + 5 it is along the real axis; from 
+ 5 to +10 the integration axis makes an angle 
+ 0.5 rad with the real axis. 

Figure 2 gives the wave function of the field-free 
resonance wave function for quantum number 
v = 0. The energy was found to be — z‘0.80000017. 
The effect of complex rotation for | x | > 5 is clearly 
visible. 

FIELD-ON RESONANCE WAVE FUNCTION 

The parameters are now co 0 = 0.5, (0=1, and 
A = 1. We note that for such a choice the shift of 
the harmonic oscillator has a singularity, since in 
Eq. (15) o) 0 has to be replaced by ico 0 . The calcula¬ 
tion is now a multichannel one done with the 
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no scaling from -5. to +5. 



FIGURE 2. A field-free resonance wave function of a 
barrier with exterior scaling. Parameters are those of 
Figure 1. The grid extends from -10 to 10 (arbitrary 
units). The scaling is applied to the two segments from 
-10 to - 5 and from +5 to +10 with an angle 0 = 0.5 
rad. The energy provided by the calculation is E - 
— 0.13( — 6) - /'0.800017 (expected value E = -0.8). 
Normalization as in Figure 1, the real part of the wave 
function being unity at x = 0. 


procedure described in the third section. The strat¬ 
egy is as follows: the coupled equations [Eq. (17)] 
are infinite in number, and truncation is obviously 
necessary. We call the channel with n = 0 the 
reference channel With no field present the reso¬ 
nance energies and eigenfunctions of this channel 
are considered as describing the system. We note 
that the uncoupled channel with arbitrary n also 
represent the field-free system, but with eigenener- 
gies displayed by no. This is a distinctive feature 
of the Floquet theory. When the field is present, 
the energies close to those of the reference channel 
are being determined. The additional channels are 
introduced in a symmetrical way, one above and 
one below the reference channel at a time. Conver¬ 
gence with respect to the number of channels is 
then studied, since the critical number for a given 
accuracy of the quasi-energy can vary widely. Table 


TABLE I_ 

Convergence toward the correct resonance 
energy with the number N of channels in the 
multichannel approach to Floquet treatment of a 
parabolic barrier. 3 


N Resonance energy 


1 

— 0.90( —11) 

— /0.50000000 

3 

0.23492517 

— /0.56062542 

5 

0.25010394 

-/0.49954770 

7 

0.24982569 

— /0.50052883 

9 

0.25003014 

“/0.500000811 

11 

0.24999985 

— /0.50000035 

13 

0.25000008 

-/0.50000002 


a Parameters are in arbitrary units: barrier half-frequency w 0 
= 0.5, frequency of the field w - 1, coupling A = 1, rotation 
angle 6 = 0.5 rad. The results with complete scaling or 
exterior scaling for |x| > 5 differ at most by one unit in the 
last figure. Expected resonance energy: E = 0.25 - /'O.5. 
N = 1 is the field-free calculation. 


I shows how the exact energy is approached pro¬ 
gressively. A 7-figure accuracy is achieved in this 
example with 13 channels. Figure 3 gives five of 
the components of the 13-channel Floquet eigen¬ 
function, including the reference channel (panel 
n = 0), and two channels above (ft = 1,2) and two 
below (ft = — 1, —2). Despite the number of chan¬ 
nels needed for an accurate result, the amplitude 
of the wave function of the reference channel is 
still dominating the Floquet wave function (the 
components are not drawn on the same scale). 

Figure 4 shows the Floquet wave functions of 
the same model, with the same choice of functions, 
but with now exterior scaling for \x\ > 5. The 
wave functions on the interval — 5 < x < +5 
are calculated for real x: this is exactly how 
they would appear if (complicated) boundary 
conditions valid for a real coordinate treatment 
were introduced in the solution of the coupled 
equations. 


Conclusions 

We have shown that the methods (both analyti¬ 
cal and numerical) which can be used to obtain the 
Floquet quasi-energies and eigenfunctions of the 
harmonic oscillator require little change to give the 
corresponding information for the parabolic bar- 
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FIGURE 3. Field-free and field-on resonance wave functions for a parabolic barrier, with complete scaling (c. sc.). The 
parameters are given in Table I. Panel A = 0: the complex resonance wave function of the parabolic barrier in the 
absence of the field. In the other panels: five of the components of the 13-channel Floquet eigenfunction which leads to 
an accurate determination of the resonance energy (7-figure agreement with the analytical result, see Table I) The 
reference channel has n = 0. Channels n = +1 and n = +2 are above the reference channel, n = - 1 and n = — 2 are 
below. Every wave function or component wave function is complex. The real parts of the wave function when the field 
is off (A = 0), or of the reference channel wave function (n = 0) when the field is on, are unity at x = 0. The thiner curves 
represent the imaginary parts of the functions. Note that the components are not drawn all on the same scale. 


rier. This shows that the Floquet problem of the 
parabolic barrier can be added to the very few 
soluble cases of Floquet theory. Since the Floquet 
eigenfunctions make up a complete set for the 
expansion of an arbitrary solution of the time-de¬ 
pendent Schrodinger equation, they could be used 
to examine how the transmission across a barrier is 
affected by the presence of a field. It is legitimate, 
when the energy approaches that of the potential 
maximum, to use locally a parabolic approxima¬ 
tion for the potential. This is under study. 

It is also interesting to note that in the high- 
frequency regime (co » a) 0 ) the shift in Eq. (15) 
goes to the ponderomotive energy A 2 /2 <o 2 . If a 


barrier maximum is interpreted as a reaction 
threshold [13, 14], this means that the threshold is 
displaced upward by the ponderomotive energy, 
in a way similar to the shift occurring for the 
ionization threshold in an intense field [35]. 
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FIGURE 4. The model and selection of wave functions are the same as in Table I and Figure 3, but now exterior 
scaling (ext. sc.) is applied for |x| > 5 on the integration grid. Damping of the functions starts outside the real interval 
-5 < x < +5. Same conventions and remarks for normalization as in Figure 3. 
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ABSTRACT: Formalism of density-functional theory (DFT) is based on the calculus of 
variation. In the Hohenberg and Kohn theorem, a variational equation minimizing 
electronic energy with respect to an electron density is constructed. The calculus of 
variation allows one to formulate a problem which is reciprocal to an original one. Also, 
we may consider the problem of finding the electron density determining a given energy 
E = E[ p] for a maximum number N = N[ p] of the electrons forming the system. In this 
work, the reciprocal variational problem is discussed. Mathematical considerations are 
followed by a presentation of an application of the reciprocal problem (maximum 
entropy principle). Other possibilities of the applications are sketched. © 1997 John Wiley 
& Sons, Inc. Int J Quant Chem 65: 499-501, 1997 

Key words: density-functional theory; variational principle; reciprocal problem; 
maximum principle 


Introduction 

T here are many similarities among various ap¬ 
plications of density-functional methods. One 
of the main properties of this parallelism is an 
obligation of a variational principle. The most fun¬ 
damental application of the variational principle is 
a minimization of the electronic energy which, 
according to the Hohenberg-Kohn theorem, is a 
unique functional E = E v [ p] of an electron den¬ 
sity for a given external potential v = z?(r) [1]. The 
ground-state density-functional theory (DFT) can 

Correspondence to: K. Tkacz-Smiech. 
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be generalized to equilibrium states in ensembles 
of very many positive and negative ions at temper¬ 
ature 6 and chemical potential fi [2-5], FFaving a 
variational principle in hand, one can construct a 
variational equation dependent on the external po¬ 
tential z>(r). 

Generally, three minimum principles can be ap¬ 
plied in DFT: 

(i) For the ground-state energy, 

E v = F HK + \ V * P dT 'r 

(ii) For the equilibrium Helmholz free energy, 

H = E V - OS; 

(iii) For the grand canonical potential, 

Ft = E v — OS — ptN, 
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where H and fl may be derived as Legandre's 
transforms of the energy while S = S[ p] is an 
entropy and N = N[ p] determines a number of 
the particles in the system. In the variational treat¬ 
ment, one minimizes the functionals for E v [ p], 
H[ p], and Q[ p] via a variation of the electronic 
density function p(r). 


tion corresponding to the functional 

N = N — AE. (4) 

It results from the above that 


N = N- AE = 



\ 



— A (E — pN), 
(5) 


Reciprocal Principle in 
Zero-temperature Limit 

Assuming differentiability of the energy func¬ 
tional E v [ p] = E[ p], the variational principle 
states that the density p, which minimizes energy 
functional, is that one which satisfies the equation 

8{E[ p] - pN[ p]} = 0 (1) 


or, in the form of the Euler-Lagrange equation, 


8E[p] 

8p 


= 


( 2 ) 


The constancy of p is a result of variation of the p 
function under an isoperimetric condition N[ p] = 
const. 

As a functional E = E[ p] is, in general, a many- 
to-one relation, the calculus of variation gives no 
unique inverse of the derivative (2) [6]. But the 
calculus of variation allows one to consider a re¬ 
ciprocal problem which is formulated through a 
change of the roles of the optimized functional and 
the constraint [7]. According to the reciprocal prin¬ 
ciple, a stationary state of a system of constant 
energy is that one which is constructed by a maxi¬ 
mum number of the electrons. 

Let a given energy be equal to 

E = const (3) 

and p be a corresponding electron density due to a 
maximum number of the electrons. We can replace 
the density p by a density p' such that N[ p'] = 
N', E' < E and then deform p' so that it goes into 
p" while N' and E' are increased. It can be per¬ 
formed so that the energy again becomes equal to 
E. But, then, there is N" > AT, i.e., N" > N, which 
contradicts the definition of N. In the general case, 
we speak about a stationary value of the functional 
N[ p] with a constrained E[ p] = E 0 . The problem 
may be reduced to a solution of the Euler's equa- 


and from a necessary condition of maximization. 


8(N - A£) 

-I- = 0 (6) 

8p 

and 

8E 

1 = A—. (7) 

Sp 


Consequently, after cancellation, the same equa¬ 
tion as in the simple problem can be obtained: 

-A S(E-fiN) 


and, finally [compare with Eq. (2)], 


8E 

8p 



(9) 


Also, p equals the chemical potential: p = p = 
const. 

It follows from the above that the calculus of 
variation allows one to treat the problem of the 
stability of the electronic system through the 
equivalent reciprocal problem of maximization of 
the electron number for a given energy of the 
analyzed system. 


Reciprocal Variational Principle at 
Nonzero-temperature Limit 

Let us take a system at T # 0 and consider the 
changes which preserve the number of the elec¬ 
trons N = N[ p]. The energy E[ p] of the electronic 
system must be minimized subject to the following 
constraints: 

(i) The electron density integrates to the con¬ 
stant number of the electrons, to which the 
Lagrange's multiplier p is attached; and 

(ii) The entropy equals the observed value, S = 
S[ p] = const, with the Lagrange's multi¬ 
plier 0. 
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The variational principle states that 

8{E V [ p] - 6S[ p] - p,N[ p]} = 0, (10) 

or in the form of the Euler's variational equation. 


8E 8S 

- Q -/x = 0. 

8p 8p 


( 11 ) 


According to the reciprocal principle, the equilib¬ 
rium state can be achieved for a given N, E = const 
and for the electron density that gives a maximum 
entropy S[ p]. In this case, the maximum condition 
reads 

8S = 0 (12) 

for the functional 


S = S — a E — pN. 
The respective Euler's equation is 

8S 8(S - olE - PN) 

Sp 8p 

and, after transformation, 

8E p 1 8S 

8p a a 8p 


(13) 


(14) 


(15) 


Comparison of the above expression with Eq. (11) 
proves that, for the multipliers 1/a = 6 and p/a 
— — pi, there is an equivalency of the simple prob¬ 
lem of the energy minimization and the reciprocal 
problem of the maximization of the entropy. 


Summary 

Calculus of variation allows one to consider the 
reciprocal problem of the original problem mini¬ 
mizing the energy of the electronic system. This, as 
we have shown, can be established directly by 
constructing and solving the corresponding 
Euler-Lagrange equation. The proposed method 
can be used in the description of the equilibrium 


state of a given energy, equal to the experimental 
value, but of a varying number of the electrons 
forming the system. Such an attitude seems to be 
of a special significance from the point of view of 
possible application for the cases with a charge 
flow between chosen loges. In [8], a metallic crys¬ 
tal composed of spherical muffin-tins was consid¬ 
ered. It seems that further applications, e.g., in 
making a population analysis for molecules, may 
be interesting. A reciprocal principle maximizing 
N[ p] for the assumed experimental electronic en¬ 
ergy gives a theoretical basis for semiempirical 
methods. 

Another maximum principle presented in this 
work is the maximum entropy principle. It is of 
interest as to how the maximum hardness princi¬ 
ple [9-11] may be established from the reciprocal 
problem. One must look for an answer using the 
second equation of the hierarchy of the equation 
for the energy functionals [8, 12-13], 
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ABSTRACT: The time-dependent Schrodinger equation for the 1-D Hj molecule 
(with both nuclear and electronic degrees of freedom included) was solved numerically. 
A wave-function splitting technique was used, which allows one to circumvent the 
problem of lost information due to commonly used absorbing boundaries of the electron 
flux. This technique allows us to calculate the above-threshold ionization (ATI) 
photoelectron kinetic energy spectra in the presence of moving nuclei, as well as 
complete spectra of dissociating protons, beyond the Born-Oppenheimer approximation. 
A considerable enhancement of the ATI spectra, with respect to the spectra generated by 
a H atom, were found. The peaks seen in calculated Coulomb explosion spectra of 
protons agree well with the predictions of recent theoretical work related to the 
phenomenon of charge-resonance-enhanced ionization (CREI). © 1997 John Wiley & Sons, 
Inc. Int J Quant Chem 65: 503-512, 1997 


Introduction 

T he interaction of intense laser pulses (inten¬ 
sity I > 10 13 W/cm 2 ) with atoms and 
molecules leads to many new interesting multi¬ 
photon, nonperturbative phenomena such as 
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high-order harmonic generation and above¬ 
threshold ionization (ATI). This has been an area 
of active research in the past decade [1-4] and 
there has been extensive progress in the under¬ 
standing of these processes in atoms. The situation 
for molecules is not as advanced because of the 
complexity arising from the additional nuclear de¬ 
grees of freedom. Despite the fact that the elec¬ 
tronic and nuclear characteristic time scales are 
very different (attosecond, 10“ 18 s vs. femtosec¬ 
ond, 10“ 15 fs), the intense laser field induces a 

CCC 0020-7608 / 97 / 050503-10 



CHELKOWSKI, FOISY, AND BANDRAUK 


complex correlation between both motions, which 
we attempt to study by solving numerically the 
time-dependent Schrodinger equation (TDSE) us¬ 
ing the exact, 1-D, three-body Hamiltonian for a 
Hj molecular ion in an intense, linearly polarized 
laser field. We reported earlier results of simula¬ 
tions based on TDSE for H \ with 3-D electron and 
1-D protons for 212 and 600 nm wavelength [5,6]. 
Similar calculations for 1-D electrons and 1-D pro¬ 
tons were also reported recently [7], In all these 
studies, absorbing boundaries were used to re¬ 
move the electron flux from the finite electron grid 
(in [7], a nuclei absorber was used as well). There¬ 
fore, so far, ATI kinetic spectra in presence of 
moving nuclei have not been calculated. Similarly, 
removing the electron flux causes the loss of infor¬ 
mation about Coulomb exploding protons, with 
which the lost electron flux was linked (i.e., the 
information about the p + p + e~ channel was 
lost) and, consequently, the high-energy part of the 
reported proton spectrum was incomplete. 

We report, in the present article, the first com¬ 
plete calculations of Coulomb explosion spectra of 
protons and the ATI electron kinetic energy spec¬ 
tra from the dissociating HJ. The difficulty with 
the lost information through absorbing boundaries 
is overcome with the help of the wave-function 
splitting technique [8,9], which allows one to prop¬ 
agate in time the absorbed part of the wave func¬ 
tion in the asymptotic region (i.e., for the electron 
coordinate z far from the molecular center of mass) 
using a much faster algorithm than that used in 
the zone close to nuclei. We first describe this 
technique for the frozen internuclear distance, in 
detail, in the following section of our article. In the 
third section, we present details of our exact (and, 
therefore, non-Born-Oppenheimer, with unfrozen 
nuclei) calculations and show how the wave-func¬ 
tion splitting technique was implemented for the 
case of moving nuclei. In the fourth section, we 
present the kinetic energy spectra of the dissociat¬ 
ing nuclear fragments, which show clear signa¬ 
tures of the charge-resonance-enhanced-ionization 
(CREI) phenomenon. CREI was first discussed in 
[10] and further studied using models with frozen 
nuclear motion [11-13]. Thus, it remained to be 
investigated whether the effect of enhanced ioniza¬ 
tion would be washed out by the nuclear dynam¬ 
ics. Our previous non-Born-Oppenheimer calcula¬ 
tions [5,6] did not show very clear CREI peaks in 
the proton spectra due to the above-mentioned 
electron flux losses through the absorbing bound¬ 
aries. By contrast, the wave-function splitting tech¬ 


nique yields sharp Coulomb explosion peaks due 
to CREI and also a weaker peak originating from 
the center of the initial wave packet. The fifth 
section contains the ATI electron spectra from H 2 , 
showing considerable enhancement of peaks, both 
in the low-energy part and in high-energy part: A 
very broad high energy tail in energy spectra of 
electrons are seen, extending up to 16 U , where 
U p = e 2 El l /4mco 2 is the electron ponderomotive 
energy. Typically, the bulk of (about 99%) ionized 
electrons have strong ATI peaks in the energy 
range less than 3U p (this limit corresponds to the 
maximum drift velocity for electrons born in the 
continuum with zero kinetic energy [14]). Re¬ 
cently, it was pointed out that, contrary to the 
harmonics generation spectra, the ATI electron 
spectra do not have a sharp cutoff but extend 
much further than 3U p due to the rescattering of 
an electron returning to the nucleus [14-17]. So 
far, these effects were much discussed for ATI 
from atoms. Recent measurements of extremely 
hot (multi-keV) electrons ejected from a xenon 
cluster [18] suggest that these rescattering effects 
may be much stronger in molecules and clusters 
than in atoms since the electrons ionizing toward 
neighboring dissociating nuclei can scatter on them 
and absorb additional photons. Obviously, such a 
possibility does not exist in atoms or atomic ions. 
Therefore, the hot electrons are less likely in atoms, 
since they can originate only from the returning 
electrons scattered on one nucleus, while in the 
dissociating molecule, an ionizing electron can en¬ 
counter directly on its path another scatterer on 
which additional photons can be absorbed. The 
investigation of this issue is the main focus of the 
fifth section. 

We use, in the present article, a 1-D model for 
both electronic and nuclear degrees of freedom. It 
appears that the 1-D dynamics describes reason¬ 
ably well the movement of the atomic electrons [2] 
in the linearly polarized, intense laser fields (I > 
10 14 W/cm 2 ) and reproduces correctly such phe¬ 
nomena as high harmonic generation or ATI. Since 
the molecules in intense fields are aligned, due to 
the torque from laser-induced polarizabilities [19], 
one may expect that 1-D molecular dynamics can 
also reproduce the most saliant features of the 
dynamics in intense laser fields ((/ > 10 14 W/cm 2 , 
i.e., above the ionization threshold), in particular, 
in the long wavelength regime, when the molecule 
dissociates via the bond softening mechanism and 
the vibrational trapping does not take place [4] (it 
was shown recently [20] that for wavelengths much 
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shorter than 600 nm and for intensities below the 
ionization threshold the vibrational trapping can 
cause dissociating fragments to be emitted at right 
angles to the polarization direction). Such a 1-D 
molecular dynamics was presented recently [7]. It 
appear that the potential surfaces resulting from 
1-D electron motion are quite close to exact 3-D 
surfaces, providing, thus, a model for nuclear mo¬ 
tions with correct molecular harmonic and anhar- 
monic constants as well as correct dissociation 
energy [7]. 


Description of the Wave Function 
Splitting Algorithm for Solving the 
Time-dependent Schrodinger Equation 

One of the theoretical approaches to calculate 
the ATI spectra relies on exact numerical solutions 
of the time-dependent Schrodinger equation 
(TDSE) on recently available powerful computers. 
So far, this approach has been successfully applied 
to one-electron atoms and molecules (or two 1-D 
electrons [21]). Typically, in such numerical simu¬ 
lations, the initial wave function of the electron is 
well localized in space and later becomes spread 
over very large distances. For a short, intense, 40 fs 
laser pulse, the front edge of the wave packet 
of the ionizing electron can be as far as 4000 
bohr from the nucleus (for I = 2 X 10 14 and A = 
600 nm) and continues to expand for longer times. 
Thus, to avoid reflections from the boundaries, 
prohibitively large boxes (grids with 32,000 points 
or more), in which the electron wave packet is 
fully contained, must be used. The most common 
solution to this difficulty is to use much smaller 
boxes with absorbing boundary conditions. This 
approach allows one to calculate the ionization 
rates and harmonic generation spectra but it does 
not allow one to calculate the photoelectron (or 
ATI) spectra, since the external part of the wave 
function is absorbed and lost. Also, in our recent 
3-D numerical studies of competition between ion¬ 
ization and dissociation [5,6], the use of absorber 
for the electron flux led to a considerable loss of 
information about the nuclear movement, i.e., we 
were unable to reproduce completely the spectra 
of dissociating fragments from the Coulomb explo¬ 
sion channel since these fragments were linked 
with the absorbed electron flux. In this article, we 
develop a technique which allows one to recon¬ 
struct the absorbed electron wave packet in mo¬ 


mentum space and propagate it in time, analyti¬ 
cally, to any time after the pulse turn-off. Thus, 
with the help of this technique, one can calculate 
efficiently the photoelectron spectra after the turn¬ 
off of the pulse. A similar technique was used 
earlier [22] for calculating the kinetic energy spec¬ 
tra of photodissociation products, with the 
laser-molecule coupling neglected in the asymp¬ 
totic region. Recently [23], this technique has been 
generalized for nondecaying (for large distances) 
dipole couplings (for dissociating molecules) and 
for the calculation of the ATI spectra from time- 
dependent Schrodinger equation [8,9,24]. In the 
present work, we extend this technique for solving 
the TDSE for molecules with moving nuclei. Since 
the wave-function splitting technique is quite com¬ 
plex in practice, we present, first, this technique 
for the case of frozen nuclei (or for the hydrogen 
atom) and explain in the next section how the 
technique was implemented for the case when the 
nuclei move. Let us consider a time-dependent, 
1-D Schrodinger equation' written (in au) in the 
following form: 


.#( 2,0 


1 d 


dt 


2 <?z 2 ' ' c 


\ 

-f-V^Cz, R) + zE(t) 0, 

( 1 ) 


where 

V c (z, R) = - . . 

yi + (z - R/2) 

1 

- 7 .7 (2) 

y 1 + (z -L R/2 ) 

and E(t) is the laser electric field. This equation is 
usually discretized and solved numerically for |z| 
< z M . We tested this method [8] for the case of the 
1 — D H atom in the laser field of intensity I = 2.2 
X 10 14 W/cm 2 , A = 630 nm, and t p = 20 cycles 
(1 cycle = 2.17 fs). We compared our results with 
those from [17]. The photoelectron spectra from 
this reference show a plateau extending up to 
SU p = 70 eV with a slowly decaying tail up to 130 
eV (70 photons absorbed), where U p = 
e 1 El A /{^mco 2 ) is the electron ponderomotive en¬ 
ergy. Thus, the fastest electron has a speed v M = 
2.3 au and in t = 43 fs can be as far as 4050 bohr. 
Typically, the integration step in space is Sz — 0.25 
bohr, which means that the grids with over 32,000 
grid points must be used for a 43 fs pulse. For n 
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times longer pulses, n time greater grids should be 
used. This constitutes a serious difficulty, even for 
1-D problems. Therefore, a fast algorithm for large 
z's is very desirable. In particular, since the 
Coulomb forces are smooth and negligible for large 
z's, larger integration steps in space 8z, as well as 
much larger steps in time, can be used asymptoti¬ 
cally. 

Let us divide the total grid into three overlap¬ 
ping regions in space, in which different evolution 
algorithms will be used: the internal and two ex¬ 
ternal (asymptotic, for positive and negative z's) 
parts defined by 

internal : |z| < z in and external: z ex < |z| < z M , 

(3) 


lows us to perform the time evolution exactly and 
over an arbitrary time interval by calculating first 
the wave function in the velocity gauge ty v (z r t) 
with the help of the formula 

i fa(z, R, t ) = exp(“f A(f,0)z)^ ex (z, R, t), (11) 

where 

A(f 2 ,t 1 ) = - C 2 E(t)dt (12) 

is the electric field area over the interval t 1/ 1 2 . 
Next, we calculate the Fourier transform <p v (p, t) 
of ty v (z, 0/ which evolves in time according to the 
equation 

<P V ( p, R, *2) = U(t 2/ tjcpyi p, R, fj), (13) 


where z in — z ex = z 0 > 0 is the size of the overlap 
of both regions. In this overlapping (or 
"matching") region, an absorbing potential is in¬ 
troduced, which we have chosen in the following 
form: 



for z ex < |z| < z in . 

(4) 


where the exact propagator U is given by 


1 

U(t 2 ,t x ) = exp - — / 2 (p 2 + 2 A(f,0)p 

2 J h 


4- A 2 (f, 0)) dt 


. (14) 


In our algorithm, we perform the series of fol¬ 
lowing operations at each time t k = k 81: 


The electron wave function i//(z, R, t ) is split into 
internal if/ int and external if/ ext parts, by applying 
this absorbing potential in the overlapping region, 
at times t = k 8t, where 8t is much larger than 
the integration time dt (dt = 0.03 au, St = 
150 dt = 0.05 cycle, i.e., in our simulations for A = 
600 nm, 8t is much less than is the laser cycle) 
with the help of the following formulas: 

l Hz, R, t) = ty in (z, R, t) + ty ex (z, R, t), (5) 
^• n (z, R, t) =/(z, 8t)ik(z, R, t), (6) 

^ Y (z, R, 0 = (1 ~/(z, 8t))il/(z, R, t), (7) 

where 

f(z,8t) =exp(-8f V^ s (z)) for z ex <\z\<z in , 

( 8 ) 

/(z, 50 = 1 for \z\<z ex , (9) 

and 

/(z, 8t ) = 0 for |z| > z in . (10) 

Our calculation scheme relies on the assumption 
that in the asymptotic zone (defined by |z| > z ex ) 
the Coulomb potential can be neglected. This al- 


(i) The exact time evolution (including field 
+ Coulomb potential) of ip in is evaluated 
in the internal zone (|z| < z in ) using the 
split-operator spectral method [25], as orig¬ 
inally applied to Hj [25]. 

(ii) The outgoing wave is eliminated from the 
internal zone with the help of Eq. (6). 

(iii) The splitting operation described by Eq. (7) 
is performed to provide the outgoing wave 
ip ex defined in the "matching zone," z ()J < 

Izl < z in . 

(iv) The velocity gauge wave function i/j v (in 
the external region) and its Fourier trans¬ 
form cp v are calculated in the overlap 
region: 

<Pv(P't) t) 

= (27r) 2 [ Ul exp(-ipz)i[f v (z, t k ) dz. (15) 

} z cx 

(v) The momentum wave function <p( p, tf) is 
propagated to some final time t*> t p (at 
which the photoelectron spectra are to be 
calculated), using the exact formula 

<P V ( p, R, t f ) = U(t f ,t k )cp v (p,R,t k ). (16) 
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(vi) The momentum wave function cp v ( p, R, tf) 

is added to p, R, fy)'s accumulated 

at previous steps, i.e.: 

<P ( v k) ( p, R, t f ) = (pi k ~ 1 K p, R, t f ) 

+ <p v (p, R, t f ). 

(vii) The loop ends here and calculation starts 
at step (i) with the integer k replaced by 
k + 1. 


The most difficult part of the scheme presented 
here is the choice of the free parameters: V abs , St, 
z in , and z 0 . Several obvious necessary (but not 
sufficient) physical condition should be satisfied. 
First, the external zone (|z| > z ex ) in our scheme is 
the asymptotic region in which we neglect the 
Coulomb forces; therefore, we must require that at 
z = z ex the Coulomb force is much less than the 
electric force eE M , i.e.: 


4” « (17) 

2 - v 


Second, our scheme is based on the assumption 
that the wave entering the "matching" zone (of 
length z 0 ) does not return back to the nucleus, as 
well as the wave exiting this zone does not return 
to it. This means that we should require that the 
classical ponderomotive radius a 0 is much smaller 
than is z ex and also smaller than the size of the 
matching "zone" z 0 : 


time necessary to move for the fastest electron 
across the matching "zone." Assuming that the 
fastest electron has the energy E max — 8II, we 
thus get 

8t <k fi= . (19) 

4ymli p 

In our calculations, St is also much less than the 
laser cycle. Finally, the absorption should occur on 
the distance z 0 longer that de Broglie wavelength 
of the slowest electron. Assuming that the slowest 
electron has the energy h co, we thus get 

z °» • <2o> 

For the maximal laser intensity chosen in this 
article, I — 2.0 X 10 14 W/cm 2 , and the wavelength 
A = 600 nm, the values of two relevant parameters 
are a 0 — 13.1 bohr, U p = 0.24 au = 6.5 eV, and 
a) = 0.076 au — 2.066 eV. We list in Table I the 
values of all parameters which we used in our 
calculation and which satisfy all inequalities Eqs. 
(17)-(20). 

The spectra of ionized electrons, defined for 
each R, where calculated according to the formula 

S(E,R ) = (| <p + ( p(E), R, f,)| 2 

+ \ (p ^ p (E),R,t f )\)-^, (21) 


eE 

and a 0 ^ z 0 , where a 0 = -r • 

mco 

(18) 

The above conditions, (17) and (18), ensure also 
that all possible interferences in continuum (which 
are expected to take place in the near nuclei zone) 
are appropriately taken into account by the exact 
algorithm used in the internal zone. In addition, 
we must require that the matching should be re¬ 
done after the time interval St shorter than the 


where cp + ,(cp_) are wave function corresponding 
to the electron moving in the direction of positive 
(negative) dir ection along the z axis, respectively, 
and p(E) = yJlmE . We checked the method by 
comparing our results for a 1-D H-atom with those 
of [17,24] and with the results from the one-box 
calculations. Slight disagreement between the 
heights (not positions) of several first photoelec¬ 
tron peaks was seen if our spectra were calculated 
right at the end of the pulse. This disagreement 
disappears when the spectra are calculated at later 


TABLE I_ 

Values of parameters (In atomic units) used in the wave-function splitting scheme. 

df St dz z ex Zq Hj n H ex ^ abs ^ max ^ R 

0.03 4.5 0.25 512 150 4196 7168 2 70 840 

n in is number of grid points in the internal zone (|z| < z in ) and n ex is the number of grid points in the external zone, in momentum 
space with the momentum resolution Ap = 27T-/7168 a.u. 
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times, tjr > t p (with the laser turned-off for tf > t p x 
since then all slow electrons have enough time to 
leave the matching zone. Therefore, in all calcula¬ 
tions, we allow the system to evolve without the 
laser field for several subsequent cycles and then 
calculate the ATI spectra (see [8] for more details). 

The two-box integration technique presented 
here was first applied for calculating the ATI spec¬ 
tra from aHj molecule frozen at an internuclear 
distance R [8] and those results were used later for 
comparison with the full non-Born-Oppenheimer 
calculations, presented in next sections. 


A Non-Born-Oppenheimer 1-D Model 
of H 2 + in Intense Laser Fields 


The potential V c (z, R ) was neglected in the exter¬ 
nal zone, whereas the Coulomb repulsion 1/R was 
retained in H(R). 

We used laser pulses of total duration 20 cycles 
(40 fs), having the wavelength A = 600 ran, with a 
five-cycle rise and fall. The values of various pa¬ 
rameters used in the present calculations are given 
in Table I. We performed numerically the time 
evolution of the wave functions \p in (z,R,t) and 
the external (pip, R, t) for an additional three cy¬ 
cles (6 fs), i.e., up to the final time fy = 23 cycles, 
using the Hamiltonian without the electric field, in 
order to allow some more electron-proton waves 
to leave the interaction region. 

The initial wave function, at t = 0, was as¬ 
sumed to be in the following form: 


We solved numerically the complete, three- 
body, time-dependent Schrodinger equation for 
moving electrons and nuclei (in au): 

difriz, t) 

i -—- = H(z, R, t)ipiz, R, t ) 

at 

where X 

H(z, R, t) = H r (R ) + V C ( Z/ R) + H z (z), (22) 


H,(z ) = -j8 


dZ 


KZE(t), 


1 d 1 1 
Hi R) — - —— T 


m p dR 2 R ' 




(2 m p + m e ) 
4 m p m e 


K = 1 + 


m. 


2 m p + m e 


(23) 


, (24) 


i//(z, R,0) = x(R)^o(Rr z), (25) 

where i/> 0 (R, z) was the eigenfunction of the elec¬ 
tron Hamiltonian in H 2 (without the laser field) at 
fixed R. Two distinct initial condition at t = 0 
were used: 

(i) Hj was initially prepared in its vibrational 
v = 6 *(R) = * 6 (R) state, or 

(ii) at t — 0, H 2 was prepared in a superposi¬ 
tion of H 2 vibrational states, coinciding with 
the ground vibrational state of the H 2 
molecule, i.e., 

x(R) = A'iT 2 = E c v Xv(R), (26) 

v = 0 


where 


and m c and m p (m e = 1 in au) are, respectively, 
the electron and proton masses. The Hamiltonian 
of Eq. (22) is the exact three-body Hamiltonian 
obtained after separation of the center-of-mass mo¬ 
tion [5]. The time evolution of the internal wave 
function ij/ in was performed in the same way as 
described in [5,25] (using the split-operator 
method), whereas the external part (obtained re¬ 
covered from the absorber at each 81 step) 
cp (k K p, R, t) was evolved with the help of the 
propagator [Eq. (14)] for times t 1 and t 2 = t 1 + 8t, 
where 81 = 150 dt (as previously). After each 
"big" time step 8t was accomplished, the new 
portion (pip, R, t + 81) was added to the previ¬ 
ously accumulated portion. The operation 
exp(-/df H(R)) was applied to the accumulated 
part of q>^ k Kp, R, t ) at each "small" time step dt. 


/ CO 

dR Xl ,(R) X* 2 (R) (27) 

J 

are the Franck-Condon factors. 

The latter initial condition, (ii), is close to the 
experiments, in which H J is prepared by a steeply 
rising edge of a pulse, which rapidly ionizes the 
H 2 molecule. 

To interpret the full calculation results, we first 
calculated the ionization rates for 170 values of 
R's, frozen, in the range from 3 to 33 bohr. The 
ionization rates for the laser wavelength A = 600 
nm are displayed in Figure 1. One observes strong 
charge resonance enhanced ionization (CREI) peaks 
around 7 bohr, which were discussed recently by 
many authors [10-13]. The CREI effect is incredi¬ 
bly strong for the relatively low intensity I = 5 X 
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FIGURE 1 . Ionization rates as a function of internuclear 
distance for laser intensities / = 5x10 13 ,10 14 , 2 x 10 14 , 
and 4 x 10 14 W/cm 2 at wavelength 600 nm. 


10 13 W/cm 2 and practically disappears at I = 4 X 

10 14 W/cm 2 , when at a short internuclear distance 
R = 4 au, the Hj ion ionizes with a rate close to 
the H-atom rate and the ionization rate curve is 
relatively flat. Note that in addition to those peaks 
some unexpected features appear for larger R val¬ 
ues; in particular, for lower intensities, I = 5 X 10 13 
W/cm 2 , one observes a deep minimum in the 
ionization rate at R = 16 bohr. It originates from 
the fact that an ionizing electron from one proton 
comes under the influence of the second proton 
and does not have enough time to escape from the 
second proton before the electric-field changes sign. 
Thus, this electron becomes "trapped" between 
the two protons. One may expect that the position 
of this minimum is proportional to the classical 
radius of the electron in the laser field a 0 . This is 
indeed confirmed by the position of this minimum 
for a twice higher intensity, which occurs ]fl times 
further. We are currently attempting to explain 
this minimum using classical dynamics of the elec¬ 
tron [26,27]. 


Kinetic Energy Spectra of Dissociating 
Nuclear Fragments 

The wave functions <// in (z, R, t f ), cp(p, R, tf) al¬ 
low us to calculate the simultaneous spectra of 
nuclear fragments S in ( z, p R , fy) and S ex (p, p R/ fy). 


where p, p R are the momenta of the electron and 
the protons, by simply calculating the Fourier 
transform of the asymptotic part (large R) of 
^ n (z, R, fy), cp( p, R, ty) with respect to the R vari¬ 
able and next calculating the square of absolute 
values of Fourier transforms (see [6] for details). 
The integrated over electronic degrees of freedom 
spectra S( p R ) are plotted in Figure 2 (the contribu¬ 
tions from internal and external boxes are added 
up). 

Two different initial condition were used in our 
calculations (see the previous section for a descrip¬ 
tion of these conditions), on which the spectra 
depend strongly. We show in Figure 2(a) spectra 
obtained from Hj which initially, at t = 0, was 
prepared in its vibrational v = 6 state (two laser 
intensities were used I = 10 14 and 2 X 10 14 



01 23456789 



NUCLEAR KINETIC ENERGY E (eV) 


FIGURE 2. Kinetic energy spectra of dissociating 
nuclear fragments. The numerical simulation was 
initialized either (a) from the v = 6 state of the HJ or (b) 
from the Franck-Condon superposition of vibrational 
states of HJ, defined by Eqs. (26) and (27). Laser 
intensities used: (-) / = 10 14 W/cm 2 ; (.) / = 2 

x 10 14 W/cm 2 , the wavelength A = 600 nm. 
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W/cm 2 ), whereas in Figure 2(b), we show the 
spectra resulting from Hj prepared, at t = 0, in a 
superposition of vibrational states, coinciding with 
the ground vibrational state of the H 2 molecule, 
multiplied by the electronic wave function of 
[see Eqs. (25)-(27)]. Figure 3 shows the same spec¬ 
tra on a logarithmic scale. The low-energy peaks, 
seen Figure 2, are related to the bond-softening 
mechanism [4]: In Figure 2(a), there are two very 
close peaks, at 0.26 and 0.43 eV. Their position is 

close to the net absorption of one photon (pre¬ 
sumably via a multiphoton process) from the v = 6 
state: The energy released into the dissociation 
channel from this process, when, initially, the 
molecule was in its vibrational state v, is given by 
a simple, conservation energy formula: 

E = nh(D - (~I p - E v ), (28) 

where I p = 0.67 au is the ionization potential of 
the 1-D H atom, the energy vibrational level v of 
H 2 (from which photons are absorbed), and n is 
the net number of absorbed photons (in general, it 




FIGURE 3. Same as in Figure 2 but on a logarithmic 
scale. 


is a multiphoton process: an absorption of m pho¬ 
tons followed by the reemission of k photons, with 
the condition that n — m — k > 0). For the wave¬ 
length A = 600 nm used in the present work, h co 
= 2.066 eV. Thus, from v = 6, for n = 1, a peak at 
0.542 eV is expected, and from v = 5, at 0.343 eV. 

Two factors may be responsible for the fact that in 
our simulation two peaks are seen, instead of one 
expected at 0.542 eV: (i) First, a rapid rise, in five 
cycles, can populate the neighboring v = 5 state 
(or, in other words, the peak is shifted by the fact 
that our pulse is spectrally very broad), and (ii) the 
downward Stark shift of the level v = 6 occurs. 
Similarly, the low-energy peak seen in Figure 2(b) 
originates from the net two-photon absorption from 
low vibrational states: Using n = 2, v = 0, 1 or 2, 
one gets the peak positions at 1.25,1.50, or 1.74 eV, 

respectively, whereas the simulation (which used 
the H 2 v = 0 state, as initial input for the vibra¬ 
tional wave function) yields a broad peak at 
E — 1.06 eV. Again, a considerable spectral width 

of our pulse and the Stark shifts are the main 
reason of the observed discrepancy between the 
position of expected and seen peaks, i.e., the pres¬ 
ence of photons having a frequency only 5% less 
than that of the carrier frequency can shift the peak 
by 0.2 eV or more and thus explain the observed 
shifts. 

The high-energy broad peaks, E > 3.8 eV, rep¬ 
resent the p 4- p 4- e~ Coulomb explosion channel, 
since in our calculation, they originate from the 
external part of the wave function <p( p, R , tjr). At 
laser intensity I = 10 14 W/cm 2 , we see a broad 
peak at £ = 3.8 eV. If the Coulomb explosion oc¬ 
curred with initial kinetic energy equal to zero, 
then we can conclude that the Coulomb explosion 
should be initiated at R = R c = 7.16 bohr, in order 

to release the Coulomb energy E Coulomb = 3.8 eV 
(according to the relation E Cou}omb = 27.21 eV/R c , 
where R c is in bohr), which is in agreement with 
Figure 1, in which three peaks are seen, at 
5.6, 6.8 and 7.6 bohr. If, in addition, one takes into 

account the fact that at R — R c the dissociating 
H 4 p have an initial nonzero kinetic energy 
[varying between 0.2 to 0.8 eV; see the first peak in 
Fig. 2(a)], one can easily understand the appear¬ 
ance of a broad peak centered at 3.8 eV: Simply, 
the explosions from three CREI peaks seen in Fig¬ 
ure 1 will merge to one broad structure. At higher 
intensities. Figure 1 shows a considerable shift of 
CREI peaks toward the lower values of R, in 
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agreement with the shift of the Coulomb explosion 
peak resulting from our calculations [seen in Fig. 
2(b)], which is now centered at E = 4.8 eV and is 
broader than at the lower intensity. It extends up 
to 8 eV, which agrees well with the lower limit of 
CREI peak in Figure 1, which is sharply cut at 3.05 
bohr. Summarizing, the broad peaks in the p + p 
channel seen in Figure 3 agree well with the expec¬ 
tations for the CREI peaks based on the model 
with frozen nuclei. 

We display in Figure 3 the same spectra as in 
Figure 2 but on a logarithmic scale, in order to 
discuss the high-energy tail. One notes that the tail 
of proton spectra depends dramatically on the 
initial conditions: Initializing our simulation from 
the v = 6 of H 2 yields spectra sharply cut below 
10 eV [Fig. 3(a)], whereas the initialization from 
the v = 0 state of H 2 gives a small (100 times 
lower than the principal peak) but a clear peak at 
E — 14 eV [Fig. 3(b)] which originates from the 
Coulomb explosion from the equilibrium position 
of the H 2 molecule R e — 2 au (note that the equi¬ 
libria positions in our 1-D model are 2.0 and 2.6 
au, instead of 1.6 and 2, for H 2 and for Hj, 
respectively). 


Kinetic Energy Electron Spectra from 
a Dissociating Ion 

The ATI spectra presented in this section were 
obtained by integrating over R the formula (21), in 
which the external box function cp( p, R, tj), ob¬ 
tained from the complete, non-Born-Oppenheimer 
calculation was used. The normalized (i.e., the area 
under the spectrum curve is the ionization proba¬ 
bility Pj) ATI spectra are shown in Fig. 4(1 = 10 14 
W/cm 2 ) and in Figure 5 (2 X 10 14 W/cm 2 ). Our 
exact non-Born-Oppenheimer results (solid lines) 
are compared with the ATI spectra originating 
from the 1-D H atom (□ □ □) and from H 2 frozen 
at R = R c (a a a). One notes a considerable en¬ 
hancement of the spectrum around 3 U p and en¬ 
hancement at very large energies, up to 16U p . A 
frozen nuclei calculation shows such enhancement 
at particular large distances (33 au or more [9]), 
which presumably are close to integer multiples of 
the classical radius a 0 : Classical trajectory calcula¬ 
tions show that the electron reaches its maximal 
kinetic energy at distances z max given by [26,27] 

z max = tmr(v 0 /a> + a 0 ), (29) 

where v 0 is the electron's initial velocity. 



electron energy (photon energy) 

FIGURE 4. (-) ATI photoelectron kinetic energy 

spectrum from HJ complete non-Born-Oppenheimer 
calculations compared (a a a) with spectra of the nuclei 
frozen at R = R c = 7.6 au and (□ □ □) with spectra of 
1-D H atom. The laser intensity / = 10 14 W/cm 2 , the 
wavelength A = 600 nm, and the simulation was 
initialized from the v = 6 state of HJ . The pulse duration 
was 20 cycles (1 cycle = 2 fs), with a five-cycle rise and 
five-cycle turn-off. The ATI spectrum was calculated after 
40 cycles. All spectra are normalized: The area under 
each curve is equal to the ionization probability. 



electron energy (photon energy) 

FIGURE 5. Same as in Figure 4 but for the laser 
intensity / = 2 x 10 14 W/cm 2 and R c = 7.0 au. 
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Concluding Remarks 

We have reported that dissociative-ionization of 
molecular ions exhibits several interesting features 
seen in nuclear fragment and electron kinetic en¬ 
ergy spectra. The expected (theoretical and experi¬ 
mental) low-energy peaks in proton spectra do 
show up in our spectra calculated from the non- 
Born-Oppenheimer dynamics. The positions of 
these peaks agree well with theoretical predictions 
for Coulomb explosions released from the CREI 
critical distance R c . In addition to those peaks, we 
observe one- and two-photon absorption peaks, as 
well as weak peaks related to the Coulomb explo¬ 
sion from the H 2 equilibrium internuclear dis¬ 
tance. The ATI electron kinetic energy spectra show 
unexpectedly high tails, extending up to l5U p , 
related, presumably, to the inelastic collisions of 
electrons moving from one nucleus to another. 
Finally, we emphasize the usefulness of the wave- 
function splitting technique for exact calculations 
of kinetic energy spectra of photofragments emerg¬ 
ing from high-intensity laser interaction with 
molecules. This technique allowed us to recover 
the electron flux usually lost at absorbing bound¬ 
aries in previous calculations, with a considerable 
saving in computer memory and CPU time. 
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ABSTRACT: A convenient Gauss-Laguerre quadrature formula is presented for 
integrands which depend on the radial coordinates r x and r 2 of two bodies as well as on 
their relative distance r 12 . This formula generalizes the analytic method by Calais and 
Lowdin [J. Mol. Spectrosc. 8, 203 (1962)] to cases when the analytical evaluation is not 
possible and defaults to an exact method when it is. © 1997 John Wiley & Sons, Inc. Int J 
Quant Chem 65: 513-518, 1997 


Introduction 

P rompted by renewed interest in experimental 
studies of highly charged ions, small atomic 
systems have again become the focus of theoretical 
and experimental investigations. Heliumlike sys¬ 
tems, in particular, are once more center stage in a 
variety of different research projects. Parallel to 
this development in ion physics runs a similar 
activity in the study of atomic processes in plas¬ 
mas. Here, too, multiply ionized two-electron sys¬ 
tems are being studied as well as negative 
ions—predominantly, the negative hydrogen atom, 
which is an important agent of opacity in astro- 
physical plasmas. 
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The focus of much of modern atomic physics is 
on electron correlation in various systems and pro¬ 
cesses. This interest is shared by new develop¬ 
ments in the physics of highly ionized systems, 
including clusters, and by studies of atomic sys¬ 
tems in plasma environments. Two-electron sys¬ 
tems, in particular, play an important role in this 
research because of the relative ease with which 
correlation effects can be identified and calculated. 
New aspects such as the need to include relativis¬ 
tic effects in highly charged ions, for instance, or 
the direct inclusion of screening potentials in 
atomic calculations [1, 2] call for new or at least 
modified theoretical approaches. 

The present research originated from ideas to 
include realistic, time-dependent screening poten¬ 
tials into atomic calculations from scratch , and, by 
doing so, to calculate line broadening and the 
lowering of the continuum threshold simultane¬ 
ously and on the same footing. While the time-de¬ 
pendent aspect is not our focus here, the need for 
wave functions of larger angular momenta than S 
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and P states for heliumlike ions stimulated the 
present study which builds on the landmark arti¬ 
cle by Calais and L5wdin [3] in which the authors 
gave an analytic expression for nested triple inte¬ 
grals of the type "exponential function times inte¬ 
ger powers" in all three arguments. Their Eq. (28) 
reads 


[KLM] 



X 


/r-y + Yn 

dr l2 r™e- c ^ 

ii'i —r 2 l 


J; ^ (p + cj)l 

2 kilimi E E E , 7 

p = 0 q = 0 r = 0 P 'P' 


(K - p + r)l (L — q + M - r)l 
X r\(K - p)l (M — r)\(L — q)\ 

x (A+By ip+q+1 \A+cr (K+1 - p+r) 
X(B + c) _(L+1_? + M_r) . (1) 


Computationally, this formula—whenever it can 
be applied—is advantageous over the quadrature 
formula to be presented in the third section. In 
many cases of interest, however, the latter pro¬ 
vides substantial convenience for programming 
without compromising accuracy. Furthermore, it 
can be generalized to the case of more general 
integrands for which analytic results cannot be 
obtained. In these cases, the quadrature will no 
longer be exact but still of a high degree of accu¬ 
racy. 


A Quadraure Formula for Integrals of 
the Calais-Lowdin Type 

The following development makes use of four 
well-known properties of the Gauss-Laguerre 
quadrature formula: 

■ Accuracy: The quadrature replacement 

00 N 

f f(x)e~ x dx- £/(x„)W„ 

0 n = 1 

is exact if f(x ) is a polynomial in x of 
degree 2N + 2 or less and the quantities x n 
and W n are the standard Laguerre points and 
weights as given, e.g., in Abramowitz and 
Stegun [4]. 


Scaling: An integral of the form 


1 IL i x n \ 


with exactly the same weights W. r 
Shifting of the lower integral limit: 

r oo 

f f(x)e X dx 
J o 

/ CO 

/(£ + y)e^ l dy 

j 

Interchanging of nested integrals: 

/ CC , A X 

dxf(x)e~ Ax j dyg(y)e~ By 
o •'o 

/ CO CO 

dxg(x)e~ Bx j dyf( y)e~ Ay . 


Repeated application of these simple procedures 
leads to a convenient quadrature representation of 
the Calais-Lowdin integral of Eq. (1). We first 
replace r v r 2 , and r 12 by x, y, and z, respec¬ 
tively, and get rid of the absolute value in the 
lower limit by separating the two possibilities 
x < y and x > y. Furthermore, we want the upper 
limit to be infinite and—in the end—the lower 
limit to be zero in order to be able to apply 
Gauss-Laguerre quadrature techniques: 



With the following replacements, the integrals in 
the square brackets are changed to the desired 
form: a = C[z - (y - x)\, so that dz = (1/C)rfcr 
(we notice that y - x is a constant limit for the 
first of the integrals). Similarly, we substitute in 
the remaining integrals <j = C[z - (y + x)\, cr = 
C[z - (x - y)] and a = C[z — (x + y)\, respec- 


514 


VOL. 65, NO. 5 




A QUADRATURE FORMULA FOR CORRELATION INTEGRALS 


tively. By this, the two square brackets change to 


> —C(y —*) 


I cr \ M 

l 0 d<T (c +y ~ x ) e 


e -C(y + x ) 


,00 / (7 \ M 

f 0 da \c +y + x ) e ~ a 


and 


e -C(x-y) 


r °o l cr 

L + 


+ x — y e 


,-C(x + y > 


/ cr 

/„+ 


+ x + y e 


We combine the square brackets with the inte¬ 
grals on their immediate left after substituting 
r = (B + C)(y — x) and t = (A + C)(x - y), re¬ 
spectively: 


C(B + C) 


+>/„"*( 
l He 


B + C 


+ x \ e 


B + C 


e~ a - 


g~Cxg-(B + C)x 

C(B + C)~ 


.0° 

/ dr 

J r\ 


B + C 


+ x e 


/ CT T \ m 

x //‘ r c + ?Tc +2 h 


>.00 ( cr 

L He 


-\ -— 6 

B + C 


(A + B + 2C)C(B + C) 


A + B + 2C 


I’Ht 

CHt, 

J r 00 cr 

da — 

o \ c 


T P \. 

- H-—■ 6 

B + C A+B+2C / 


r l CT T 2p \ 

X / n d<7 C + B+C + A+Ff2C 6 


*oo 

+ (A + B)C(A + C)-l dp { A + bJ 6 P 


*oo 

f dr 

•J n 


T P \ . 

- H-I 6 

A + C A + B/ 


r oo / a t \ m 

X da\— + ——— e 
4 VC A + C/ 


(A + B + 2C)C(A + C) 


oo 

/ dp 

* 0 ° 

| dr 


A + B 


T P\ 

- H-— I £ 

A+C A+B+2C 


f cr r 2 p 

X / dcr —; +——— H—-— , 

Jn \ C A+C A+B+2C 


e-°. (3) 


and 


e Ay l T 

C ( A + C) 1 A + C 


r™ ( CT T 

X / dcr —- H—-— 

Ai \ C A + C 


+ y * 


e _£r - 


e -Cy e ~(A + C)y 

C( A + C) 


f dr h;—^ 

•4 \ A + C 

/*°° / cr 

L He + : 


+ y ^ 


+ -+ 2 y I c <r . 

A + C 


The final integrations require only scaling but 
no shifting of the quadrature. The overall result is 


[KLM] = 


(A + B)C(B + C) 4 l A + B 


J r CC 

dp 

n 


*oo 

/ dr 

Jn 


T p \ L 

- -J-I £ 

B + C A + B 


The corresponding quadrature formula is given 


[KLM] = —.— --- £ wj-++r 

C(B + C)\A +B k % \ A + B 


X £ w ; 


B+C A+B 


/ (J Ti \ M 

X £ W„,U + —!—) 
VC B + C / 


-E wJ--— 

(A + B+ 20+J H A + B + 2C 
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We notice, in particular, that all integrands have 
the general form of a polynomial multiplied by an 
exponential function. The quadrature gives in these 
cases exact results provided that the number of 
quadrature points is at least equal to (D/2) - 1, 
where D is the degree of the polynomial in one 
variable. In the present case, the degree for the 
variable p is equal to M + K + L, while for the 
variables r and a, it is M + K (or M + L) and M, 
respectively. 


The General Quadrature Formula 

The more general integral 



can as well be represented by a quadrature for¬ 
mula. Again, if the functions /, g, and h are 
polynomials in their respective arguments, the 
representation will be exact provided that the 
number of quadrature points is sufficiently large. 
In this case, the sole benefit of a quadrature repre¬ 
sentation over the direct evaluation by the 
Calais-Lowdin techniques is a substantial saving 
in programming work. If, however, the functions 
are not polynomials, an optimally scaled 
Gauss-Laguerre quadrature is still an efficient way 
to evaluate the necessary integrals. The various 


procedures of scaling, shifting, and interchanging 
of integrals are exactly as in the preceding section 
and need not be repeated here. The final quadra¬ 
ture formula is given as 


1 f 1 * ( Pi 

1 = C(B + C) I A + B £ A + B 


X £ (—^- + —— 

,ti 16 \b + c a + b 


T l 


x £ w m h\ — + —— 
tx Me B + C 


1 I * / p, 

(A + B + 2C) L., \ A + B + 2C 


X E W,g(—— + -—- 

ls \B + C A + B + 2C 


^ / cr T, 2 p k 

X E w„M — + — — + ——- 

„f^ 1 m l C B + C A + B + 2C 


C(A + C) A + B CZ 


x E w lg { — + 

& \A + C A + B 


^nt t sj fj 

x E W m h +-— 

»-i Me A + c 


^ i "J-A 


A + B 


--- ( E wjf —-— 

(A + B + 20 ^, kJ \A + B + 2C 


X ,?i A + C + A + B + 2C 


X E W m h[^-+—— + - - -) . 

w _ a I C A + C A + B + 2CJ 


This formula requires 4N k evaluations of the func¬ 
tion / and 4 N k X N ; evaluations of g, while the 
function h needs to be calculated 4N k X N t X 
times. 


Representative Results 

The results of this section—energies of excited 
P and D states of the helium atom—were selected 
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to assess the accuracy of both the method and the 
computer program. Their physical relevance will 
be discussed in a forthcoming publication. The 
evaluation of eigenenergies of the helium atom has 
a long history. After groundbreaking work by 
Hylleraas [5, 6] and Pekeris [7, 8], the use of 
perimetric coordinates has become almost custom- 
ary for bound- and resonance-state calculations 
(see, e.g., [9-11]). To our knowledge, only the 
S-state and the P-state energies have actuality been 
calculated so far. Because of the present interest in 
atomic processes in plasmas, all Coulomb interac¬ 
tions in the Hamiltonian were replaced in the 
present work by screened interactions of the De- 
bye-Hikkel type ( e~ r/D /r ). In this scheme, one 
approximates matrix elements with pure Coulomb 
interactions to a high degree of accuracy by setting 
the parameter D equal to a very large number. In 
the present examples, D has been chosen equal to 
2.0 X 10 6 when comparison with Coulombic re¬ 
sults was wanted. 

As in the earlier work, the form of the wave 
functions was chosen as 

= E C i ,j,k{r 1 L P L (cos d 1 )e~ a ' , '- fi i r * 

i,jX 

±7' 2 L Pr(cos 0 2 )e~ (3 i ri ~ clir2 }e~ ykrn . (6) 

Here, a indicates the spin multiplicity. Singlet 
states are associated with the upper ( + ) sign, and 
triplet states, with the negative sign. The letter L 
indicates the overall orbital angular momentum 
and P L is the Legendre polynomial of order L. 
While the linear parameters C /;A: are obtained from 
the diagonalization of the Hamiltonian matrix, the 
best choice of the nonlinear parameters a if fi ]f and 
y k is an open question. Different strategies were 
used in the previous publications [9, 10]. In the 
present work, we follow the guidelines of the 
former with the exception that we scale the a, /3, 
and y parameters in groups until satisfaction of 
the quantum virial theorem has been achieved, 
because we hold the correct distribution of kinetic 
and potential energy for an important property of 
stationary states. The results of these calculations 
are shown in Table I, rows 3-5. The entries of row 
3 are very close to the data in row 2. This is to be 
expected because with a Debye parameter as large 
as 2 X 10 6 the Debye potential is almost identical 
to the Coulomb potential. The calculations were 
performed with 56 basis functions for the P states 
and 84 for the D states. The quadrature formula 


TABLE I_ 

For various values of the Debye parameter D, the 
binding energies of the lowest states of overall P 
and D symmetry of the helium atom in a Debye 
plasma environment are calculated (rows 3-5) 
and compared to previously available data for the 
case of only Coulomb interactions (row 2). 


D in au 

V 

3p 

'd 

3 D 

00 

2.12384 3 

2.13316 3 

2.05579 b 

2.05580 b 

2 X 10 6 

2.12353 

2.13308 

2.05557 

2.05560 

200 

2.10861 

2.11816 

2.04072 

2.04075 

20 

1.98121 

1.99020 

1.91864 

1.91875 


The results of rows 3-5 were obtained from the nonrela- 
tivistic Hamiltonian in which all Coulomb interactions were 
replaced by Debye potentials e~ r/D /r . 
a Nonrelativistic calculations by Thakkar and Smith [10]. 
b Experimental data taken from [12]. 

for the spatial integration was used with six (P 
states) or seven (D states) quadrature points and 
weights for each of the three radial coordinates. 

It should be emphasized that the results of 
Table I can, in principle, all be calculated from the 
original formula by Calais and Lowdin [Eq. (1)]. 
Numerical quadrature is not required in the com¬ 
putation of D states of two-electron systems even 
when all Coulomb interactions are replaced by 
Debye potentials. To employ the quadrature for¬ 
mula in cases where an analytical formula is not 
available, the quadrature formula has been used to 
calculate two-electron matrix elements in which 
the term has been replaced by a parameterized 
polarization potential [13]. These potentials contain 
products such as 

1 — exp( — ar\) 1 — exp (—arf) 

_ _ . 

r \ n 

Analytic evaluation is no longer possible. Al¬ 
though the numerators are rather steep cutoff 
functions, and, hence, difficult to integrate numeri¬ 
cally, the Laguerre quadrature yielded in all cases 
considered an accuracy of three digits with 20 
Laguerre points and five digits or better with 52 
points. Product wave functions of very diffuse 
Slater-type orbitals were included in the studies, 
but no systematic investigation has been done yet. 
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ABSTRACT: We study the interrelation between molecular-shape changes and 
configurational transitions in isolated grafted chains ("polymer mushrooms") under 
geometrical confinement. The confinement effects are due to a finite-size obstacle, 
representing a simple model for the tip of an atomic-force microscope. Using an off-lattice 
polymer model, we monitor how the molecular shape of a short chain is affected by the 
geometry of the tip. We show the occurrence of shape transitions as the chain "tries to 
dodge" the approaching obstacle. These transitions can be correlated with changes in the 
nature of the dominant chain conformations. Using a descriptor of self-entanglements in 
the polymer chain, we propose a "phase diagram" indicating the regions characterized 
by free chains, confined chains, chains that can "escape" the tip, and chains whose escape 
is "frustrated." The results help in better understanding polymer behavior under 
compression, and they can be useful for designing interphases with targeted properties. 
© 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 519-530, 1997 


Introduction 

T he study of macromolecules at solid-liquid 
interfaces finds applications in the modeling 
of important phenomena such as film adhesion, 
lubrication, and oil recovery [1], In addition, the 

Contract grant sponsors: Fonds de Recherche de l'Universite 
Laurentienne (FRUL); Natural Sciences and Engineering Re¬ 
search Council (NSERC) of Canada. 


behavior of polymers confined within interfaces is 
central for understanding the dynamics in cell 
membranes and the stability of the vesicles used in 
drug delivery [2]. 

The models commonly used to study polymers 
at interfaces include two basic features: (a) a num¬ 
ber of chain molecules grafted onto a flat surface 
[3-9], and (b) various perturbations acting on the 
chains, e.g., compression or a liquid flow [10-19]. 
In the limit of low surface coverage, one can ne¬ 
glect chain-chain interactions and reduce the 
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model to a single grafted chain under various 
deformations, i.e., a "polymer mushroom" [20], In 
this work, we deal with the deformation of a 
permanently grafted polymer mushroom under 
confinement. 

The configurational properties of geometrically 
confined polymer mushrooms has received a good 
deal of attention recently [11, 20-24]. In the case of 
confining grafted chains to an infinite slab, we 
have shown that pressure affects differently the 
various features that characterize the shape of a 
polymer [24]. For instance, it is found that molecu¬ 
lar size, molecular anisometry, and the complexity 
of the chain's self-entanglements do not behave 
identically during configurational changes [24]. 
(The notion of "molecular size" is reserved in this 
work for the mean three-dimensional space actu¬ 
ally spanned by the distribution of atoms in a 
polymer configuration. It must not be confused 
with the "contour length" of the polymer chain or 
the number of monomers.) 

Geometrical confinement can also be used to 
study other phenomena. For instance, a polymer 
mushroom compressed by a finite-size obstacle can 
be used to simulate the effect on macromolecules 
of a scanning-tunneling microscope or atomic-force 
microscope tip (AFM-tip) [20, 21]. Recent studies 
predicted the occurrence of marked changes in 
molecular size as the chain "tries to dodge" an 
approaching AFM-tip. These changes can take the 
form of "escape transitions" or, under special con¬ 
ditions, hysteresis [20, 21]. Previous studies in the 
literature have analyzed these configurational 
transitions by looking at the scaling regimes in the 
molecular size [3, 20, 21]. These analyses, based on 
mean-field approximations, use the notion that a 
chain can be broken up into a number of "blobs" 
whose effective size is predicted from the partial 
number of monomers (i.e., using scaling argu¬ 
ments) [3]. This approach, however, breaks down 
in short polymers [3, 10] or in arbitrarily long 
polymers under strong compression [21]. In addi¬ 
tion, previous work has neglected other aspects of 
polymer shape that do not necessarily behave in 
the same fashion (cf. [24]). In the present work, we 
want to explore these molecular-shape transitions 
in more detail for systems where scaling argu¬ 
ments [3] are not applicable. 

In practice, we analyze the deformation of a 
short polymer chain by an approaching AFM-tip. 
Within this context, we address the following 


questions: 

(a) Are there recognizable configurational tran¬ 
sitions when the effect of confinement is 
purely entropic (i.e., when the AFM-tip 
changes only the configurational space and 
not the energy of the chains)? This situation 
corresponds to the simplest polymer model, 
where monomers present only a purely re¬ 
pulsive (excluded-volume) interaction and 
there are no energy barriers. 

(b) Can molecular features, other than molecu¬ 
lar size, detect the occurrence of configura¬ 
tional transitions during compression? 

(c) What are the shape features characteristic of 
the dominant chain configurations found 
under all possible confinement conditions? 
Is there a finite-size analog to a "phase 
diagram," where the configurational 
"phases" have distinct molecular shape fea¬ 
tures? 

We address these issues by performing com¬ 
puter simulations of an AFM-tip interacting repul¬ 
sively with the polymer mushroom. The AFM-tip 
consists of a truncated cone with a flat disklike 
face exposed to the grafting surface. The polymer 
chain is modeled as an off-lattice continuum with 
excluded-volume interaction. Within the specifica¬ 
tions of this simple model, we explore the com¬ 
pression of the polymer mushroom under AFM- 
tips with variable disk radii and distance to the 
grafting surface. 

The work is organized as follows: The next 
section discusses the models used for the tip and 
the chain, as well as the sampling of configura¬ 
tional space. A following section discusses the 
evaluation of the mean values for molecular-shape 
descriptors. Another section presents the results 
for shape transitions and configurational changes. 
Further comments and conclusions are found in 
the closing section. 


Configurations of a Model Polymer 
Mushroom with Confinement 

We deal with a single polymer chain, perma¬ 
nently grafted by a terminal point to a flat, impen¬ 
etrable surface. The grafted chain has a constant 
number of monomers n. The polymer is approxi- 
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mated by a bead-and-string "necklace" model, 
consisting of n monomers ("beads") and a con¬ 
stant bond length / between consecutive beads 
[25]. hr absence of confinement, the linear chain is 
allowed to adopt all possible conformations sub¬ 
ject to two constraints: (a) the chain does not 
penetrate the grafting surface, and (b) the 
monomers satisfy an excluded volume condition. 
The excluded volume is a repulsive interaction 
between monomers, where nonbonded beads are 
kept at distances of at least r ex (the radius of 
excluded volume). Configurations are found as 
long as r ex <21. (In the limit r ex -> 21, the chain 
becomes a rigid rod.) 

In this work, we deal only with the effects of 
confinement on one type of polymer chain. There¬ 
fore, r ex , /, and n are taken as constants. To pro¬ 
duce chains whose dimensions are comparable to 
grafted oligopeptides in the presence of a "good 
solvent," we have chosen / = 3.8 A, r ex = 3.43 A, 
and n = 25. The r ex value is taken from the repul¬ 
sive part of the Sutherland potential of the 
molecule of C0 2 [26]. Its value is representative of 
the effective mean size of an amino acid residue in 
a polypeptide chain in a good solvent. 

The chain is terminally grafted to the plane by 
the first bead, which is taken as the origin for the 
coordinate framework. The first bond (between 
beans "1" and "2") is chosen perpendicular to the 
plane, and it defines the "z" axis. (This particular 
grafting condition is only introduced for simplify¬ 
ing the computation of self-entanglements [24]. It 
has little influence on the accessible mean molecu¬ 
lar shapes.) 

The 25-bead polymer chains are modeled by an 
off-lattice continuum. The accessible molecular 
shapes are known for "free" (unconfined) linear 
chains [25, 27] and chains confined to a slab de¬ 
fined by two parallel planes [24]. In our present 
case, a finite-size obstacle introduces new con¬ 
straints on the accessible configurations. 

The confining AFM-tip is modeled as a large 
truncated cone, pointing toward the grafting sur¬ 
face. The base is a chosen as circular disk parallel 
to the surface, centered over the grafted terminal 
end of the chain. The shape of the tip is specified 
by three parameters, as illustrated in Figure 1. The 
distance between the base of the tip and the graft¬ 
ing plane is indicated by L and the radius of the 
base is p. The conical shape is defined by the angle 
a. In this work, only one value of the angle was 
considered (a = 45°). 


The parameters L and p control the confine¬ 
ment. A number of special cases are noted: 

(i) For a fixed L value (L > /), confinement 
within a slab is ensured when p = (n — 2)1, 
representing the maximum distance a 
grafted chain can be stretched parallel to 
the surface. Therefore, strong confinement 
corresponds to the case of large p and 
small L values. 

(ii) For a fixed p value (p > 0), the effects of 
confinement will disappear if L > (n — 1)/. 
(In practice, they disappear at smaller L 
values, vide infra). At this point, we find 
the mean shape of the free grafted chain. 
We refer to this case as the "free" or "un¬ 
confined" chain limit. 

(iii) For an arbitrary L value, the unconfined 
chain limit is also reached if the disk is 
vanishingly small ( p -» 0). The limits (ii) 
and (iii) provide a test for the accuracy of 
the simulations. 

The AFM-tip is a repulsive barrier to the poly¬ 
mer mushroom. Therefore, "acceptable" configu¬ 
rations must have neither bead nor bond penetrating 
the tip. This condition is checked bead by bead as 
the chain is built. We proceed as follows: From the 
second bead, we start a random walk, with con¬ 
stant step-length Z. Let r z = ( x it y if z { ) be the posi¬ 
tion vector of a bead that meets the excluded 
volume criteria. This tentatively accepted bead lies 


P 



1 


FIGURE 1. Geometrical parameters defining the shape 
of the AFM-tip model. 


INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY 


521 




ARTECA 


outside the tip if 

xf + yf > [ p + (z t — L)tan a] 2 , for z i > L. 

( 1 ) 

Whenever Eq. (1) is not satisfied, the partial chain 
conformation is rejected and the search restarts. 
However, two bonded beads could still lie outside 
the tip, yet their bond intersects it. Our model 
excludes also this possibility. Whenever two con¬ 
nected beads with positions r t - and r i _ 1 satisfy Eq. 
(1), one must ensure that all points r along their 
bond lie outside the three-dimensional space occu¬ 
pied by the tip, , i.e., 

r = r i _ 1 + t(r f - r^) £ £l tip , for all t e [ 0 , 1 ]. 

( 2 ) 

This condition is satisfied if the following second- 
degree equation for t: 

X,(0 2 + Y,(f ) 2 = [ p + (Z,(f) - L)tan a] 2 , (3) 

has no real solutions within t e [ 0 , 1 ], where Q/O 
= ^_i + t(q i - with Q = X, Y, Z used 

when 9 = x, y, z, respectively. Whenever a par¬ 
tially built chain satisfies Eq. (3) with t e [0,1], it 
is rejected and the search restarts with a chain 
grown anew from the grafted terminal. 

Our procedure samples the configurational 
space of the confined grafted chains by generating 
uncorrelated conformers . (Our approach is essentially 
the simplest Monte Carlo search [28], where zero- 
energy conformers are accepted and infinite- 
energy conformers are rejected.) When the grafted 
polymers are "grown" in the presence of the AFM- 
tip, all chains that fail the criterion for excluded 
volume or tip repulsion are abandoned and not 
continued. We retain only those chains that are 
built by adding all beads sequentially without a 
single rejection . These off-lattice simulations are 
computationally demanding, but they produce a 
convenient set of independent conformers. Simple 
statistics can then be used to compute mean 
molecular-shape properties and study the effect of 
confinement on them. A few thousand "accepted" 
conformers are enough to reach accurate estimates. 

Our approach poses a heavy demand on the 
random number generators used for sampling con¬ 
formations. One must ensure that the generator is 
not exhausted when the configurational space be¬ 
come very small, e.g., for small L and large p 
values. The performance of different generators 


was tested and compared. Overall, the most reli¬ 
able generator is the one by L'Ecuyer [29, 30], 
which combines two random sequences, with ad¬ 
ditional shuffling to produce a period of length ca. 
2 X 10 18 . This is the only generator that gives ac¬ 
ceptable results in the limit of strong confinement. 
At lower confinement, an optimized "shuffled" 
version [29] of the built-in generator in DEC- 
Fortran ( MTH$ Random , with a period of ca. 10 12 ) 
produces statistically identical results, with a 
slightly faster performance. (All computations were 
carried out on a DEC 255/233 AXP workstation.) 

As an illustrative example of representative 
polymer configurations. Figure 2 shows a series of 
AFM-tips with the same radius (p = 20 A), ap¬ 
proaching the grafted polymer. The "accepted" 
configurations satisfy the criteria above with r PY = 
3.43 A and n = 25. 

Figure 2 illustrates a number of features ob¬ 
served in our simulations: First, as the tip ap- 



L= 20 A 





FIGURE 2. Typical deformations of grafted chains 
caused by an approaching AFM-tip with constant radius, 
p = 20 A. 
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proaches, deformed chains fold underneath it (i.e., 
they become effectively "confined"). However, if 
compression is increased (and the shape of the tip 
allows it), the polymer chain can "escape" the 
approaching tip by stretching to the border of the 
disk and folding outside the confining space. (Ac¬ 
tual physical escape, of course, is not possible 
because the chain is permanently grafted.) In the 
next sections, we discuss in detail these transfor¬ 
mations by calculating the mean molecular-shape 
features of sampled chains. 

Note that we do not analyze the dynamics of 
deformation and "escape" from the tip. Rather, we 
extract static information by computing a large 
number of polymer configurations, compatible 
with the imposed geometrical constraints. More¬ 
over, all phenomena studied in this work are 
strictly due to the interplay between the spatial 
configurations and their associated shape. There is 
no energy cost to the chain deformations, since the 
polymer has only a hard-sphere potential. All ef¬ 
fects on molecular shape are exclusively entropic 
and due to a change in configurational space vol¬ 
ume during compression. In this sense, the present 
approach complements other models used in the 
literature to study confinement on polymer mush¬ 
rooms [20, 21]. 


Mean Molecular-Shape Features for 
Grafted Polymer Configurations 

An "accepted" chain configuration, denoted by 
K f , can be characterized by a number of molecu¬ 
lar-shape features. These properties include some 
that depend exclusively on nuclear (or "bead") 
coordinates, such as molecular size and anisome- 
try. Other important features can be conveyed by 
characterizing the chain's self-entanglements, 
which depend both on nuclear coordinates and 
chain connectivity [31]. (Self-entanglements refer 
to the "twists and turns" in a given linear chain 
configuration. In principle, entanglements do not 
depend explicitly on the molecular size.) Recently, 
we showed that the simultaneous use of molecular 
size and self-entanglement can provide a thorough 
characterization of confined polymer conforma¬ 
tions [24], as well as configurational transitions 
during molecular dynamics [32]. These shape fea¬ 
tures do not necessarily behave the same way 
during compression [24]. In this work, we follow 
both molecular size and self-entanglements when 


the AFM-tip approaches the grafted chains. Our 
goal is to test whether chain self-entanglements 
can be used to monitor the occurrence of configu¬ 
rational "escape" transitions. 

Molecular size is measured by the radius of 
gyration. Let R M be the position of the center of 
mass and {r if i = 1 ,2 , ..., n) the coordinates for the 
n beads, both measured with their origin at the 
grafted terminal. The "instantaneous" square ra¬ 
dius of gyration for a given conformer is 

R 2 G = i-X>-R M || 2 . (4) 

n j-i 

We also follow the mean height of the polymer ( h ), 
measured from the grafting plane. Using the coor¬ 
dinates in Eq. (1), we have 

1 n 

h — — XI z i / where > 0, Vi. (5) 

n i=l 

Chain self-entanglements take into account the 
"trace" of polymer chain, in addition to the 
monomer positions. An effective measure of entan¬ 
glement complexity can be given in terms of the 
probability distribution of over crossings, denoted by 
{ A N (n)} [33], This distribution measures the prob¬ 
ability of producing a two-dimensional projection of 
the n-bead polymer with N "double points" (i.e., 
bond-bond "crossings"), averaged over all possi¬ 
ble projections. The more "entangled" the chain, 
the higher is the probability that large overcross¬ 
ing numbers dominate the projections. In contrast, 
a rodlike chain is unentangled [i.e., A 0 (n) is the 
largest contribution to the probability]. 

A single descriptor for the above distribution is 
the mean number of overcrossings N. In simple words, 
N is the "effective" number of bond-bond cross¬ 
ings in an arbitrary projection of the grafted chain, 
averaged over all directions in space . For any arbitrary 
projection of a linear polymer configuration, the 
maximum possible number of overcrossings is 
maxN = (n — 3)(n - 2)/2 [33]. Depending on the 
shape of the conformation considered, most of the 
projections may yield low N values (if the poly¬ 
mer is swollen or unentangled) or high N values 
(if the polymer is entangled). The mean number of 
overcrossing, N, will usually be much smaller 
than maxN (cf. discussion in [27] and [33]). In 
practice, N is evaluated as an average over many 
randomized projections (e.g., 5 X 10 4 ) [24, 33]. 

If the ensemble of "accepted" chain configura¬ 
tions is denoted by {K,}, then {D(K /)} is the set of 
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values accessible to a given shape descriptor. Since 
the sampled configurations are statistically inde¬ 
pendent, the configurationally averaged shape de¬ 
scriptors are computed as the simple mean within 
the {K f } ensemble. The configurational averages 
are denoted by ( D). 

For the objectives of this work, we focus mostly 
on the changes in entanglements during confine¬ 
ment, as characterized by ( N ). Some exact and 
numerical properties are known for this descriptor 
in the case of long random walks [27, 34-36]. 
Below, we discuss the behavior of ( N ) (and con¬ 
trast it with that of (Rq) 1/2 and (ft)) as the chains 
are deformed by the approaching AFM-tip. 


Dependence of Configurationally 
Averaged Shape Descriptors on 
Geometrical Confinement 

Averaged shape-descriptor values were com¬ 
puted for various tip geometries, as specified by 
the disk radius p and the tip-surface separation L. 
For a descriptor D, the average ( D( p, L)) is eval¬ 
uated over chains with n and r ex fixed. Its value 
for 'Tree" grafted chains (i.e., with no confinement 
effects) is (D)f ree . As mentioned above, the con¬ 
fined chains reach ( D)f ree in two limit conditions, 
when the top is far away from the surface and 
when the tip is vanishingly thin, i.e., 

lim (D( p, L)> = lim (D( p, L)) = (D) free . (6) 

p-> 0 L -» oo 

Our simulations verify these limits within the ac¬ 
curacy allowed by the algorithm. 

To understand the overall changes in molecular 
shape due to the presence of the tip, we discuss 
two cases separately: (A) the compression of 
grafted chains by tips with constant radius; and 
(B) confinement due to tips at constant distance to 
the surface, but variable radius. 

COMPRESSION EFFECTS WITH VARIABLE L 
AT CONSTANT p VALLES 

The basic phenomenon discussed here is illus¬ 
trated in Figure 2. In agreement with results in the 
literature for long chains with a stretching poten¬ 
tial [20, 21], we find conformations where the 
chains escape from under the tip. As we show 
below, these "transitions" can be accompanied by 
distinctive changes in molecular shape. 


First, it must be noted that only some molecu¬ 
lar-shape descriptors are useful for following shape 
transitions. If one analyzes the mean heights of the 
chains, (ft), only a continuous behavior is ob¬ 
served during compression. Figure 3 shows the 
results for the variation of (ft) as a function of the 
tip-surface separation L, for various values of 
disk radjus p. The tip with the smallest radius 
(p = 3 A, curve a) causes small confinement ef¬ 
fects. In this case, the mean height shows the 
smallest variation in terms of L. In contrast, the 
confinement effects are magnified for radii p > 40 
A (curves d and e). (In fact, the polymer mush¬ 
room is effectively confined to a slab for all L and 
p > 40 A.) However, as the radius changes, (ft) 
shows no distinctive features that can be associ¬ 
ated with shape transitions. 

Figure 4 shows a more interesting behavior 
when we _monitor chain self-entanglements in 
terms of (N). An analysis of Figure 4, reveals the 
following features: 

(a) Confinement effects on entanglements dis¬ 
appear for L > 40 A (the free chain limit). 

(b) For all disk radii, ( N ) shows a maximum at 
a particular tip-surface separation. The 
maximum is the largest when the confine¬ 
ment effects are maximized (large p) and 
becomes vanishingly small as confinement 
effects disappear (small p). 



Tip-surface separation L (A) 


FIGURE 3. Variation in the configurationally averaged 
height of the grafted chains (h), as a function of tip — 
surface separation L, for various tip radii p. The labels 
stand for the following values of radius: (a) p = 3 A; (b) 
p = 11 A; (c) p = 19 A; (d) p = 40 A; (e) p = 70 A. The 
last two curves are indistinguishable at the scale used. 
Error bars are omitted for clarity. 
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Tip-surface separation L (A) 

FIGURE 4. Variation in configurationally averaged 
mean overcrossing numbers for grafted chains, (A/), as 
a function of tip-surface separation L, for various tip 
radii p. The curve labels are the same as in Figure 3. 
Error bars are omitted for clarity. 


(c) The tip“Surface_separation L leading to the 
maximum in ( N > changes with the radii p. 

(d) When the compression effects are very 
strong (i.e., at very small L values for a 
fixed p), the mean overcrossing number de¬ 
creases fast. 

Recently, we discussed the occurrence of a max¬ 
imum in self-entanglements for chains confined 
within a slab (p —> oo) [24]. Using similar argu¬ 
ments, we can interpret some of the results above 
as follows: 

(i) Within a slab (p —> co), compression can 
produce compact configurations. As L de¬ 
creases, chains are " forced" to fold over 
themselves, thus increasing the complexity 
of entanglements. 

(ii) As L decreases for large p, the configura¬ 
tional space accessible to the chains be¬ 
comes smaller. Eventually, further com¬ 
pression squeezes the chains closer to the 
surface, thus preventing self-entangle¬ 
ments. As a result, (N) diminishes under 
strong compression. In other words, (N) 
must exhibit a maximum as function of L. 

o 

(iii) For smaller tip radii, e.g., p < 40 A, the 
chains can access to a larger configura¬ 
tional space by "stretching" to the border 
of the tip and folding away from the disk 


(cf. Fig. 2). These "partly escaped" chains 
avoid compactification by the tip, thereby 
reducing the complexity of their entangle¬ 
ments. As a result, (N) shows a smaller 
maximum than the one for large p. 

(iv) For very small disk radii, the chains can 
"dodge" the tip for all L values, except 
when compression is extreme (L values 
close to Z). These "fully escaped" chains 
effectively resemble the free grafted chains 
over a large range of L values. 

The averaged molecular size, given in terms of 
the mean radius of gyration, {R 2 c ) l/2 , parallels 
qualitatively the results of the mean number of 
overcrossings. Figure 5 shows the variation of 
< R 2 c ) l/2 as a function of L. Whereas ( N ) exhibits 
a maximum, the radius of gyration exhibits a mini¬ 
mum during compression, confirming that chains 
become more compact. Note, however, that the 
position of the critical points are not coincident , i.e., 
maximum compactess and maximum entangle¬ 
ment complexity are not found simultaneously for 
the same tip geometry. (This agrees with the re¬ 
sults for compression within slabs [24]). Therefore, 
the behavior of ( N ) can provide new insights on 
shape transitions. 

Summarizing, the presence of a maximum in 
( N ) for large p values is a indication of a "shape 
transition" between compact entangled chains , on 
the one hand, and squeezed (minimally entangled ) 
chains , on the other. However, the shift of such a 



Tip-surface separation L (A) 


FIGURE 5. Variation in configurationally averaged 
radius of gyration for grafted chains, (Rq) V2 , as a 
function of tip-surface separation L, for various tip radii 
p. The curve labels are the same as in Figure 3. Error 
bars are omitted for clarity. 
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maximum with p suggests a new regime, where 
chains reach intermediate compactness and entan¬ 
glements through "escape transitions." To fully 
characterize this regime, we discuss below the 
behavior of (N) as a function of the disk radius, 
at constant values of the tip-surface separation. 

COMPRESSION EFFECTS WITH VARIABLE p, 

AT CONSTANT L VALLES 

We consider now a "virtual" deformation, 
whereby the tip remains at a constant distance to 
the grafting surface while decreasing its radius at 
the base. The typical shape changes accompanying 
this "transformation" are displayed in Figure 6, 
for variable p and a small tip-surface separation 
(L = 7 A). 

Figure 6 illustrates how chains adopt flattened 
conformations at strong confinement (e.g., p = 40 
A and p = 30 A). In this case, since L is very 
small, the chain cannot fold over itself and it is 
forced to stretch. However, since the disk is too 
wide (or, conversely, the chain is too short), only 
fezv beads can marginally "escape" the tip. We use 
the term "frustrated chains" to denote these con¬ 
figurations, where the polymer "fails" to escape 
the tip by any significant amount, even though the 



p = 40A 


v 

p = 30A 


M' 

p = 20A 


p = 10A 


FIGURE 6. Typical deformations of grafted chains 
caused by a series of AFM-tips with variable radius p, at 
a constant tip-surface separation (L = 7 A). 


chains are stretched. (The term is used in only 
qualitative analogy to the concept of "frustration" 
in rugged potential energy surfaces [37], where 
conformations trapped in local minima are unable 
to escape to lower minima. In our case, "frustrated 
chains" cannot reach the entangled conformations 
that are characteristic of free grafted chains.) 

The occurrence of "frustrated chain" configura¬ 
tions depends strongly on the values of L and p. 
For small disk radii p, most of the chain can 
effectively avoid the tip by adopting "escaped 
chain" configurations, in contrast to "frustrated 
chains" (cf. bottom of Fig. 6). On the other hand, if 
confinement is reduced by increasing the 
tip-surface separation, then chains will be able to 
fold over themselves. As a result, their mean con¬ 
figurations will resemble "confined chains" with 
high entanglement, rather than "escaped" or 
"frustrated" chains. In summary, we will expect 
the regime of "frustrated chains" to disappear, 
regardless of the tip radius, whenever the 
tip-surface separation is larger than some critical 
value L c . 

Our computer simulations convey these notions 
quantitatively. Figure 7 shows the configuration- 
ally averaged mean overcrossing numbers for the 
"transformation" in Figure 6. (The case L — 7 A is 
the tightest confinement that we have been able to 
handle.) Despite the large error bars associated 
with configurational fluctuations, a rich behavior 
in (N) is clear. The results in Figure 7 can be 


<N> 



FIGURE 7. Variation in configurationally averaged 
mean overcrossing numbers for grafted chains, <A/), as 
a function of tip radius p, at a constant tip-surface 
separation (L = 7 A). Error bars represent 95% 
confidence intervals. 


526 


VOL. 65, NO. 5 






SHAPE TRANSITIONS IN POLYMER MUSHROOMS 


summarized as follows: 

(a) Chain entanglements are constant for p > 50 
A, where the chain is effectively confined 
within a slab. We find here "confined" con¬ 
formations. 

(b) Cham entanglements diminish when the 
disk radius decreases from 50 to ca. 30 A, 
where they reach a minimum. In this case, 
the dominant chain conformations become 
minimally entangled by stretching to the bor¬ 
der of the disk, where only some terminal 
beads can escape. We find here "frustrated 
chain" conformations. 

(c) Chain entanglements increase again when 
the disk radius is reduced below ca. 30 A. In 
this case, an increasingly larger section of 
the chain can escape the tip and "fold" on 
the outside. We find here "escaped chain" 
conformations. 

Figure 8 shows the corresponding results for the 
variation in the chain mean radius of gyration 
under the same conditions. In this particular case, 
there is a perfect match between the behavior of 
(N) and {R 2 G ) l/2 . As the radius of the tip is 
reduced, the maximum in (R G ) 1/2 marks the tran¬ 
sition from the stretched ("frustrated") configura¬ 
tions with large molecular size to the folded 
("escaped") configurations that are more compact. 

The presence of a minimum in ( N ) vs. p is a 
characteristic of small L distances. Figure 7 can be 



function of tip radius p, at a constant tip -surface 
separation (L = 7 A). Error bars represent 95% 
confidence intervals. 


contrasted with the behavior of chains at larger 
tip-surface_distances. Figure 9 shows the depen¬ 
dence of (N) vs. p over a range of L distances. 
(Error bars are omitted for clarity. The smoothed 
curves in Fig. 9 are polynomial fittings to the 
computed mean values in shape descriptors.) As L 
increases, the curves follow a qualitatively similar 
pattern until L ~ 14 A. For L > 14 A, (N) does 
not have a minimum for the nonzero tip radius. In 
other words, the dominant configurations for L > 
14 A pass directly from "confined chains" to 
"escaped chains." The critical tip-surface separa¬ 
tion for this transition can thus be estimated at 
L c ~ 3 .71, for the present chains where l ~ 1.11 r ex . 
(The value of L c will depend strongly on the 
excluded volume interaction.) Concluding, the 
shapes features represented by the minima in 
(N( p, L)> vs. p appear to be specifically associ¬ 
ated with the regime of "frustrated chain" config¬ 
urations. 

“PHASE DIAGRAM” FOR THE REGIMES OF 

ACCESSIBLE MOLECULAR SHAPES 

The results above can be summarized by repre¬ 
senting the molecular-shape features accessible to 
polymer mushrooms in a two-dimensional ( p, L)- 
plot. We can regard this plot as the analog of a 
"phase diagram" for the configurations of a 



FIGURE 9. Variation in the configurationally averaged 
overcrossing numbers for grafted chains < N), as a 
function of tip radius p, at various tip-surface 
separations. The labels stand for the following values of 
tip-surface separation: (a) L = 8 A; (b) L = 10 A; (c) 

L = 12 A; (d) L = 14 A; (e) L = 17.5 A; (f) L = 22.5 A. For 
clarity, the last two curves (e, f) are indicated by thin 
lines and error bars have been removed. 
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finite-size system. In this context, a descriptor of 
chain entanglements plays the role of an order 
parameter for labeling the "phases." [The analogy 
is only qualitative since the shape properties vary 
smoothly. The separation into sharp regions with 
distinct dominant conformations is somewhat arbi¬ 
trary. In our convention, the boundaries are de¬ 
fined by the (p, L) values satisfying ( N( p, L)> = 

< N )free-] 

Figure 10 shows all the results for <N(p, L)>. 
The global minimum is found at small L and 
intermediate p values, whereas the global maxi¬ 
mum appears at large p and intermediate L val¬ 
ues. The flat regions on this surface represent 
shapes similar to those of free grafted chains. 

Using Figure 10, we can classify the configura¬ 
tions according to distinctive molecular-shape 
properties. The resulting "phase diagram" based 
on chain entanglements is shown in Figure 11. (A 
comparable, yet not identical, diagram can be given 
in terms of the radius of gyration.) Frustrated 
chains (with minimal entanglement) appear at 
L<L C , whereas fully escaped chains (with low 
entanglement) can only be found at small p val¬ 
ues. The separation between the escaped chains 
and the free chains (indicated by a dotted line) is 
artificial, since a faraway AFM-tip with small disk 
radius will not affect the polymer mushroom. The 
remaining boundary curves correspond to the tip 
geometries for which the mean molecular shape of 
the chains coincides with that of a free grafted 
mushroom [i.e., (N(p, L)> = ( N)f ree ]. 


Figure 11 summarizes the results of this work. 
The "phase diagram" conveys the interplay be¬ 
tween configurational and shape transitions due to 
variable confinement. The main features of this 
diagram should remain unaltered for longer chains 
(whenever the same repulsive potential is used) or 
for chains that are not perpendicularly grafted to 
the surface. 


Further Comments and Conclusions 

In this work, we have shown that chain self¬ 
entanglements can be used for monitoring the oc¬ 
currence of configurational transitions (i.e., a 
change in the nature of the dominant configura¬ 
tions due to an external perturbation). Conceptu¬ 
ally, the main result is the fact that the dominant 
configurations can indeed be classified into regimes 
(or "phases") by using the behavior of molecular- 
shape descriptors. The results in Figure 11 use 
mean overcrossing numbers as a descriptor. A 
similar analysis can be done by using other shape 
features, although "phase" boundaries will shift 
since distinct descriptors are generally uncorre¬ 
lated. 

The diagram in Figure 11 should be a valid 
qualitative guide to predict the behavior of chains 
interacting with obstacles. Note, however, that the 
nature and number of the "shape regimes" will 
change with the model chosen for the AFM-tip. 
For curved-surface tips, or tips with indentations, 




50 


FIGURE 10. Dependence of the mean number of overcrossings on geometrical confinement over a wide range of 
values of tip radius p and tip-surface separation L. 
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0 50 

p(A) 

FIGURE 11. Qualitative configurational “phase” 
diagram indicating the molecular-shape regimes 
encountered during compression by a flat AFM-tip. The 
boundaries are estimated from the results in Figure 10, 
using the convention <A/(p,L)> = <A/> free ■ The dotted 
line indicates that the distinction between free chains 
and “escaped” chains is blurred for large L and small p 
values. 


a more complex pattern of shape transitions can be 
expected [21]. 

The present results from chain simulations com¬ 
plement previous work in the literature that is 
only valid for very long chains [20, 21]. Our classi¬ 
fication of short-chain conformers according to 
dominant shape features could not have been done 
by using scaling arguments. 

Our results contribute to a better understanding 
of "escape transitions" [20], since we have now 
shown that configurational changes analog to 
"escape transitions" can also be found in polymers 
with purely repulsive potentials. As discussed 
above, our configurational and shape transitions 
are strictly entropic phenomena. All "accepted" 
conformations have equal (zero) energy, as there is 
neither attraction between monomers or nor an 
energetic cost for stretching a chain. Nevertheless, 
the results show that "escape transitions" can still 
be recognized by the behavior of molecular-shape 
descriptors. Until now, these interesting phenom¬ 
ena had only been found before in more compli¬ 
cated polymer potentials [20, 21]. Note, however, 
that our model has no energy barriers and thus 
shape transitions will not exhibit hysteresis [21]. 
The behavior of chain self-entanglements under 
compression for realistic monomer-monomer in¬ 


teractions remains an interesting question open to 
study. 
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ABSTRACT: We describe the development and applications of a new electronic 
structure method that uses a real-space grid as a basis. Multigrid techniques provide 
preconditioning and convergence acceleration at all length scales and therefore lead to 
particularly efficient algorithms. The salient points of our implementation include: (i) 
new compact discretization schemes in real space for systems with cubic, orthorhombic, 
and hexagonal symmetry and (ii) new multilevel algorithms for the iterative solution of 
Kohn-Sham and Poisson equations. The accuracy of the discretizations was tested by 
direct comparison with plane-wave calculations, when possible, and the results were in 
excellent agreement in all cases. These techniques are very suitable for use on massively 
parallel computers and in O(N) methods. Tests on the Cray-T3D have shown nearly 
linear scaling of the execution time up to the maximum number of processors (512). The 
above methodology was tested on a large number of systems, such as the C 60 molecule, 
diamond. Si and GaN supercells, and quantum molecular dynamics simulations for Si. 
Large-scale applications include a simulation of surface melting of Si and investigations 
of electronic and structural properties of surfaces, interfaces, and biomolecules. © 1997 
John Wiley & Sons, Inc. Int J Quant Chem 65: 531-543, 1997 


Introduction 

O ver the course of the last several decades, 
plane-wave-based methods have been used 
to perform electronic structure calculations on a 

Correspondence to: J. Bernholc. 

International Journal of Quantum Chemistry, Vol. 65, 531-543 (1997) 
© 1997 John Wiley & Sons, Inc. 


wide range of physical systems. The Car- 
Parrinello (CP) and other iterative methods [1, 2] 
have made such calculations possible for systems 
containing several hundred atoms [3]. While these 
methods have been very successful, several diffi¬ 
culties arise when they are extended to physical 
systems with large length scales or containing 
first-row or transition-metal atoms. Special tech- 
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niques have been developed to handle some of 
these problems. Optimized pseudopotentials [4, 5], 
the augmented-wave method [6], plane waves in 
adaptive-coordinates [7—9], and preconditioning 
combined with conjugate-gradient techniques 
[10,11] have had considerable success; however, 
these techniques are still constrained by the 
plane-wave basis, which requires periodic bound¬ 
ary conditions for every system and fast Fourier 
transforms (FFTs) to efficiently transform between 
real and reciprocal space. In particular, FFTs in¬ 
volve nonlocal operations that impose constraints 
on the adaptability of these algorithms to mas¬ 
sively parallel computer architectures because they 
perform best on problems that can be divided into 
localized domains. 

It has been appreciated for some time that there 
are potential advantages to performing electronic- 
structure calculations entirely in real space. 
Boundary conditions are not constrained to be 
periodic; this permits the use of nonperiodic 
boundary conditions for clusters and a combina¬ 
tion of periodic and nonperiodic boundary condi¬ 
tions for surfaces. By employing nonuniform 
real-space grids, it is possible to add resolution 
locally, e.g., for a surface or cluster calculation. A 
basis that uses a high density of grid points in 
regions where the ions are located and a lower 
density of points in vacuum regions can lead to 
order of magnitude savings in the basis size and 
total computational effort [12]. More importantly, 
the use of a real-space basis opens up the possibil¬ 
ity of using multigrid iterative techniques to ob¬ 
tain solutions of the Kohn-Sham equations. Multi¬ 
grid methods [13], which provide automatic 
preconditioning on all length scales, can greatly 
reduce the number of iterations needed to con¬ 
verge the electronic wave functions. Furthermore, 
the real-space multigrid formulation does not in¬ 
volve long-range operations and is particularly 
suitable for parallelization and O(N) algorithms 
[14, 15], because every operation can be parti¬ 
tioned into hierarchical real-space domains. 

There have been a number of previous real-space 
grid-based electronic structure calculations. The 
finite-element method was applied by White et al. 
[16] to one-electron systems. They used both 
conjugate-gradient and multigrid acceleration to 
find the ground-state wave function. Two of the 
present authors [12] used nonuniform grids 
with locally enhanced regions in conjunction with 
multigrid acceleration to calculate the electronic 
properties of atomic and diatomic systems with 


nearly singular pseudopotentials. They verified 
that the preconditioning afforded by multigrid 
was effective in multilength-scale systems. 
Chelikowsky et al. [17] have used high-order 
finite-difference methods and soft nonlocal 
pseudopotentials on uniform grids to study the 
properties of Si and Ge clusters. The present au¬ 
thors have developed a real-space grid method 
with multigrid methods and tested it extensively 
against plane-wave results [18]. Other recent real- 
space work includes finite differencing on warped 
grids [19, 20], wavelet bases [21, 22], hierarchical 
nonlinear grids applied to diatomics [23], and full 
potential multigrid calculations for atoms and di¬ 
atomics [24]. 


Real-Space Calculations 

Several issues that are absent from plane-wave 
or orbital-based methods arise when using a real- 
space grid approach. In the former case the wave 
functions, potentials, and the electronic density are 
representable in explicit basis functions, and thus 
are known everywhere. In a real-space grid imple¬ 
mentation, these quantities are known only at a 
discrete set of grid points, which can introduce a 
spurious dependence of the Kohn-Sham eigenval¬ 
ues, the total energy, and the ionic forces on the 
positions of the ions with respect to the real-space 
grid. We have developed a set of techniques that 
can overcome these difficulties and can be used to 
compute accurate static and dynamical properties 
of large physical systems. In our formalism the 
wave functions, density, and potentials are di¬ 
rectly represented on a three-dimensional real- 
space grid with linear spacing h gri6 and number of 
mesh points N grid . The ions are represented by 
soft-core norm-conserving pseudopotentials [25]. 
Exchange and correlation effects are treated using 
the local density approximation (LDA) of density 
functional theory. 

For reasons of accuracy and computational effi¬ 
ciency, we discretize the Kohn-Sham equations in 
a generalized eigenvalue formulation: 

A[* n ] + B[VM = e n B[* J- (1) 

A and B are the components of the Mehrstellen 
discretization [26], which is based on Hermite's 
generalization of Taylor's theorem. It uses a 
weighted sum of the wave function and potential 
values to improve the accuracy of the discretiza- 
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tion of the entire differential equation, not just the 
kinetic energy operator. The weights for the 
0(/Zg rid ) discretization are given in Table I. We 
have found that the fourth-order Mehrstellen dis¬ 
cretization produces equivalent or better accuracy 
than the sixth-order central-finite-difference one. 
The higher accuracy of the Mehrstellen approach is 
achieved by using a nonseparable discretization 
and more local information (second nearest- 
neighbor points, for example). 

We have found that pseudopotentials, espe¬ 
cially the rather hard ones, must be smoothed 
before they are suitable for real space calculations. 
Smoothing is necessary to avoid high-frequency 
variation on the real-space mesh which reduces 
accuracy and slows convergence. This variation is 
absent from plane-wave calculations because the 
basis cuts off the large-G components of the poten¬ 
tials exactly [27]. To achieve this same cutoff effect 
in real space, we use the following filtering proce¬ 
dure: The radial pseudopotentials are Fourier- 
transformed to G space. They are multiplied by a 
cutoff function which decreases rapidly beyond 
rr/iah), where a is typically in the range 1.5-2.0. 
The filtered potentials are then back-transformed 
to real space. We have found that the Fourier- 
filtered pseudopotentials retain the accuracy of 
those used in plane-wave calculations. 

To test the smoothness of the pseudopotentials, 
an isolated atom or a bulk system is displaced 
relative to the grid. The magnitude of the variation 
in total energy is a measure of smoothness of the 
pseudopotentials. In Figure 1 we show the change 
in the LDA energy of a carbon atom and a dia¬ 
mond supercell as a function of distance from a 
mesh point. In addition, the degeneracy pattern of 
the eigenvalues and the ionic forces are examined. 
If the pseudopotential is insufficiently smooth, the 
ionic forces in a symmetric lattice, for example, 
will not be small (vanish). 

To facilitate comparisons with plane-wave cal¬ 
culations, we define an energy cutoff for the real- 
space calculation, ir 2 /2h 2 [Ry], to be equal to that 

TABLE I_ 


Mehrstellen discretization weights in 3D 
(h is the grid spacing). 

Grid point 

6/7 2 *A 

6*B 

Central 

24 

6 

Nearest neighbors 

-2 

1 

Second neighbors 

-1 

0 



FIGURE 1. Variation of the total energy as a carbon 
atom and a 64-atom diamond supercell is displaced 
rigidly relative to the grid along a coordinate axis. The 
grid spacing h is 0.336 bohr. 


of a plane-wave calculation that uses a FFT grid 
with the same spacing as the multigrid calculation 
[28]. 

To efficiently solve Eq. (1), we have used multi¬ 
grid iteration techniques that accelerate conver¬ 
gence by employing a sequence of grids of varying 
resolutions. The final solution is obtained on a grid 
fine enough to accurately represent the pseudo¬ 
potentials and the electronic wave functions. If the 
solution error is expanded in a Fourier series, it 
may be shown that iterations on any given grid 
level will quickly reduce the components of the 
error with wavelengths comparable to the grid 
spacing but are ineffective in reducing the compo¬ 
nents with wavelengths large relative to the grid 
spacing [13, 29]. The solution is to treat the lower 
frequency components on a sequence of auxiliary 
grids with progressively larger grid spacings, 
where the remaining errors appear as high- 
frequency components. This procedure provides 
excellent preconditioning for all length scales pre¬ 
sent in a system and leads to very rapid conver¬ 
gence rates. The operation count to converge one 
wave function with a fixed potential is C>(N grid ), 
compared to 0(N grid log N grid ) for FFT-based 
approaches [2]. 

To summarize, one self-consistent iteration con¬ 
sists of a multigrid step to solve Eq. (1), followed 
by the orthogonalization of the orbibals and an 
update of the electronic density. The components 
of the effective potential that depend on the den¬ 
sity (Hartree and exchange-correlation) are then 
recomputed for the new density, and the process is 
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repeated until self-consistency is reached. The 
Hartree potential is computed by solving Poisson's 
equation using multigrid iterations on the corre¬ 
sponding Mehrstellen discretization. The nonlocal 
ionic pseudopotential is evaluated in real space 
using the Kleinman-Bylander [30] separable form. 
The operation count to converge one wave func¬ 
tion with a fixed potential is 0(N id ), compared to 
0(N grid log N grid ) for FFT-based approaches [2]. In 
addition, all operations except orthogonalization 
are short-ranged, which allows for efficient 
parallelization and a natural implementation of 
O(N) algorithms [14, 15]. 


Molecular Dynamics 

Ab initio molecular dynamics (MD) simulations 
require accurate ionic forces so that higher-order 
integrators and large time steps can be used to 
achieve long simulation times. If the grid layout, 
whether uniform or nonuniform, does not change 
when the ions move, the Hellmann-Feynman (HF) 
force theorem [31] applies with no Pulay correc¬ 
tions [32]. However, the calculation of the HF 
forces on a real-space grid requires careful consid¬ 
eration of numerical errors because the derivative 
of even a filtered pseudopotential can introduce 
high-frequency components. The quality of our 
forces is sufficient, however, for even long simula¬ 
tions, as exemplified in Figure 2, which shows that 
the total energy for a 64-atom cell of bulk Si at 
1100 K is conserved within 27 fxeV per atom. 


§, 
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7.0 f- 
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FIGURE 2. Potential, kinetic, and total energy of a 
molecular dynamics simulation of a 64-atom silicon cell 
at a temperature of 1100 K. Third-order Beeman-Verlet 
integration with an 80-a.u. time step was used for the 
integration of the ionic equations of motion. The total 
energy curve is multiplied by a factor of 100. The potential 
and total energies have been shifted by 251.171 a.u so 
that they could appear together. 


compact nature of the Mehrstellen operator re¬ 
quires the exchange of only small amounts of data 
between nearest-neighbor processors. In tests of 
the code, ported from the Cray YMP-C90 to the 
massively parallel Cray T3D, we found nearly per¬ 
fect load balancing (90%) on all systems tested. 
These tests used up to 512 processors. 


Test Results 


Parallelization 

There are two apparent parallelization schemes 
for a real-space grid method. First, the orbitals 
could be distributed over processors and stored 
there. Its major advantage is that our high opti¬ 
mized serial code would require few changes. 
However, the orthogonalization of the orbitals 
would perform inefficiently because entire orbitals 
must be exchanged between processors. Our pre¬ 
ferred scheme is data decomposition, where real- 
space domains are distributed over processors. 
Orthogonalization becomes sufficient because 
overlap integrals can be computed in two parts: 
the intraprocessor portion is fully parallel, and the 
interprocessor part sends only the intermediate 
sums between processors. Just as important, the 


A significant advantage of the multigrid method 
is the speed of convergence to the electronic ground 
state for a given initial ionic configuration. For 
systems requiring a small energy cutoff or of small 
size, the speed advantage with respect to CP-based 
methods it not substantial. However, for systems 
requiring a large energy cutoff, or of large dimen¬ 
sions, this advantage—as measured in actual 
computational time—is typically an order of mag¬ 
nitude. This is because the maximum stable time 
step in the Car-Parrinello method must be much 
smaller than in the multigrid approach. Our initial 
tests of the real-space multigrid method were dis¬ 
cussed previously [12,18] and included a full com¬ 
parison with established plane-wave results. 

To illustrate the ability of the multigrid method 
to handle ill-conditioned systems, test calculations 
were performed on a 64-atom GaN cell in the 
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zinc-blende structure with the Ga 3d electrons in 
valence. A grid spacing of 0.14 bohr was used, 
which corresponds to an energy cutoff of 250 Ry in 
a plane-wave calculation. Only the T point was 
included in k -space sampling. Starting with ran¬ 
dom initial wave functions, only 75 self-consistent 
steps were required to converge the total energy. 
Recently, several calculations have been per¬ 
formed on GaN that explicitly included the 
d-electrons in valence; the multigrid results are in 
very good agreement with these calculations (see 
Table II). 

In addition, the energy of formation of the Ga 
interstitial impurity in a 64-atom GaN supercell 
was calculated to be 7 eV. The ions were not 
relaxed at this 250-Ry cutoff in order to conserve 
computer resources. Instead, ionic coordinates from 
a fully relaxed Ga interstitial calculation with the 
Ga 3 d electrons in core were used. As a check, the 
interstitial was also fully relaxed with the Ga 3 d 
electrons in valence, but at a 160-Ry cutoff with a 
slightly softer pseudopotential. The maximum 
difference in ionic coordinates between the two 

o 

relaxed structures was 0.1 A. 

The present results for the Ga interstitial con¬ 
firm our previous values, obtained with the Ga 3 d 
electrons in core [33]; namely, the Ga interstitial is 
one of the low-energy native defects in GaN. 
Recent experimental results indeed find indirect 
evidence for its presence, since Ga precipitates are 
formed upon annealing [34]. 

We have also studied the wurtzite phase of 
GaN in a 32-atom supercell with the d electrons in 
core. The discretization in the hexagonal plane is 
straightforward because the grid lines can be 
mapped onto a Cartesian plane. The Kohn-Sham 
eigenvalues computed by the real-space and 
plane-wave methods are in excellent agreement. 

An isolated C 60 molecule was selected as an 
example of a nonperiodic system. The simulation 


TABLE II_ 

Comparison of GaN results obtained with the 
Ga 3d electrons in valence. 3 



Band gap 

Cohesive energy 

Full potential LMTO [56] 

2.00 

10.89 

240 Ry, plane waves, 



2-atom cell [57] 

1.89 


Multigrid, 64-atom cell 

1.88 

10.64 


a The multigrid calculation uses a 64-atom supercell, 250-Ry 
cutoff, and f point sampling. See text [eV]. 


cell was a cube of length 23 bohrs and the grid 
spacing was 0.360 bohr. The initial ionic coordi¬ 
nates were generated using the classical Tersoff- 
Brenner potential [35], and the electronic wave 
functions were set to zero on the boundaries of the 
cell. After the convergence of the electronic sys¬ 
tem, the ions were relaxed using the same relax¬ 
ation scheme as before. Two distinct bond lengths 
were found in the final structure, corresponding to 
the carbon-carbon single and double bonds. There 
were twice as many single bonds as double bonds, 
and the average double and single bond lengths 
were 1.39 and 1.44 A, compared to 1.41 and 1.45 A 
obtained in a previous CP calculation [36] for the 
C 60 solid. The standard deviations of the bond- 
length distributions were on the order of 10 ” 3 A in 
both calculations. The experimental values for the 
solid are 1.40 and 1.45 A, respectively. 

Next, we examine the efficiency of multigrid 
acceleration. Table III illustrates the convergence 
properties as a function of grid resolution for an 
8-atom diamond cell. The observed convergence 
rates are largely independent of the energy cutoff. 
The number of self-consistent field (SCF) steps 
required to converge the density is also nearly 
independent of the system size. At an equivalent 
cutoff of 35 Ry, the multigrid method required 17 
steps to converge the total energy of the 8-atom 
diamond cell to a tolerance of 10 _8 . When the 
same calculation was performed for a 64-atom cell 
only 20 SCF steps were needed. 

Finally, we compare the convergence rates for 
four iterative-diagonalization schemes: steepest 
descents, multigrid, steepest descents with peri¬ 
odic subspace rotation, and multigrid with 
periodic subspace rotation. Subspace rotation 
(subspace diagonalization) speeds up iterative 
diagonalization schemes because it efficiently re¬ 
solves exact or near degeneracies between orbitals. 

TABLE III_ 


Multigrid convergence tests In an 8-atom 
supercell. 3 


Grid spacing 

Cutoff 

Total energy 

SCF steps 

0.421 

25.0 

-44.87968 

22 

0.336 

35.0 

-45.03331 

17 

0.280 

60.0 

-45.06240 

21 

0.210 

110.0 

-45.06605 

26 


a Grid spacing [bohr], equivalent plane-wave cutoff [Ry], 
total energy [a.u.], and the number of steps to converge the 
density. 
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In the tests, the initial wave functions were ran¬ 
domly chosen, a worse-case scenario, and the time 
step was the same for each scheme. Ordinary 
multigrid rapidly decreases the error, but the con¬ 
vergence stalls after ~ 15 steps because the den¬ 
sity is changing slowly. Periodic subspace rotation 
sorts out the small degeneracies between states, 
and when several unoccupied states are included 
in the calculation, it mixes in new information 
from the unoccupied sector. This greatly improves 
the convergence rate, even for steepest descents; 
see Figure 3. 



FIGURE 3. Convergence rates for a 64-atom diamond 
cell with a substitutional N impurity at a 63-Ry equivalent 
cut-off. The convergence rate, log 10 (£ - £ 0 ), is plotted 
against the number of self-consistent field (SCF) steps. 
Random initial wave functions were used with a constant 
initial density. SD represents convergence rates for the 
steepest descents algorithm, MG is for multigrid, SD-SD 
is steepest descents with subspace diagonalizations, 
and MG-SD is multigrid with subspace diagonalizations. 


Surfaces of Wurtzite GaN 

The Perdew-Zunger parametrization [37] 
of the Ceperley-Alder form [38] of the exchange- 
correlation energy was chosen. Nonlocal, norm- 
conserving pseudopotentials [25, 39, 40] were 
included using the Kleinman-Bylander approach 
[30]. For gallium, a recently developed norm- 
conserving pseudopotential, which includes a 
nonlocal core correction [41] and permits an 
efficient description without the need for an ex¬ 
plicit treatment of the d-valence electrons was 
used. For nitrogen, a standard pseudopotential 
with a neutral configuration as the atomic refer¬ 
ence for all states was employed. These pseudopo¬ 
tentials have been demonstrated to reproduce ac¬ 
curately the bulk properties of GaN [42]. 

The calculated bulk properties are presented in 
Table IV. The theoretical lattice parameters of both 
the zinc-blende and wurtzite forms agree very 
well with experiment; a similar level of accuracy is 
expected for the surface and interface calculations 
described below. The cohesive energy of GaN and 
a-Ga is 10.42 and 3.4 eV, respectively. The binding 
energy of nitrogen molecule is 5.87 eV. The 
theoretical heat of formation of GaN is thus 1.15 
eV, in good agreement with the experimental value 
of 1.14 eV. 

The (0001) surface is a polar surface. The ideal 
surface has dangling bonds with charge distribu¬ 
tions that are energetically unfavorable. If kineti- 
cally permitted, the surface atoms will relax and 
redistribute the "dangling-bond" charge density 
so as to satisfy the valences of all of the surface 


TABLE IV_ 

Calculated bulk properties of zinc-blende and wurtzite nitride semiconductors. 3 


Zinc-blende 


a 0 (A) 

B 0 (MBar) 

AIN 

4.37 (4.38) 

2.02 (2.02) 

GaN 

4.52 (4.5) 

1.70 (1.90) 

InN 

5.01 (4.98) 

1.58 (1.37) 

Wurtzite 

at A) 

3.09 (3.11) 

3.20 (3.19) 

3.55 (3.54) 

c!a 

1.62 (1.60) 

1.63 (1.63) 

1.63 (1.61) 

u (units of c) 

0.378 (0.382) 

0.376 (0.377) 

0.375 

B 0 (MBar) 

1.99 (2.02) 

1.69 (1.95, 2.37) 

1.62 (1.26, 1.39) 


a The values of the gap at the T-pt (£ r ) and of the valence-band width {&E vbw ) are the LDA results. Note that the LDA indirect gap 
in zinc-blende A1N is 3.2 eV. Experimental values are in brackets and follow Ref. 52. 
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species. This can be achieved via structural relax¬ 
ation (with a concomitant charge transfer between 
surface atoms). Recent experimental studies have 
revealed the presence of 2 X 2 reconstructions dur¬ 
ing growth [43]. In this study, we restrict our 
attention to these reconstructions on the gallium- 
terminated face. 

We considered eight different 2x2 reconstruc¬ 
tions, namely the ideal relaxed structure and gal¬ 
lium vacancy, nitrogen adatom, gallium adatom, 
and nitrogen-trimer, each on the T 3 and H 3 (hol¬ 
low) sites; see Figures 4 and 5. We find that adatom 
reconstructions are the most energetically favor¬ 
able of the studied reconstructions. In particular, 
in the gallium-rich case the gallium adatom on 
the T 3 site has the lowest energy, while in the 
nitrogen-rich case, the nitrogen adatom on the H 3 
is the most stable. 

The slabs contained four bilayers of GaN, 
three bilayers of which were relaxed. Relaxations 
in the third bilayer were negligible, indicating 
that a sufficiently thick slab was employed. 
Pseudo-hydrogen of charge 0.75 passivated the 
nitrogen-terminated face [44]. If the work functions 
of the two surfaces are different, then the periodic 
boundary conditions of the supercell enforce a 



FIGURE 4. Schematic top view of the 2x2 
gallium-adatom reconstruction on the (0001) surface of 
GaN. The gallium adatom (in gray) sits above a nitrogen 
in the second layer on the T 3 site. The numbers denote 
gallium atoms on the surface, while the primed numbers 
denote nitrogen atoms in the first subsurface layer. 



FIGURE 5. Schematic top view of the 2 x 2 
nitrogen-adatom reconstruction on the (0001) surface of 
GaN. The nitrogen adatom sits above the hollow (H 3 ) 
site. The numbers denote gallium atoms on the surface, 
while the primed numbers denote nitrogen atoms in the 
first subsurface layer. 


common electrostatic potential in the vacuum and 
result in an unphysical change in the electric po¬ 
tential in the vacuum region equal to the differ¬ 
ence between the two work functions. A large 

o 

vacuum region of 10 A served to reduce the size of 
this field. Calculations including explicitly a cor¬ 
rection for this field did not show significant dif¬ 
ferences in the final equilibrium geometries [45]. 
An orthorhombic supercell with grid spacing of 
0.21, 0.22, and 0.20 a.u. in the x, y, and z direc¬ 
tions, respectively, was employed. Eight different 
reconstructions were studied using T-point sam¬ 
pling. For the lowest energy structures of each 
class, i.e., nitrogen adatom on H 3 site, nitrogen 
trimer on the T 3 site, etc., a further calculation 
using two k points in the irreducible Brillioun 
zone was carried out. 

Figure 6 displays the relative surface energies 
for the five different reconstructions. Under 
gallium-rich conditions and for most of the range 
of the chemical potential, the lowest energy recon¬ 
struction is the gallium-adatom on the T 3 site; 
while under nitrogen-rich conditions, the recon¬ 
struction with the nitrogen-adatom on the H 3 site 
is preferred energetically. 
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FIGURE 6. Formation energy vs. gallium chemical potential for the (0001) surface. The maximum chemical potential 
for N (Ga) is equal to the energy per atom calculated for N 2 (bulk Ga). Two k points are sampled in the irreducible 
Brillioun zone. 


The gallium-adatom reconstruction is shown in 
Figure 4. The adatom sits on the T 3 site above the 

subsurface nitrogen atom; the gallium-nitrogen 

° 

distance is 2.46 A. The gallium-adatom surface- 
gallium bond-length is 2.40 A which is very close 
to the Ga-dimer distance in a-gallium (2.44 A). 
The bond angle between two gallium surface atoms 
and the adatom is 82°. The in-plane relaxation of 
the surface atoms is negligible; the only observed 
relaxation of the first bilayer is along the z direc¬ 
tion. The three gallium atoms bonded to the 
adatom remain coplanar to a significant degree, 
while the remaining unbonded gallium surface 
atom relaxes into the slab. The proximity of the 
gallium adatom to the nitrogen subsurface atom 
lowers the electrostatic energy and makes this par¬ 
ticular configuration energetically more favorable 
than the gallium adatom reconstruction with the 
adatom on the hollow site. 

The nitrogen-adatom reconstruction is shown in 
Figure 5. The adatom sits on the hollow (H 3 ) site. 
The nitrogen-adatom surface-gallium bond-length 
is 2.0 A and is very close to the Ga-N bond length 
distance in GaN bulk (1.96 A). The bond angle 
between two gallium surface atoms and the nitro¬ 
gen adatom is 89°. The relaxation perpendicular to 
the surface is very similar to that in the gallium- 


adatom reconstruction. This configuration is stabi¬ 
lized because it separates the nitrogen adatom 
from the subsurface nitrogen atom and thereby 
lowers the electrostatic repulsion. 


Onset of Melting of Si (100) Surface 

We now turn to the melting of the Si (100) 
surface. This is a classic problem, whose micro¬ 
scopic details are not well understood. This is 
particularly true of covalent materials like Si, 
whose surfaces are characterized by reconstruc¬ 
tions, steps, islands, and other surface defects—all 
of which are expected to play a role in the micro¬ 
scopic aspects of the melting process. As a first 
step toward this goal, we have carried out simula¬ 
tions of the melting process of the Si (100) surface 
with finite-temperature ab initio methods. The 
temperature of the Si ions were controlled with a 
chain of five, linked Nose-Hoover thermostats [46]. 
Because the electrons are always quenched back 
onto the Born-Oppenheimer surface after every 
time step, no additional thermostat is needed for 
the electrons. The in-plane cells consisted of 16 
atoms per layer and the basic time step of the 
simulation was 100 a.u. 
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The melting of the Si (100) surface has previ¬ 
ously been investigated with classical molecular 
dynamics simulations [47], and the thermodynam¬ 
ics of melting has recently been investigated by ab 
initio techniques [48]. However, it is now known 
that classical potentials do not reproduce well sev¬ 
eral aspects of the melting process. Upon melting. 
Si goes from a fourfold coordinated semiconductor 
to a metallic liquid. The density of the liquid is 
about 10 percent higher than in the solid. The 
average coordination number is between 6 and 7, 
which is rather low for a metal. This low coordina¬ 
tion number is indicative of persistent remnants of 
covalent bonding. 

A relaxed six-layer slab of Si, with the bottom 
layer saturated by fixed H atoms, was heated up 
to 1500 K over a 1.5 ps period of time. Over the 
next 0.3 ps, the average temperature was raised to 
1700 K, which is the melting temperature of bulk 



0.00 ps 



1.00 ps 



2.00 ps 


FIGURE 7. Side view of sample configurations 


Si. Sample configurations as a function of time are 
shown in Figure 7. 

We have characterized the structural, thermo¬ 
dynamic, and electronic properties changes that 
take place during the melting process. However, 
here we will only briefly summarize some of the 
main features, while a full description will be 
published elsewhere [49], As the melting of the 
layers proceeds, there is a change in the density of 
the top layers as function of the vertical distance. 
While initially each of the layers is well separated 
and defined, the atomic density of the top layers 
becomes much more uniform at the end of the 
simulation time. During this time, the pair distri¬ 
bution function of the top four layers becomes 
much more liquidlike, with the second peak be¬ 
coming essentially washed out. At the same time, 
the angular distribution function g 3 (0) changes by 
decreasing its large peak about the tetrahedral 



0.50 ps 



1.50 ps 



2.50 ps 


Si (100) surface at 1700 K as the top layers melt. 
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angle, broadening and developing a second peak 
in the 50-60° range, as shown in Figure 8. All of 
this is characteristic of a melted system. As a probe 
of the electronic properties, we have computed the 
local density of states for the bulk and liquid 
domains. These are compared in Figure 9. Note the 
relatively large buildup of states at the Fermi level 
for the liquid, characteristic of a metal, and 
the decrease in number of states for the bulk 
semiconductor. 

Ongoing Work 

ORDER-/V REAL-SPACE IMPLEMENTATIONS 

For very large systems, the complexity of the 
calculations increases as 0(N^ oms ) in computer 
time due to orthogonalization, and the memory 

Pair Distribution 



Angular Distribution 



Degrees 


FIGURE 8. Angular correlation function before and after 
melting. Note the decrease in the large peak near the 
tetrahedral angle. 


increases as 0(N/ oms ). Several approaches that 
reduce both types of scaling behavior to 0(N atoms ) 
have been proposed. These O(N) methods rely on 
the assumption that physial properties are local, 
and that the orbitals [14] and/or the density ma¬ 
trix [15]—when localized in finite volumes about 
each atom—can describe the electron charge den¬ 
sity, kinetic energy density, etc. to useful accuracy. 
More specifically, in most of the approaches the 
wave functions or the density are expanded in 
localized functions, usually atom- or bond- 
centered. The overlaps are then set to zero beyond 
a certain localization radius, and the size of this 
radius determines the accuracy. A detailed de¬ 
scription of the methodology is beyond the scope 
of this work, but one can derive variational expres¬ 
sions in terms of these functions that lead to sparse 
matrix operations and O(N) scaling. One should 
point out that only the range of the localized 
functions is predetermined, their shapes change 
during minimization. In the wave-function-based 
approaches, a useful analogy is to think of these 
orbitals as generalized Wannier functions; indeed, 
they become orthogonal at convergence. A recent 
review article provides an in-depth description of 
the various approaches [50]. In the quantum 
chemistry community there have been analogous 
developments aimed at achieving linear scaling in 
standard packages, which use Gaussian functions. 
To date, linear scaling has been achieved for the 
Coulomb part [51,52], the exchange-correlation 
part within density functional theory [53], and 
most recently in the diagonalization part [54], fol¬ 
lowing the Li-Nunes-Vanderbilt density-based 
approach. 

We have implemented a real-space self- 
consistent version of the Galli-Parrinello func¬ 
tional and tested it on 64- and 216-atom supercells 
of bulk Si. The Galli-Parrinello functional is varia¬ 
tional, and the accuracy of the calculations de¬ 
pends strongly on the localization radius. In Table 
V, we show the errors per atom in the self- 
consistent total energy as a function of the radius. 
It is clear from the results that fairly large radii 
encompassing tens of atoms are necessary for an 
accurate description. We are currently testing other 
forms of O(N) methods, comparing their accuracy 
and efficiency. 

SOLVATED BIOMOLECULES 

In collaboration with L. Pedersen (UNC-Chapel 
Hill), we are investigating the properties of a 
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Surface (z > 5.8 A) 


Before 


0.10 


0.05 


0.00 



Bulk (z < 5.8 A) 


Before 


0.10 


0.05 


0.00 



After 


After 




E-E f (eV) E-E f (eV) 

FIGURE 9. Local density of states in units of states per electron volt per atom after melting in the top layers and in the 
deeper “bulklike” region. 


TABLE V_ 

Error per atom due to localization in a 
self-consistent implementation of the O(N) 
Galli - Parinello functional . 3 


Localization radius (a.u.) 

Error (eV / atom) 

5.5 

1.54 

6.0 

1.03 

6.5 

0.84 

7.5 

0.19 

8.0 

0.06 

12.5 

0.03 


a See text. 


protein associated with Alzheimer's disease, the 
amyloid ^-peptide C 146 O 45 N 42 H 210 . The LDA cal¬ 
culations were performed on the Cray-T3D (128 or 
256 processors), using a structure obtained from 
nuclear magnetic resonance (NMR) data that was 
refined by a molecular-mechanics force field pro¬ 
gram (AMBER). In Figure 10 we plot the electronic 
charge density of various parts of the protein. 
Presently, we are relaxing the system quantum 
mechanically in the presence of water, which is 
treated by a classical potential. We have also im¬ 
plemented the generalized gradient approximation 
to DFT (GGA) [55], which is known to improve the 
description of hydrogen bonding as well as the 
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FIGURE 10. Three views of the electronic charge 
density of the amyloid /3-peptide associated with 
Alzheimer’s disease. The top view shows the entire 
molecule; the lower two views are close-ups of the first 
quarter and second third of the molecule, respectively. 

values of the binding energies. In tests on an 
amino acid (phenylalanine), the GGA disassocia- 
tion energy was lower by 7% than the LDA value 
of 142.59 eV. 


Summary 

We have developed a methodology for perform¬ 
ing large-scale ab initio electronic structure calcu¬ 
lations entirely in real space. By using highly 
efficient multigrid techniques to accelerate 
convergence rates, we are able to efficiently 
converge even difficult ill-conditioned systems, 
which require high-energy cut-offs or large length 
scales. The multigrid method is readily adaptable 
to parallel computer architectures, and an efficient 
parallel implementation was developed. The de¬ 
tails of our implementation, which include new 
compact discretization schemes in real space for 
systems with cubic, orthorhombic, and hexagonal 
symmetry, and new multilevel algorithms for the 
iterative solution of Kohn-Sham equations, are 
described in Ref. [18]. This methodology was tested 
on a large number of systems, including the C 60 
molecule, diamond. Si and GaN supercells, and ab 
initio MD simulations. Further applications in¬ 


clude a simulation of surface melting of Si, struc¬ 
tural properties of a large amyloid /3 peptide 
(Ci 4 6 0 45 N 4 2 H 210 ) implicated in Alzheimer's dis¬ 
ease, and electronic and structural properties of 
GaN surfaces and In^Ga^^N quantum wells. The 
accuracy of the method was also tested by direct 
comparison with plane-wave calculations, when 
possible, and the results were in excellent agree¬ 
ment in all cases. 
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ABSTRACT: Relativistic correlation energies have been computed for the closed-shell 
ground states of Zn and Cd and their singly ionized open-shell ions by means of a 
recently developed relativistic many-body perturbation theory for open-shell multiplet 
states. All electrons were included in the correlation calculations. Analytic basis sets of 
Gaussian spinors are employed to expand the upper and lower components of the Dirac 
four spinors in closed- and open-shell matrix Dirac-Fock and relativistic many-body 
perturbation procedures. The ionization energies have been computed for the atoms, and 
the effects of relativity, electron correlation, and the nonadditive interplay of these effects 
on the ionization energies are analyzed. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 
545-554, 1997 


Introduction 

I n heavy-atom systems, relativistic and electron 
correlation effects are large and strongly inter¬ 
twined [1]. Therefore, accurate treatment of 
many-electron systems makes it necessary to go 
beyond the independent particle model to include 
both effects simultaneously. Fully relativistic 
many-body methods for atoms and molecules, 
which simultaneously account for both effects, are 
currently being developed and tested by several 
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groups [2-16], using basis sets of either "local" or 
"global" functions. Three many-body techniques 
developed from nonrelativistic electronic structure 
theory have been successful in recovering correla¬ 
tion energy in relativistic treatments. They are 
relativistic many-body perturbation theory (MBPT) 
[2-6], relativistic configuration interaction (RCI) 
[7-12], and relativistic coupled-cluster (RCC) the¬ 
ory [13-16]. 

In recent studies [6], we have reported a state- 
specific relativistic MBPT for general open-shell 
systems in which our single Fock operator method 
[17] is employed to make a Moller-Plesset-type 
separation of the relativistic many-electron Hamil¬ 
tonian. Relativistic correlation energies were com- 
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puted for the open-shell ground states of the Group 
IB and IIIA elements up to Z = 79 as well as their 
singly ionized closed-shell ions. The effects of rela¬ 
tivity and electron correlation and the nonadditive 
interplay of these effects on their ionization ener¬ 
gies (IE) were analyzed. The state-specific MBPT 
involves a full implementation of generalized 
Moller-Plesset perturbation theory [18] applied to 
general open-shell reference wave functions. 

Relativistic effects are known to be important in 
the transition-metal atoms. The differential effects 
on the ionization energies of the first-row transi¬ 
tion metals are as large as 0.5 eV in some cases [19, 
20]. Therefore, even in the first-row transition met¬ 
als, accurate determination of electronic structures 
requires approaches which can account for the 
nonadditive interplay of relativity and electron 
correlation. The purpose of the present study is to 
report on the application of our relativistic MBPT 
[6] to the Group IIB elements Zn and Cd and their 
singly ionized ions. In the next two sections, we 
outline our relativistic MBPT and its implementa¬ 
tion with analytic basis set of G spinors (G for 
Gaussian after Grant [21]). In the last section, the 
results of matrix Dirac-Fock (DF) self-consistent 
field (SCF) and all-electron relativistic MBPT cal¬ 
culations on Zn and Cd and their singly ionized 
ions are presented. The effects of relativity and 
electron correlation and the nonadditive interplay 
of these effects on their IE are analyzed. 


Relativistic Many-Body Perturbation 
Theory for Open-Shell Systems 

In c-number theory, the effective many-body 
Hamiltonian most commonly used in relativistic 
atomic and moecular structure calculations is the 
no-pair Dirac-Coulomb (DC) Hamiltonian pre¬ 
scribed by Sucher [22] and Mittleman [23], 

h? c = E,M 0 +.sdE lAi/W (i) 

' ij ' 

J?= L + (l)L + (2)... L + (n), where L + (i ) is the pro¬ 
jection operator onto the space spanned by the 
positive-energy eigenfunctions of the matrix DF 
SCF equation [17]. The projection operator, J?, take 
into accouint the field-theoretic condition that the 
negative-energy states are filled and causes the 
projected DC Hamiltonian to have normalizable 
bound-state solutions. This approach is called the 
no-pair approximation [22] because virtual elec¬ 


tron-positron pairs are not permitted in the inter¬ 
mediate states. h D is a Dirac one-electron Hamilto¬ 
nian: 

h D (i) = ca,- • p, + (/3 - 1 )c 2 + v nuc (r,). (2) 

Here a and /3 are the 4x4 Dirac vector and 
scalar matrices, respectively. V nuc is the nuclear 
potential, which for each nucleus takes the form 

Kuc = -Z/r, r > R, 

= —(Z/2R)(3 - r 2 /R 2 ), r < R. (3) 

The nuclei are modeled as spheres of uniform 
proton-charge distribution, Z the charge of nu¬ 
cleus. R is the radius of that nucleus and is related 
to the atomic mass. A, by R = 2.2677 X 10 -5 A 1/3 . 
Adding the frequency-independent Breit interac¬ 
tion, 

B 12 = ~l/ 2 [ a i ’ a 2 + (a 1 -r 12 )(ot 2 -r 12 )/r 1 2 2 ]/r 12/ 

(4) 

to the electron-electron Coulomb interaction, in 
Coulomb gauge, results in the Coulomb-Breit po¬ 
tential which is correct to order a 2 [22, 23]. Addi¬ 
tion of the Breit term yields the no-pair 
Dirac-Coulomb-Breit (DCB) Hamiltonian [22, 23] 

H? CB = EMO +-sGE(1 A, 7 + S, 7 )W +/ (5) 

i ' ij ' 

which is approximately Lorentz covariant and in¬ 
creases the accuracy of calculated fine-structure 
splittings and inner-electron binding energies. 
Higher-order quantum electrodynamics (QED) ef¬ 
fects appear first in order a 3 and are omitted. 

The N-electron Hamiltonian, H, is partitioned 
into a model Hamiltonian H 0 and a perturbation 
V, such that H = H 0 + V. The zero-order Hamil¬ 
tonian H 0 is arbitrary but should be chosen as 
close to the full Hammiltonian H as possible so 
that the perturbation series converges rapidly in 
low order. The zero-order Hamiltonian is usually 
chosen to be a sum of effective one-electron opera¬ 
tors (Moller-Plesset partitioning [24]). In that form, 
all perturbation corrections describe electron corre¬ 
lation. For closed-shell systems, the best results 
have been obtained with Moller-Plesset partition¬ 
ing. In the single Fock operator method of Koc and 
Ishikawa [17] relativistic MBPT for general open- 
shell multiplet states of atoms and molecules re¬ 
tains the essential aspects of Moller-Plesset per- 
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turbation theory. The theory providees a hierarchy 
of well-defined algorithms that allow one to calcu¬ 
late relativistic correlation corrections in nonitera¬ 
tive steps and, in low order, yields a large fraction 
of the dynamical correlation. The no-pair DC 
Hamiltonian £f+ c is 

H? c = H 0 + V, (6) 

where the unperturbed Hamiltonian H 0 is a sum 
of single DF SCF Fock operators R 0 [17] for closed- 
or open-shells: 

h 0 = lr 0 (*) and y = H° c -T.R 0 (0,(7) 

i i 

and [6, 17] 

R 0 = + E ' 

Jfc(eO) 

Here the mean-field operators, F c and F k , and the 
projection operators, and & v , have been 

defined in previous work [6, 17]. In this form of 
partitioning, all perturbation corrections describe 
relativistic electron correlation including the cross 
contribution between relativistic and correlation 
effects. 

The exact many-electron wave function is ex¬ 
panded in configuration state functions (CSFs) 
{<2\(yJP); / = 1,2,...} generated by single, dou¬ 
ble, triple, etc. excitations from the reference CSF, 
<& Q (y 0 JP). Individual CSFs are eigenfunctions of 
the total angular momentum and parity operators 
and are linear combinations of antisymmetrized 
products of positive-energy SCF spinors. The SCF 
spinors are mutually orthogonal, so the CSFs 
(OjfyJP); i — 0,1,2,...} are also mutually orthog¬ 
onal. The unperturbed Hamiltonian is diagonal in 
this space: 

H 0 = ElO i (y i /P))E} 9) <4> i (‘yf/P)l (8) 

i 

so that 

H 0 l*,(y«/P)> = Ef^.iyJP)) (* = 0,1,2,...) 

(9) 

E; 0) is a sum of the SCF one-electron energies s q : 

= Le q N q n q [ 4> f L (10) 

q 

where the occupation number N q = 2j q + 1 and 
n [3>J is the occupancy of the q th spinor in 

friyJPl 


Application of Rayleigh-Schrodinger perturba¬ 
tion theory provides order-by-order expressions of 
the perturbation series for the state approximated 
by I [6]: 

^ = <I> 0 (y 0 JP) + <l> (1) ( JP) + ..-, (11) 

E = E< 0) + E (1) + E (2) + E {3) + •••, (12) 

where 

<t> ( 1 ) (/P) = E ®i(%/P){V, 0 (E 0 - E,.) -1 }, ( 13 ) 

1=1 

£(0) + £d) = £ scp/ (14) 

E (2) = E V 0i V j0 (E 0 - Ej) 1 , (15) 

i- 1 

E (3) = E Voi W ijV jo ( E o - £;) '(E 0 - E ; r\ 

i, j = 1 

(16) 

Here 

v ?7 = (^.(rJP^vi^^/P)) 

and 

= <<!>,■(%■ /P)lW|Oy(yy/P)) with W=V-E (1) . 

In this form, all the perturbation corrections be¬ 
yond first-order describe relativistic electron corre¬ 
lation. Summation over the CSFs in Eqs. (13) 
through (16) are restricted to CSFs constructed 
from the positive-energy branch of the spinors, 
which effectively incorporates in the computa¬ 
tional scheme the "no-pair" projection operator 
F? + of the DC and DCB Hamiltonians. Negative 
energy states, as part of the complete set of states, 
play a role in many-body calculations. However, 
contributions from the negative energy states due 
to creation of virtual electron-positron pairs are 
small [22], of the order a 3 , and are neglected in the 
present study. Neglecting interactions with the 
filled negative-energy sea, i.e., neglecting virtual 
electron-positron pairs in summing the MBPT dia¬ 
grams, we have a straightforward extension of 
nonrelativistic MBPT. In generalized Moller-Ples- 
set formalism, the CSFs {<&,(% JP); i ¥= 0}, gener¬ 
ated by single excitations from the reference CSF, 
do not contribute to the second- and third-order 
energies because the singly excited CSFs, <F f (% JP ), 
generated by the SCF spinors, satisfy the general¬ 
ized Brillouin theorem, (<F f (y ? /P)|H+ c |<E> 0 (y 0 JP)) 
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= 0. Thus V i0 = 0. CSFs generated by excitations 
higher than double also do not contribute to the 
second- and third-order energies, because for them 
V i0 = 0 and H i0 = 0. 

Electron correlation effects are corrections to the 
independent electron approximation, and their im¬ 
portance depends on the choice of the approxima¬ 
tion in which they are calculated. In nonrelativistic 
many-body calculations, the correlation energy, 
E^if is defined as the difference between the exact 
eigenvalue of the N-electron Schrodinger Hamilto¬ 
nian, E^ ct , and the nonrelativistic single-config¬ 
uration Hartree-Fock (HF) energy, E HF [25]: 

NR ^NR h HF . \i/) 

The nonrelativistic definition, Eq. (17), is based on 
the implicit assumption that it is easy to separate 
relativistic effects from an "exact" value of the 
total energy which contains both correlation and 
relativistic effects [25]. 

We employ in our MBPT calculations two dif¬ 
ferent effective N-electron Hamiltonians, the no¬ 
pair DC Hamiltonian, Eq. (1), and the no-pair DCB 
Hamiltonian, Eq. (5). With respect to the two effec¬ 
tive many-body Hamiltonians, we can define DCB 
correlation energy (Eg£]g) as the difference be¬ 
tween the exact eigenvalue Eg£g of the DCB 
Hamiltonian, H+ CB , and the DF SCF energy (E SCF ), 
and DC correlation energy (Eg£ r ) as the difference 
between the exact eigenvalue (E^ x c act ) of the DC 
Hamiltonian, Ef F c , and the DF SCF energy (E SCF ) 
[5, 6]: 

pcorr pexact _ p /io\ 

c DCB ” ^DCB ^SCF' V-LO; 

pcorr pexact _ p /-iq\ 

l DC — c DC ^SCF* 

The difference between the DCB correlation and 
the DC correlation, E^c b — Edc / i- s relativistic 
many-body shift due to the low-frequency Breit 
interaction [5, 6]. 


Computation 

The four-component atomic spinor, is la¬ 

beled by symmetry species k and subspecies m, 
and 


'l'n K m( r ' °r <£) = 


iQ nK ^)x_ Km {e,<j)) 


( 20 ) 


with P nK (r) and Q flK (r ) the large and small radial 
components, respectively, of the wave function. 


The four spinors are assumed to be orthonormal. 
In the matrix DF scheme, the radial functions are 
expanded in basis sets of analytic functions: 

P«Sr) = LCLg^r) (21) 

i 

and 

Q«W=EC, S ,i^,(f), (22) 

i 


where {C^ Kl } and {C„ Ki } are expansion coefficients 
for spinors of symmetry k, and {g^(r)} and {g^(r )} 
are the large and small component basis sets, re¬ 
spectively. 

The large radial component is expanded in a set 
of Gaussian-type functions (GTF) [5, 6, 9, 11, 12, 
14, 15, 17]: 


gxi(r) = AEr n[K] exp(-t; Ki r 2 ) (23) 

with n{ k ] = — k for k < 0, and n[ k ] = k 4- 1 for 
k > 0. AE is the normalization constant. The small 
component basis set ig^ir)} is constructed to sat¬ 
isfy the boundary condition associated with the 
finite nucleus of uniform proton charge distribu¬ 
tion [1, 5, 6,11,15]. With the finite nucleus, GTF of 
integer power of r are especially appropriate basis 
functions because the finite nuclear boundary re¬ 
sults in a solution which is Gaussian at the origin 
[1, 5, 6]. Basis functions which satisfy the nuclear 
boundary conditions are also automatically kineti- 
cally balanced [1, 5, 12]. Imposition of the bound¬ 
ary conditions results in particularly simple forms 
with spherical G spinors [1, 5, 6]. 

Matrix DF SCF and second-order relativistic 
MBPT calculations based on the no-pair 
Dirac-Coulomb Hamiltonian on the closed-shell 
ground S 0 state of Group IIB atoms, Zn and Cd, 
and the open-shell ground 2 S 1/2 state of the singly 
ionized Zn 1+ and Cd 1+ ions employed large well- 
tempered basis sets [26] of G spinors. All electrons 
were included in the relativistic MBPT procedure 
to study the effects of relativity on electron correla¬ 
tion as well as the overall relativity and cor¬ 
relation contributions to the IE of the Group 
IIB atoms. The order of the partial-wave expan¬ 
sion (L max ), the highest angular momentum of 
the spinors included in the virtual space, was 
L m ax = 5 throughout the study. The basis sets 
were 20sl8 pl3dllfl0gl0h for zinc and 
23s22pl6dl4:fl2gl2h for cadmium. The well-tem¬ 
pered exponents were taken from the work of 
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Huzinaga and Klobukowski [26]. Table I shows the 
well-tempered basis set employed in the SCF and 
MBPT calculations on the neutral zinc atom and 
singly ionized zinc ion. 

All-electron relativistic MBPT calculations were 
also performed that included the frequency-inde¬ 
pendent Breit interaction in the first and second 
orders of perturbation theory. These calculations 
were based on the no-pair Dirac-Coulomb-Breit 
Hamiltonian, H+ CB . The speed of light was taken 
to be 137.0359895 a.u. The nonrelativistic limit was 
simulated by setting the speed of light to 10 4 a.u. 
The nuclei were again modeled as spheres of uni¬ 
form proton charge. 


Results and Discussion 

Table II displays the DF SCF energies E SCF , 
second-order Dirac-Coulomb correlation energies, 
Eg£, and first- and second-order Breit interaction 
energies, B (1) and B (2) , for ground-state zinc, Zn°, 
and Zn 1+ in increasing order of partial-wave ex¬ 
pansion. In each entry, the order of the partial-wave 

TABLE 1_ 

Well-tempered exponents of the 
20s18p13d11f10g10h G-spinor basis set 
employed in the DF SCF and RMBPT calculations 
of Zn° and Zn + 1 . 


Symmetry 


Exponent 

S 1/2 

Pl/2,3/2 

^3/2.5/2 

^5/2,7/2 

97/2,9/2 

1^9/2,11/2 

4250042.1 

X 






663740.70 

x 






144800.18 

X 

X 





38457.241 

X 

X 





11633.475 

X 

X 





3957.4984 

X 

X 





1508.8936 

X 

X 





624.94155 

X 

X 

X 




272.06944 

X 

X 

X 

X 



121.77099 

X 

X 

X 

X 

X 

X 

54.721784 

X 

X 

X 

X 

X 

X 

24.885485 

X 

X 

X 

X 

X 

X 

11.536923 

X 

X 

X 

X 

X 

X 

5.2194206 

X 

X 

X 

X 

X 

X 

2.3421989 

X 

X 

X 

X 

X 

X 

1.0625147 

X 

X 

X 

X 

X 

X 

0.46920953 

X 

X 

X 

X 

X 

X 

0.18025741 

X 

X 

X 

X 

X 

X 

0.08726504 

X 

X 

X 

X 

X 

X 

0.04021766 

X 

X 

X 





expansion, the DF SCF energy, and the relativistic 
MBPT energies are given in the first row. The 
values in parentheses in the second row are the 
nonrelativistic energies computed by setting c = 
10 4 . 

In the third column of Table II, we present the 
closed-shell DF SCF energies E SCF for the ground 
1 S 0 (3d 10 4s 2 ) state of Zn° as well as the open-shell 
DF SCF energies for the ground 2 S 1/2 (3d 10 4s 1 ) state 
of Zn 1 + . The fourth column of the table gives the 
second-order Dirac-Coulomb correlation energies 
Eg£ in increasing order of partial-wave expansion. 
The MBPT calculations were also performed with 
the frequency-independent Breit interaction in¬ 
cluded. The Breit interaction is introduced into the 
no-pair DC Hamiltonian, Ef+ C , by adding the 
frequency-independent Breit interaction, Eq. (4), to 
the instantaneous Coulomb interaction 1 /r /; . In the 
fifth column we present the first-order Breit in¬ 
teraction energies, B (1) . In the sixth column the 
second-order correlation corrections to the Breit 
interaction, B (2) , are given in increasing order of 
partial-wave expansion. The B (2) are computed as 
the difference between the second-order correla¬ 
tion correction evaluated with the inclusion of B^ 
in the effective electron-electron interaction and 
the second-order DC correlation correction, Eg^. 
B (2) in correlation energies is the relativistic 
many-body shift that arises from incorporation of 
the frequency-independent Breit interaction into 
the effective two-body interaction. 

The DF energies, —1794.61114 a.u., for the 
ground 1 S 0 state Zn° atom, computed with a basis 
of 20sl8pl3d G spinors agree well with the corre¬ 
sponding numerical DF limit-1794.62 a.u. obtained 
with the numerical finite difference DF program of 
Desclaux [27]. The difference between relativistic 
DF SCF and its nonrelativistic limit for Zn°, i.e., 
the relativistic energy lowering at the SCF level, is 
16.76676 a.u. 

Table II also shows the convergence patterns of 
the perturbation series in increasing partial-wave 
expansion. Because large G-spinor basis sets were 
employed to saturate the occupied and virtual 
spaces in each symmetry, the computed correla¬ 
tion energies have converged to at least four sig¬ 
nificant figures in each order of the partial-wave 
expansion. The convergence rates of the second- 
order Coulomb correlation correction with respect 
to partial-wave expansion are approximately the 
same for relativistic and nonrelativistic MBPT. The 
difference between the second-order Coulomb cor¬ 
relation correction Eg^ at c — 137.0359895 and its 
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TABLE II_ 

DF SCF and MBPT energies (a.u.) of the ground-state Zn° Cs 0 ) and Zn + 1 ( 2 S 1/2 ) in basis sets of 
increasing partial-wave expansion. 3 


^-max 


^SCF 


eo) 

0(2) 

^tot 

2 

Zn° 

-1794.61114 

(-1777.84438) 

-0.96905 

(-0.95893) 

0.76145 

-0.02037 

-1794.83911 

(-1778.80331) 


Zn + 1 

-1794.32478 

(-1777.56381) 

-0.93013 

(-0.92134) 

0.76134 

-0.02036 

-1794.51393 

(-1778.48515) 

3 

O 

C 

N 

-1794.61114 

(-1777.84438) 

-1.42251 
(-1.41104) 

0.76145 

-0.02342 

-1795.29562 

(-1779.25542) 


Zn +1 

-1794.32478 

(-1777.56381) 

-1.36893 
(-1.35949) 

0.76134 

-0.02341 

-1794.95578 

(-1778.92330) 

4 

K1 

13 

o 

-1794.61114 

(-1777.84438) 

-1.56782 

(-1.55679) 

0.76145 

-0.02447 

-1795.44198 

(-1779.40117) 


Zn +1 

-1794.32478 

(-1777.56381) 

-1.51246 

(-1.50351) 

0.76134 

-0.02446 

-1795.10036 

(-1779.06732) 

5 

Zn° 

-1794.61114 

(-1777.84438) 

-1.62873 

(-1.61799) 

0.76145 

-0.02502 

-1795.50344 

(-1779.46237) 


Zn +1 

-1794.32478 

(-1777.56381) 

-1.57272 
(-1.56418) 

0.76134 

-0.02501 

-1795.16117 

(-1779.12799) 


a The values in parentheses are nonrelativistic energies computed by setting c = TO 4 . 


nonrelativistic limit (c — 10 4 )is the relativistic DC 
many-body shift that arises from the use of rela¬ 
tivistic single-particle states and the instantaneous 
Coulomb two-body interaction. The relativistic DC 
many-body shifts are relatively small for Zn° and 
Zn 1+ , 0.01074 a.u. (0.292 eV) for Zn° and 0.00854 
a.u. (0.232 eV) for Zn 1+ , evaluated at L max = 5. 
However, the effect on the IE of Zn° of the many- 
body shifts is significant because these values are 
noticeably different between Zn° and Zn 1+ . For 
heavier systems, Au and Hg, the relativistic DC 
many-body shifts are an order of magnitude larger 
than that in Zn [5, 15]. 

We have summarized in Table III the IE for the 
ground 2 S 0 state of the zinc atom obtained in 


TABLE III_ 

Computed ionization energy (eV) of 
Zn° in increasing partial-wave expansion. 


^-max 

Ionization energy (eV) 

Relativistic 

Nonrelativistic 3 

2 

8.849 

8.658 

3 

9.248 

9.037 

4 

9.296 

9.085 

5 

9.314 

9.099 

Experiment 13 

9.39 



a Nonrelativistic limit (c = 10 4 ). 
b Experiment (Ref [28]). 


partial-wave expansions up to L max = 5. These IEs 
were computed by subtracting the total energy of 
Zn°, from the last column of Table II, from that of 
Zn 1 + in each order of the partial-wave expansion. 
The table illustrates the convergence pattern of the 
computed IE of Zn° in increasing partial-wave 
expansion. For comparison, we also list in the last 
row the experimental IE [28]. As the order of 
partial-wave expansion increases, the computed 
IE smoothly converges to approximately 9.32 eV. 
The experimental value is 9.39 eV [28]. The IE, 
9.314 eV, computed by relativistic MBPT with 
the partial-wave expansion L max = 5, has con¬ 
verged to within 0.01 eV of the extrapolated value 
(L max = oo) in the partial-wave expansion. The 
residual discrepancy of 0.07 eV between theory 
and experiment is due primarily to the truncation 
of MBPT at second order. The all-electron third- 
order correlation corrections for Zn° and Zn 1 + are 
much more computationally demanding than the 
second-order, and are therefore neglected in the 
present study. The IE computed at the nonrela¬ 
tivistic limit, 9.099 eV, underestimates the experi¬ 
mental value by 0.3 eV because of the neglect of 
relativistic effects. 

When one compares experimental IEs of transi¬ 
tion metals to those computed by nonrelativistic 
quantum chemical methods, one must correct for 
relativistic contributions. Martin and Hay [19] have 
calculated relativistic contributions to the experi- 
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mental ionization energies of transition metals at 
the DF level, assuming that electron correlation 
does not change the relativistic correction. 
Raghavachari and Trucks [20] modified the experi¬ 
mental IEs of transition metal atoms Sc through Zn 
by subtracting the relativistic corrections com¬ 
puted by Martin and Hay [19] from the experi¬ 
mental values. Their corrected "nonrelativistic" IE 
for Zn° is 9.23 eV, in good agreement with their 
estimate, 9.13 eV, computed by quadratic configu¬ 
ration interaction with single and double substitu¬ 
tions and perturbative triples (QCISD(T)). The cor¬ 
rected nonrelativistic IE, however, does not take 
into account the relativistic many-body shift. When 
this correction is accounted for, the agreement 
between them improves dramatically (see below). 

Table IV displays the contribution at each order 
of perturbation theory to the ionization energy of 
Zn° computed at L max = 5. These contributions 
were computed by subtracting the energy of Zn° 
from that of Zn 1 + in each order of perturbation 
theory. Relativistic and nonrelativistic results are 
given, respectively, in the first and second rows. 
The third row gives, for each order, the difference 
between the relativistic and nonrelativistic contri¬ 
butions given, respectively, in the first and second 
rows. The second column gives the zero-order 
contribution, A E scf , to the ionization energy, rela¬ 
tivistic and nonrelativistic, and their difference. 
The relativistic zero-order contribution 7.7923 eV, 
the so-called ASCF approximation to the IE, was 
computed by subtracting the DF SCF energy of 
Zn° from that of Zn 1+ . The nonrelativistic zero- 
order contribution, 7.6347 eV, was computed by 
subtracting the nonrelativistic SCF energy of Zn° 
from that of Zn 1+ . Both relativistic and nonrela¬ 
tivistic ASCF approaches significantly underesti¬ 
mate the ionization energy. Numerous nonrela¬ 


tivistic and relativistic SCF calculations have 
demonstrated that single-configuration SCF yields 
IE which are significantly lower than experimental, 
indicating that electron correlation effects are im¬ 
portant. When the Coulomb correlation correction, 
A = 1.5241 eV, the difference in the second- 
order DC correlation energies between Zn 1+ and 
Zn° is taken into account, the computed IE dra¬ 
matically improves. The effects of Breit interaction 
corrections on the computed IE, —0.00299 and 
0.00027 eV, respectively, for AB (1) and AB (2) , on 
the valence-shell electron IE of Zn°, are negligible. 
The Breit interaction is a short-range interaction 
that primarily affects the inner-shell s 1/2 and p 1/2 
electrons [5, 6]. 

Referring to the third row of Table IV, the 
largest relativistic correction to the nonrelativistic 
contribution to the ionization energy, 0.1576 eV, 
comes from the relativistic DF SCF. In an earlier 
study [19], Martin and Hay estimated this correc¬ 
tion to be 0.16 eV, in agreement with our result. 
The effect of electron correlation on the relativistic 
corrections, i.e., the relativistic many-body shift, 
has not hitherto been available for the transition 
metals. The difference in relativistic many-body 
shifts between Zn 1+ and Zn° (0.0599 eV) given in 
the third row of Table IV, which directly con¬ 
tributes to the IE, is approximately § of the rela¬ 
tivistic SCF contribution (0.1576 eV). The differen¬ 
tial effect of relativistic DC correlation on the cal¬ 
culated relativistic corrections to the ionization en¬ 
ergy is thus nonnegligible. The overall relativistic 
correction to the IE of the zinc atom is 0.215 eV. 
When this correction is subtracted from the experi¬ 
mental IE, one obtains 9.17 eV for the corrected 
nonrelativistic IE for Zn, in excellent agreement 
with the estimated nonrelativistic IE of 9.13 eV 
obtained by Raghavachari and Trucks [20]. The 


TABLE IV_ 

Contributions from each order of perturbation theory to the ionization energy (in eV) of 
ground state Zn° at L max = 5. 


£(Zn + 1 )-£(Zn°) 

ae scf 

A£® 

AB (1) 

A B (2 > 

Total (IE b ) 

Relativistic 

7.7923 

1.5241 

— 0.0029 9 

0.0002 7 

9.314 

Nonrelativistic 

7.6347 

1.4642 

0.0 

0.0 

9.099 

Diff. (Rel - Nonrel) 

0.1576 

0.0599 

-0.0029 9 

0.0002 7 

0.215 


( 0 . 16 ) a 


a Martin and Hay (Ref. [19]). 

L_ 

Ionization energy. 
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residual discrepancy of ~ 0.04 eV is primarily due 
to the truncation of partial wave to L max = 3 in the 
nonrelativistic QCISD(T) calculations. 

Table V displays the DF SCF energies E sCF , 
second-order DC correlation energies and 

first- and second-order Breit interaction energies, 
and B (2) , for the ground 1 S 0 state of the cad¬ 
mium atom and its singly ionized Cd 1+ ion ob¬ 
tained at L max = 5. The relativistic many-body 
shift, B (2) , increases dramatically from Zn to Cd. In 
going from Zn to Cd, the first- and second-order 
Breit interaction energies increase four- to fivefold, 
and the Breit interaction results in a significant 
modification of relativistic many-body effects. 
However, B (1) and B (2) are nearly identical for the 
neutral atoms and their singly ionized ions, and 
thus the valence-shell electron IE is almost entirely 
accounted for by the differences in DF energies, 
E SCF , and the Coulomb correlation energies E{^. 
Such is not the case with the elements of the fifth 
and higher rows, nor with highly ionized species 
[5, 6], In highly ionized ions, B (2) is comparable in 
magnitude to the second-order Coulomb correla¬ 
tion correction. The differences in B (1) , B (2) , and 
£§£ between an ion A” + and A (n + 1)+ tend to 
affect the IE by similar amounts [5, 6]. 

As with Zn the large basis sets result in com¬ 
puted correlation energies that have converged to 
at least four significant figures in each order of the 
partial-wave expansion. The convergence rates of 
the second-order Coulomb correlation correction 
with respect to partial-wave expansion are approx¬ 
imately the same for relativistic and nonrelativistic 
MBPT. The relativistic DC mnay-body shifts are 
relatively small for Cd° and Cd 1+ , i.e., 0.03530 a.u. 
(= 0.961 eV) for Cd° and 0.02984 a.u. ( = 0.812 eV) 
for Cd 1+ evaluated at L max = 5. However, as with 
zinc, the effect on the IE of Cd° of the many-body 
shifts is significant because these values are notice¬ 
ably different between Cd° and Cd 1 + . 


Table VI displays the contribution from each 
order of perturbation theory to the ionization en¬ 
ergy of Cd° computed at L max = 5. The table was 
constructed in a similar manner to Table IV. Rela¬ 
tivistic and nonrelativistic results are given in the 
first and second rows, and the third row gives the 
difference between relativistic and nonrelativistic 
contributions. The second column gives the zero- 
order contribution, AE SCF , to the ionization energy 
computed both relativistically and nonrelativisti- 
cally and their difference. The ASCF approxima¬ 
tion for IE, 7.3544 eV, was computed by subtract¬ 
ing the DF SCF energy of Cd° from that of Cd 1+ . 
The nonrelativistic zero-order contribution 6.9264 
eV was computed by subtracting the nonrelativis¬ 
tic SCF energy of Cd° from that of Cd 1 + . As in the 
case of Zn, both relativistic and nonrelativistic 
ASCF approaches underestimate the ionization en¬ 
ergy. When the Coulomb correlation correction 
A E§£ = 1.7309 eV, the difference in the second- 
order DC correlation energies between Cd 1+ and 
Cd°, is taken into account, the computed IE im¬ 
proves. The differential effect of relativistic DC 
correlation on the calculated relativistic corrections 
to the ionization energy is nonnegligible. On the 
other hand, the effects of Breit interaction correc¬ 
tions on the computed IE, —0.00490 and 0.00027 
eV, respectively, for AB (1) and AB (2) , on the 
valence-shell electron IE of Cu° are still negligible 
in Cd. The IE at the nonrelativistic limit, 8.509 eV, 
underestimates the experimental value by 0.48 eV 
because of the neglect of relativistic effects. Refer¬ 
ring to the third row of Table VI, the largest 
relativistic correction to the nonrelativistic contri¬ 
bution to the ionization energy, 0.4280 eV, comes 
from the relativistic DF SCF. The difference in 
relativistic many-body shifts between Cd 1+ and 
Cd° (0.1486 eV) given in the third row of Table VI, 
which directly contributes to the IE, is approxi¬ 
mately § of the relativistic SCF contribution (0.4280 


TABLE V_ 

DF SCF and MBPT energies (a.u.) of the ground-state Cd° OSq) and Cd +1 ( 2 S 1/2 ) computed with 
partial-wave expansion, L max = 5. a 



^SCF 

E& 

B (1) 

e< 2 > 

^tot 

o 

CL 

o 

-5593.32183 

-2.57354 

3.84216 

-0.08161 

-5592.13482 


(-5465.09882) 

(-2.53824) 



(-5467.63706) 

Cd +1 

-5593.05156 

-2.50993 

3.84198 

-0.08160 

-5591.80111 


(-5464.84428) 

(-2.48009) 



(-5467.32437) 


a The values in parentheses are nonrelativistic energies computed by setting c = 10 4 . 
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TABLE VI 


Contributions from each order of perturbation theory to the ionization energy of the ground state 
Cd° at L max = 5 (in eV). 


£(Cd + 1 )-E(Cd°) 

AE scf 

AE$ 

a e (1) 

A B (2) 

Total (IE a ) 

Relativistic 

7.3544 

1.7309 

- 0.0049 

0.0002 7 

9.081 

Nonrelativistic 

6.9264 

1.5823 

0.0 

0.0 

8.509 

Diff. (Rel - Nonrel) 

0.4280 

0.1486 

-0.0049 

0.0002 7 

0.572 


a Ionization energy 


eV). The differential effect of relativistic DC corre¬ 
lation on the calculated relativistic corrections to 
the ionization energy is thus nonnegligible and 
larger than that in Zn. The overall relativistic cor¬ 
rection to the IE of the cadmium atom is 0.572 eV. 

We have summarized in Table VII the com¬ 
puted IEs for Zn and Cd. In the third column, the 
IEs computed in previous nonrelativistic and rela¬ 
tivistic correlated calculations are shown. The ex¬ 
perimental IEs are given in the last column. The 
IEs computed in this study agree with experiment. 
The discrepancies between the computed and ob¬ 
served IEs are within 0.1 eV for both Zn and Cd. 
The bulk of the error is due to truncation of the 
perturbation expansion at second order. Vijayaku- 
mar and Gopinathan [29] employed a relativistic 
correlated local density functional (RCH) method 
to calculate the IEs of atoms He through Zn. The 
computed IE of Zn by the RCH method is 8.41 eV, 
underestimating the experimental value by as 
much as 1 eV. Guo and Whitehead [30] computed 
the IE of Cd by generalized-exchange local-spin- 
density-functional method with relativistic correc¬ 
tions of the mass velocity and Darwin terms. The 


TABLE VII_ 

Ionization energies of Zn and Cd (eV). 




Ionization energy 


Atom 

This work 

Other work 

Experiment 3 

Zn 

9.314 

8.41 , b 9.13° 

9.39 

Cd 

9.081 

8.72 d 

8.99 


a Moore’s table (Ref. [28]). 

b Vijayakumar and Gopinathan; relativitsic and correlated 
local-density B method (Ref. [29]). 

c Raghavachari and Trucks; nonrelativistic QCISD(T) (Ref. 

[ 20 ]). 

d Guo and Whitehead; QR-LSD-GX-SIC-GWB method (Ref. 
[30]). 


density functional method produces an IE for 
Cd (8.72 eV) in closer agreement with experiment 
(8.99 eV) than does Vijayakumar's RCH estimate 
for Zn. 
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ABSTRACT: Accurate results for hyperfine structure (HFS) constants of (d + s) n 
states often require the inclusion of both correlation and relativistic effects. Without 
these, Dirac-Fock HFS constants may have the wrong sign, the energies of uppermost d n 
levels may be wrong by over 1 eV. Here, we demonstrate that these problems persist 
well into an isoelectronic sequence, and identify the principle configurations needed to 
achieve accurate results. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 555-564, 1997 


Introduction 

T ransition-metal atoms play important roles in 
catalysis, as impurities in plasma fusion de¬ 
vices, and as deep-level traps in semiconductors. 
Previously, properties of (d 4- s) n states of these 
species, which form many of the ground and 
low-lying excited states, have been rather poorly 
understood. Positions of d n levels can be in error 
over 1 eV, independent particle hyperfine struc¬ 
ture (FFFS) constants can have the wrong sign, and 
little ab initio work has been done on their transi¬ 
tion probabilities. 
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of Energy Research, U.S. Department of Energy. 

Project grant number: DE-FG02-92ER14282. 

International Journal of Quantum Chemistry, Vol. 65, 555-564 (1997) 
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Our HFS work on these states, which began in 
1992 [1], has established the need to accurately 
position the d n ~ 1 s levels relative to those from d n 
and d n ~ 2 s 2 in order to account for the HFS con¬ 
stants for the latter two configurations. We may 
interpret this as being due to the former normally 
having large HFS constants arising from the con¬ 
tact operator contributing at the zeroth-order (in¬ 
dependent particle) level. Of course, in addition to 
positioning the levels correctly, the usual core po¬ 
larization effects must be included. 

Since relativistic effects associated with d, s 
electron interchanges may be several tenths of an 
electron volt [2], they can have an important im¬ 
pact on level positions and so must be included in 
any theoretical treatment. It is also necessary to 
treat d n , d n ~ 1 s , and d n ~ 2 s 2 levels on an equal 
footing, so our theory must be a multireference 
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one. Furthermore, when obtaining correlated wave 
functions for these levels, one can be involved 
with configurations possessing a few hundreds of 
eigenvectors, each constructed from several hun¬ 
dreds of Slater determinants. This calls for some 
innovative ways of reducing computational costs. 


Methodology 

The wave functions are generated from the 
Dirac-Coulomb Hamiltonian, with the Breit con¬ 
tribution, optionally, added as a perturbation. 
Multiconfigurational Dirac-Fock (MCDF) solu¬ 
tions obtained from Desclaux's program [3] are 
used as zeroth-order functions. There are gener¬ 
ated from just (d + s) n levels. Correlation is intro¬ 
duced through relativistic configuration interac¬ 
tion (RCI); normally a first-order form is chosen, 
i.e., those configurations generated by making sin¬ 
gle or double excitations from outermost subshells 
of MCDF configurations. The question of what to 
use as one-electron functions for unoccupied (vir¬ 
tual) subshells is an important one, as poor choices 
may lead to (partial) variational collapse into the 
"positron sea." This can be avoided if the major 
and minor components of these functions are 
properly constrained (e.g., not allowed to vary 
separately). In our case, experience has demon¬ 
strated that use of relativistic screened (Z*) hydro- 
genic functions avoid this problem, while at the 
same time providing an adequately converged se¬ 
ries, if Z* is chosen during the RCI process to 
minimize the energy for the root of interest. 

Since much of our work deals with excited 
states, we prefer to work with moderately sized 
(< 7000) energy matrices, which we diagonalize 
using Weber and co-workers' [4] variant of the 
Davidson algorithm, to which we have added some 
improvements [5]. We thus want to use N electron 
functions which are eigenstates of J 2 , J z , and par¬ 
ity. Formerly, we used to generate eigenstates of 
} 2 by direct diagonalization of that matrix, but 
some of the correlation configurations can involve 
matrices of order several thousand, and we may 
need a few hundred eigenvectors. In our HFS 
studies, we have avoided this difficulty by split¬ 
ting the configuration into two separate parts, each 
of which we make eigenstates of J 2 and J z , and 
their "paste" the parts back together using stan¬ 


dard vector coupling theory (sums of the product 
of the two section functions and a single 3 j symbol 
are involved). The procedure is very fast and capa¬ 
ble of dealing with large problems. 

To further restrict the size of the energy matrix, 
we have automated a suggestion first made by 
Bunge [6] in a nonrelativistic study of the carbon 
atom. This recognizes that the N parents (eigen¬ 
vectors) of a single configuration, generate matrix 
elements (with the zeroth-order function) involv¬ 
ing M radial integrals, and that M is less than N, 
usually. In fact, in the transition metals and rare 
earths, M may be 10-1000 times smaller than N. 
Since the theory is first order in "form," a rotation 
of the original set of parents can be done, such that 
in the new set, there are only M parents which 
have a nonzero interaction with the reference func¬ 
tion. The other N — M functions can then be dis¬ 
carded, consistent with a first-order (in form) the¬ 
ory. Certain adjustments are made for the fact that 
our theory is relativistic, and multireference, but 
the approximation is both a good one (errors less 
than 200 cm" 1 ) and highly efficient. Further de¬ 
tails of the methodology can be found in our Zr II 
paper [7]. 

COMPUTATIONAL STRATEGY 

Our calculations begin by doing RCI calcula¬ 
tions built from single and double excitations from 
the valence (e.g., 4 d,5s in Zr II) subshells into 
virtuals of symmetries s, p, d, g (one radial for 
each type). These excitations are significant con¬ 
tributors to energy differences, and sensitive to 
orthogonalization order (if core excitations are done 
first, it is difficult to recover energy loss from 
valence virtuals due to orthogonalization to core 
virtuals). Now the 4 d and 5s radial functions can 
vary substantially between (4 d) 3 , (4rf) 2 5s and 
4d(5s) 2 configurations. To account for this, we do 
the MCDF calculation for the root of most interest 
(e.g., (4 d) 3 ] = f was chosen in Zr II, as this was 
the subject of a recent measurement [7]), and use 
"Brillouin"-type excitations, viz. 4 d vd, bs vs 
(and eventually 4p -> vp ) to improve the descrip¬ 
tion of the other roots. 

After the first set of valence virtuals is deter¬ 
mined, a second set, associated with these same 
single and double valence excitations is added; 
this normally provides sufficient radial saturation 
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of the valence space. The larger triple excitations 
are also added in at this time. 

In the next calculational layer, we begin to make 
single and double excitations form the shallow 
core, viz. the 4 s and 4 p subshells, adding in a 
third set of virtuals, with larger effective Z* (de¬ 
termined during the RCI process). Important exci¬ 
tations include 4p —> vp -F vf, and 4 s -» 5 s + vs, 
Ad + vd. Decisions on what to include are based 
on the criteria that we are seeking energy contribu¬ 
tions which are differentially large or make sub¬ 
stantial contributions to HFS. These single excita¬ 
tions generally fall into both categories. 

At this stage, errors in relative energy differ¬ 
ences were still too large, on the order of a few 
tenths of an electron volt. We may reasonably 
expect the next most important corrections to come 
from the classes of excitations: Ap(Ad + 5s) -> , 
but in fact these have a differentially modest ef¬ 
fect. Next, we looked at double excitations from 
4p 6 . The most important of these, differentially, 
turn out to be 4p 2 -> 4 d 2 + 4 dvd, which we may 
ascribe to (1) the fact that l 2 -> (l + l) 2 are among 
the largest [8], and (2) Pauli exclusion effects are 
involved, i.e., the 4 d subshells are already partially 
occupied, and this tends to reduce the contribution 
in varying amounts, depending on the occupancy 
of the 4 d subshell for the level of interest. This 
finding is valid for all transition metal spectra 
studied to date, and was first noted in the Zr II 
work [7]. 

By using intermediate normalization (<f>|T /> ) = 
1, where T> is the zeroth-order function, and ^ is 
the RCI function, it is possible to assign energy 
contributions to each correlation vector. These are 
collected in Table I, for Zr II for all configurations 
contributing 0.025 eV or more to at least one root. 
The excitations present in Table I reduce the aver¬ 
age energy difference error to 0.075 eV (~ 600 
cm -1 ). On simpler systems, e.g.. La II, we have 
reduced the error to 0.02 eV or ~ 160 cm -1 [9]. 

Results 

Zr II ISOELECTRONIC SEQUENCE 

In this sequence, we avoid Y I because of the 
interpenetrating manifolds 5s 2 6s and Ad5s6s [10] 
which would increase the number of reference 
functions needed. In a relativistic treatment such 


as this, each / is a separate problem. We chose to 
study / = | because it had the greatest number of 
reference functions (10), thus making it challeng¬ 
ing, and because several / = § levels were just 
being measured for Zr II [11]. Results for Zr II 
have already appeared [7, 11], but we reproduce 
them here in Table II, as this species can provide a 
benchmark for the other sequence members for 
which no HFS measurements exist. It can be seen 
that the average error for the RCI A results is 
9.2%, in contrast to the Dirac-Fock (DF) values, 
for which two of the three results have the wrong 
sign. We also note that we are in agreement with 
the corrections of Kiess [12] to the Moore [10] 
energy levels. 

In Table III we present results for the Nb III 
member of the / = § isoelectronic sequence. For 
the five known energy differences [10], the theoret¬ 
ical error is 0.050 eV (410 cm -1 ); note that we have 
predicted the positions of four new levels, not 
available from experiment [10,13]. Though there is 
greater agreement between the DF and RCI results 
for A than in Zr II, there are still two levels 
exhibiting a sign reversal. The one and N electron 
basis sets used to generate the results are the same 
as that used for Zr II, with the obvious changes in 
radial functions (MCDF and virtuals, viz. Z*). 

In Table IV are presented the results for Mo IV 
/ = | using the same 1 and N electron basis sets 
(radial sets suitably adjusted). These calculations 
along isoelectronic sequences were considerably 
eased by the fact that almost the entire data sets 
(several thousand lines) could be reused from one 
member to the next. Here, the average error for the 
eight known [10, 14] energy levels is somewhat 
larger: 0.107 eV or 860 cm -1 , and just one new 
level has been identified. We may note that there 
is still one value of A for which the DF and RCI 
results have different signs. 

THE Hf II ISOELECTRONIC SEQUENCE 

Discussion of La I is postponed to the next 
section, as it is a thoroughly measured system and 
we have done calculations for several /'s. The first 
member of the series considered here is Hf II, 
whose results are presented in Table V. The aver¬ 
age error [10] for eight energy differences is 0.030 
eV (~ 200 cm -1 ). One new level is predicted, and 
three of the A's show large differences (sign or 
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TABLE I_ 

Energy contributions in electron volts for Zr II, J = §. All signs reversed. 


Roots 


Basis 

4 d 3 

2 D 

4d 2 5s 
+4 d 3 

2 p 

4 d 3 + 

4 d5s 2 

2 D 

4d5s 2 
+4 of 3 

2 D 

4 d 3 

4 P 

4d 2 5s 

4 P 

4 d 3 + 
4d 2 5s 

2 P 

4d 2 5s 

2 D 

4d 3 

4 F 

4d 2 5s 

4 F 

(4c/+ 5s) 3 

-0.195 

-1.041 

-0.385 

-0.140 

0.010 

0.002 

0.993 

0.738 

0.014 

0.008 

4 dvp 2 

0.269 

0.304 

0.166 

0.473 

0.056 

0.005 


0.040 

0.022 

0.005 

4 dvd 2 

0.339 

0.011 

0.228 


0.192 

0.002 

0.216 


0.161 

0.001 

4 dvf 2 

0.571 

0.054 

0.395 

0.010 

0.351 

0.000 

0.338 

-0.010 

0.244 

0.001 

4 dvg 2 

0.069 


0.044 


0.021 

0.000 

0.021 



0.000 

4 dvsvd 


0.061 

0.006 


0.000 

0.047 


0.044 


0.048 

4dvpvf 

0.090 

0.383 

0.102 


0.019 

0.196 

-0.031 

0.214 


0.192 

4d 2 vs 



0.009 


0.000 

0.027 

0.048 

0.018 

0.001 

0.022 

4d 2 vd 

0.024 

0.063 

0.066 

0.036 

0.091 

0.016 

0.075 

0.006 

0.084 

0.004 

4d 2 vg 

0.068 

0.086 

0.051 


0.034 

0.003 

0.039 

0.003 

0.014 

0.009 

5s vp 2 


0.014 

0.000 

0.097 

0.000 

0.034 

0.000 

0.009 


0.000 

5 svd 2 


0.050 

0.000 


0.000 

0.062 

0.000 



0.037 

5svf 2 


0.164 

0.000 

0.016 

0.000 

0.172 

0.000 

0.127 


0.058 

5s vs vd 

0.000 


0.000 

0.033 

0.000 

0.000 

0.000 

msm 


0.000 

5s vp vf 

o.ooo • 


0.000 

0.227 

0.000 

0.000 

0.000 

warn 


0.014 

4d5svs 



-0.002 

0.162 

0.000 

0.000 

0.001 

0.035 


0.000 

4d5svd 


0.561 

0.010 

0.047 

0.000 

0.316 

-0.017 

0.364 


0.295 

4d5svg 

0.012 


-0.001 

0.032 

0.000 

0.000 

0.000 

0.028 


0.015 

5s 2 vd ■ 



0.000 

0.434 

0.000 

0.000 

0.000 

-0.010 


0.000 

4p 5 vf4d 3 

0.911 

0.034 

1.003 


1.109 

0.021 

1.010 

0.021 

1.166 

0.020 

4p 5 vf4d 2 5s 

0.022 

0.853 

0.020 

0.054 

0.020 

0.852 

0.023 

0.805 

0.019 

0.899 

4p 5 vf 4d 5s 2 



0.000 

0.505 

0.000 

0.006 

0.000 

0.007 


0.006 

4p 5 vp4d 3 

0.012 

0.040 

0.016 


0.030 

0.042 

0.021 

0.056 

0.035 

0.084 

4p 5 vp4d 2 5s 


0.067 

0.002 

0.092 

0.004 

0.067 

0.004 

0.081 


0.079 

4p 5 vp4d5s 2 



0.000 

0.334 

0.000 

0.000 

0.000 



0.004 

4svd4d 3 

0.049 

0.014 

0.036 


0.042 

0.005 

0.032 


0.022 

0.006 

4svd4d 2 5s 

0.025 

0.056 

0.034 

0.015 

0.026 

0.041 

0.001 


0.026 

0.018 

4p 4 4d 5 

0.746 


0.630 

mSm 

0.576 

0.000 

0.621 


0.505 

0.000 

4p 4 4d 4 5s 


0.757 

0.001 

mam 

0.000 

0.754 

-0.002 

0.749 


0.680 

4p 4 4d 3 5s 2 



0.002 

0.876 

0.001 

0.000 

0.001 



0.000 

4p 5 4d 2 vdvf 

0.062 

0.011 

0.070 


0.070 

0.012 

0.074 

0.012 

0.075 

0.012 

4p 5 4d 5svdvf 


0.054 

0.000 

0.024 

0.000 

0.054 

0.000 

0.053 


0.055 

4p 5 5s 2 vdvf 



0.000 

0.029 

0.000 

0.000 

0.000 

0.000 


0.000 

4svs4d5s 2 



0.000 

0.056 

0.000 

0.000 

0.000 

0.000 


0.000 

4s4d 3 5s 


0.062 

0.004 


0.005 

0.061 

0.004 

0.045 


0.016 

4s4d 4 

0.041 


0.063 


0.057 

0.002 

0.042 


0.016 

0.002 

4p 5 vpvd 5s 2 



0.000 

0.040 

0.000 

0.000 




0.000 

Total 

3.136 

2.721 

2.576 

3.516 

2.718 

2.846 

3.520 

3.553 

2.473 

2.623 
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HYPERFINE STRUCTURE CONSTANTS OF {d + s ) 3 STATES 


TABLE II 


Fine and hyperfine structure of Zr II, J = 

= | (d + s) 3 

states. 



Level 3 


Energy (cm 

- 1 ) 

A (MHz) ft 

B (MHz) / Q(b) b 

B (MHz) b 

Config. 

SLJ 

Expt. 3 

Theory^ 

Theory 0 Expt. 

Theory 0 

Expt. 

4 d 3 

2 D 

27 699.96 

29 066 

-190.0 

224.0 


4 d 3 

2p 

20 080.30 

20 710 

111.5 109.768 

(-76.0) 

-138.4 

(-33.9) 

33.716 

4d5s 2 

2 d 

14 298.64 

14 996 

-114.7 

-82.9 


4c/ 3 

2 d 

13428.64 

14 387 

-66.8 

165.1 


4 d 3 

4 P 

9 742.80 

10 072 

134.1 110.177 

(-56.4) 

-133.3 

(-156.2) 

34.713 

4c/ 2 5s 

4 p 

7 736.02 

8 209 

-534.6 

159.5 


4d 2 5s 

p 

6111.70 

6 788 

134.3 

-132.2 


4d 2 5s 

2 d 

4 248.30 

5 084 

190.3 199.250 

(375.1) 

-28.8 

(-83.8) 

7.706 

4 d 3 

4 f 

2 572.21 

2 950 

-298.0 

-57.3 


4d 2 5s 

4p 

0.000 

0 

273.8 

60.5 



a Labeling of levels is from Moore [10] as corrected by Kiess [12]. Experimental energy is also drawn from these sources. 
b From previous work [11]. 
c Dirac-Fock results are given in parentheses. 


TABLE III_ 

Fine and hyperfine structure of Nb III, 

J = |(d + s) 3 states. 3 


Level 

Energy (cm ^ 

A (MHz) 
Theory^ 

B (MHz) / 
O(b) 
Theory 1 * 

Config. 

SLJ 

Expt. d 

Theory 0 

4d 5s 2 

2 D 


71 816 

937.7 

355.9 





(710.0) 

(326.5) 

4c/ 2 5s 

2 P 


44 448 

-1012.2 

-314.5 





(-965.6) 

(-314.8) 

4d 2 5s 

2 D 

36 536 

37111 

-1355.6 

-114.0 





(-1526.7) 

(-132.3) 

4d 2 5s 

4 p 

34 807 

35 360 

2401.4 

255.7 





(2563.0) 

(251.9) 

4d 3 

2 d 


33 870 

943.7 

371.4 





(660.3) 

(406.4) 

4d 2 5s 

4 f 

25 221 

25 061 

-1149.3 

103.2 





(-1361.0) 

(100.8) 

4d 3 

2 d 


13 602 

289.6 

-42.78 





(427.4) 

(-8.66) 

4d 3 

2 P 

10912 

11 411 

-47.82 

-248.5 





(378.8) 

(-271.2) 

4 d 3 

4 P 

8 608 

8 864 

-267.7 

-215.8 





(310.4) 

(— 241.2) 

4 d 3 

4 p 

0.0 

0.0 

1128.0 

-104.3 





(659.4) 

(-114.9) 

a A = 

u. 

92.90, 

Z — 41, / = 

2 ' f 1 ! ~ 

6.167. 



b From Iglesias [13]. 

c Uses same configurational basis set as Zr II, J - f. 
d Dirac-Fock results are given in parentheses. 


TABLE IV_ 

Fine and hyperfine structure of Mo IV, 

J - |(d + s) 3 states. 3 


Level 

Energy (cm ^ 

A (MHz) 

B (MHz) / 
Q(b) 

Config. 

SLJ 

Expt. b 

Theory 0 

Theory 1 * 

Theory 1 * 

4d5s 2 

2 D 


146 242 

-342 

(-264) 

501 

(456) 

4d 2 5s 

2 P 

81 052 

81 858 

423 

(387) 

-446 

(-436) 

4d 2 5s 

2 D 

74 009 

74 862 

493 

(510) 

-170 

(-180) 

4d 2 5s 

4 P 

72163 

73 065 

-871 

(-919) 

350 

(345) 

4d 2 5s 


60 893 

60 794 

447 

(511) 

143 

(138) 

4d 3 

2 d 

39 231 

41 548 

-272 

(-247) 

524 

(559) 

4 d 3 

2 d + 2 p 

17107 

17 857 

-106 

(-141) 

10.9 

(37.6) 

4d 3 

2 p+ 2 d 

14176 

14 920 

-86.0 

(-162) 

-388 

(-419) 

4d 3 

4 P 

10 331 

10 738 

48.7 

(-112) 

-315 

(-340) 

4d 3 

4 p 

0.0 

0.0 

-379 

(-246) 

-151 

(-161) 

a A = 

u 

95.94, Z = 

42, / = |, 

l-Lf — 0.9133. 



b From Fernandez et al. [14]. 
c Uses same configurational basis set as Zr II, J = f. 
d Dirac-Fock results are given in parentheses. 
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TABLE V_ 

Fine and hyperfine structure of Hf II, 

J = f (d + s) 3 states. 3 


Level 


Energy (cm ') 

A (MHz) 

B (MHz) / 
Q(b) 

Config. 

SLJ 

Expt. b 

Theory 0 

Theory^ 

Theory 0 ' 

5 d 3 

2 D 


45 798 

113.1 

(104.0) 

476.0 

(533.0) 

5 d 3 

2 P 

37 324 

37198 

-60.0 

(48.6) 

-99.9 

(35.5) 

5 d 3 

2 D 

30 594 

30 479 

77.4 
(81.7) 

-337.0 

(-361.0) 

5 d 3 

4 p 

27 285 

26 958 

-121.1 

(43.0) 

-290.0 

(-308.0) 

5 d 3 

4 F 

18 897 

18 395 

140.8 

(107.8) 

-162.0 

(-182.0) 

5d 2 6s 

2 p 

17 830 

17 478 

-388.0 

(-496.0) 

-214.0 

(-261.0) 

5d 2 6s 

2 d 

14 359 

14 446 

-224.0 

(-258.0) 

-170.0 

(-284.0) 

5d 2 6s 

4 P 

12 920 

12815 

638.0 

(686.0) 

320.0 

(312.0) 

5d 2 6s 

4 f 

3 644 

3336 

-343.0 

(-445.0) 

193.0 

(121.0) 

5d6s 2 

2 d 

0 

0 

1.88 

(117.3) 

342.0 

(458.0) 


a A = 178.49, Z = 72, / = £, ^, = 0.61. 

5 From Moore [10]. 

c Uses same configurational basis set as Zr II, J = §. 
d Dirac-Fock results are given in parentheses. 


TABLE VI_ 

Fine and hyperfine structure of Ta III, 

J = §(d + s) 3 states. 3 


Level 

Energy (cm ^ 

e (MHz) / 
A (MHz) 0(b) 

Config. 

SLJ 

Expt. b Theory 0 

Theory 0 Theory 0 

5d 3 

2 D 

42 299 

478.9 679.1 

(606.7) (756.7) 

5 d 2 6s 

2 P 

38 054 

-1140.0 -347.0 

(-1917.0) (-581.0) 

5d 2 6s + 5d 3 2 D 

30 834 

-18.7 545.0 

(678.0) (687.0) 

5d 3 + 5d 2 6s 
+5d6s 2 

2 d 

24 836 

-415.3 -220.7 

(469.3) (- 365.5) 

5 d 2 6s 

4 P + 2 P 

19 683 

308.6 445.9 

(357.7) (474.0) 

5 d 3 

4 p + 2 P 

16 363 

-165.1 -235.3 

(240.4) (-128.0) 

5 d 3 

4 P + 2 d 

13402 

-618.5 -295.2 

(337.0) (-470.0) 

5 d 3 

2 P + 4 P + 4 f 

7814 

-134.6 -279.6 

(357.0) (-413.0) 

5d 2 6s 

V 

5 541 

-1961.7 110.3 

(-2433.1) (120.7) 

5 d 3 

4 p 

0 

926.7 -299.4 

(663.2) (-338.0) 


a A = 180.95, Z = 73, / = f , jjl, = 2.36. 
b Uses same configuration basis set as Zr II, J = f. 
c Dirac-Fock results are given in parentheses. 


order of magnitude) between the DF and RCI 
results. Most of our tables also contain predictions 
for the electric quadruple constant, B, divided by 
the quadrupole moment, Q. One can either extract 
B, if Q is known [15], or if a measurement of B is 
available, as in Zr II [11], a determination of Q 
may be made. 

In Table VI, we present our results for the Ta III 
member of this sequence. Since we were unable to 
find any experiment work on energy levels, our 
results constitute predictions for the relative posi¬ 
tions of nine levels. In this case, there are five A's 
exhibiting substantial differences between DF and 
RCI results. 

Table VII contains our results for the W IV 
member of this sequence. Again, there appears to 
be no available experiment, so nine new levels are 
predicted. Three A's exhibit sign changes between 
the DF and RCI value. Studies in the Zr II and Hf 
II sequences were terminated once we reached 


species of lesser technological interest (in our judg¬ 
ment); dramatic differences between at least a few 
RCI and DF A values are likely to persist beyond 
the ions reported here. 

RESULTS FOR La I 

La I has been well studied experimentally, for 
both energy differences [16] and HFS constants 
[17]. In addition to reporting results for / = f, we 
also did calculations for J = j and j as the G FIFS 
constants (B's), exhibit some substantial second- 
order (from A) contributions. 

In Table VIII our results for / = § are presented 
and compared to experiment. For eight energy 
differences, the average error is only 0.028 eV (227 
cm -1 ), and one new level is predicted. The largest 
errors are associated with the uppermost levels, 
which may suggest the presence of significant sec¬ 
ond-order electrostatic effects (see Discussion sec- 
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HYPERFINE STRUCTURE CONSTANTS OF (of + s ) 3 STATES 


TABLE VII_ 

Fine and hyperfine structure of W IV, 
J = |(d + s) 3 states. 3 


Level 

Energy (cm h 

A (MHz) 

Config. 

SLJ 

Expt.k 

Theory 0 

Theory 0 

5d 6s 2 

2 D 


89111 

274.0 

(316.0) 

5 d 2 6s 

2 p 


66 789 

-863.0 

(-1018.0) 

5 d 2 6s 

2 d 


53 377 

-601.0 

(-732.0) 

5 d 2 6s 

4 P 


48 886 

1458.0 

(1551.0) 

5 d 3 

2 d 


47 207 

299.0 

(282.0) 

5 d 2 6s 

4 f 


32 565 

-1058.0 

(-1172.0) 

5 d 3 

2 P 


26313 

-51.5 

(145.0) 

5 d 3 

4 p+ 2 d 


19 578 

-36.4 

(158.0) 

5 d 3 

4 p + 2 P 


11 113 

-68.4 

(160.0) 

5c/ 3 

4 P 


0 

458.0 

(309.0) 


a A = 183.85, Z — 74, l ~ jx, = 0.117. 
b Uses same configurational basis set as Zr II, J = f. 
c Dirac-Fock results are given in parentheses. 


tion). The average error for the six measured A's 
is 10.3%, with the maximum error 19.0%. While 
there are no sign changes (DF vs. RCI) in these 
bottom six A's , the top two of them show large 
differences, and for two of the unmeasured A's 
there are sign changes. For the six B's, the average 
error is 32%, and the RCI values represent at best 
only a marginal improvement over DF values. For 
the smallest measured B (-6.00, we estimate a 
12.4% correction from second-order HFS effects 
(see below). 

The average energy difference errors for J = f 
and / = f, shown in Table IX, are 0.17 and 0.13 eV, 
respectively (1350 and 1020 cm" 1 ). This is indica¬ 
tive that the basis set of Table I, so useful for 
/ = §, needs improvement for these J's. Our more 
accurate studies on La II [9], Nb II [18], and Cs II 
[19] contain suggestions of what might be added. 
Comparing the A's, on the other hand, with obser¬ 
vation [17] yields an average error of 11% for / = f 


and 14% for / = f, which is comparable to our 
J = | accuracy. 

Both Tables VIII and IX contain results for B's, 
the electric quadrupole constants; those for the 
5d 2 6s 2 G's involve large second order HFS effects 
which we discuss in the next section. 

ELECTRIC QUADRUPOLE CONSTANTS FOR 
La 

When |B/A| 1 and fine structure (FS) split¬ 

tings are small, second-order contributions from A 
may be important; experiment measures net HFS, 
not individual poles (M lx E 2 ). Lower-order poles 
(M x ) must be treated in high enough order so 
errors are smaller than those desired in the pole of 
interest. Based on the experimental work [17], the 
four levels with the largest discrepancies [ B obs vs. 
^corrected ^ i n La I are shown in Table X. Only the 
last two are of interest here, as we don't work on 
/= 2 - In Table X, the B obs values are deduced 
from experiment without any second-order correc¬ 
tions as determined semi-empirically by fitting 
experiment [17]. This does not produce an entirely 
satisfactory fit to the directly observed frequencies 
[17]. 

The second-order correction may be expressed 
as in [17] as 

< 2 g 7/2/ f\h(m 1 )\ z g 9/2 , f>| 2 
e( 2 g 7/2 ) - e( 2 g 9/2 ) 

where H(Mj) is the relativistic magnetic dipole 
operator, and the denominator is the FS difference. 
In Ref. [17] this was initially evaluated at the 
MCDF level, but that yielded too small corrections. 
Here we show that the RCI second-order contribu¬ 
tion is adequate to explain observation without 
having to introduce extra semi-empirically deter- 
mined parameters. 

In the spirit of Ref. [20], we may write the 
numerator as 

W /r (MHz) 

= (-d j+f - / 

X 
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where 

ZX 7 

A jy = 13074.7—=======-< JJlT^I/'/')• 

VM/> + 1 > 

Here /x z is the magnetic dipole moment [15], 
is the Qth component [Q = } — }'] of the relativis¬ 
tic magnetic dipole component and { } is a 6/ 
symbol. The constant factor is a product of c and a 
factor to convert atomic units to megahertz. In 
evaluating Ajy we take nonorthonormality effects 
fully into account [21]. Our RCI result for 
A(5d 2 6s 2 G 7/2r 5d 2 6s 2 G 9/2 ) is -504.8 MHz, 
whereas our MCDF value is T 343.7 MHz. 

Our RCI results for the electric quadrupole 
constants, using experimental [15] Q's are 
B[5c/ 2 6s 2 G 7/2 ] = 112.6 MHz and B[5d 2 6s 2 G 9/2 ] 
= 108.3 MHz. These results are obtained from the 


Zr II type bases, with 3p vp excitations added. 
These results are in good agreement with the 
semi-empirical B (corrected) values [17] which are, 
respectively, 111.512 and 127.571 MHz. These val¬ 
ues appear in Table IX. 

It is useful to look at what is happening in 
detail which we do in Table XI. By columns, these 
tables contain (1) the experimental energy differ¬ 
ences (transitions between HFS levels), (2) exper¬ 
iment with the magnetic dipole contribution 
removed, using the observed A [labeled A E expt - 

8( A obs I • J>], (3) the E 2 contribution obtained from 
our RCI B using first-order perturbation theory, 
Brci, (4) the second-order M 1 contributions using 
our RCI A, labeled - 8A^ 2 \ and (5) a final column 
summing columns (3) and (4), X Th . Experimental/ 
theoretical discrepancies are obtained by compar¬ 
ing columns (2) and (5). It can be seen that the 


TABLE VIII 
Fine and hyp 








•erfine structure of 139 

La 1, J = | (d + s) 3 

states. 




Level 


Energy (cm ^ 

A (MHz) 


e (MHz) 

d 

Config. 

SLJ 

Expt. b 

Theory 0 

Theory* 5 

Expt. c 

Theory* 5 

Expt. c 

5 d 3 

2 D 


27680 

212.6 


36.4 






(175.1) 


(38.8) 


5 c/ 3 

2 P 

21 037 

21 614 

-64.0 


-14.1 






(81.9) 


(-1.03) 


5 d 3 

2 D 

18 038 

18 640 

190.2 


-24.3 






(145.3) 


(-23.3) 


5c/ 3 

V 

16735 

16 753 

-231.8 


-17.7 






(68.1) 


(-22.8) 


5c/ 3 

4 p 

12 431 

12 361 

391.8 

445.0 

-9.87 

-16.07 





(176.15) 


(-11.3) 


5d 2 6s 

2p 

9719 

9 726 

-530.7 

-655.0 

-24.2 

-33.54 





(-188.96) 


(-24.0) 


5c/ 2 6s 

2 d 

8 446 

8 692 

— 422.9 

-423.0 

-4.45 

-6.00® 





(-321.39) 


(-16.2) 


5c/ 2 6s 

4 p 

7 491 

7 556 

842.6 

930.0 

25.0 

34.72 





(844.26) 


(22.9) 


5 d 2 6s 

4 f 

2 668 

2 436 

-414.7 

-480.0 

10.7 

16.34 





(-398.55) 


(9.26) 


5c/6s 2 

2 d 

0 

0 

129.7 

141.0 

29.9 

44.78 





(111.23) 


(31.8) 



a Experimental energy is taken from Martin et al. [16]. 

b This work. The values in parentheses are MCDF values taken from same reference as footnote c . 
c From Childs and Nielsen [17]. 
d Q is from Raghavan [15]. 

e Estimated second-order HFS contribution ~ 12.4% (see text). 
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TABLE IX 


Fine and hyperfine structure of 139 La 1, J = 

J, |(d + s) 3 

states. 




Level 


Energy (cm 

r 1 ) 

A (MHz) 


e (MHz) 

, d 

Config. 

SLJ 

Expt.* 3 

Theory 0 

Theory* 3 

Expt. c 

Theory* 3 

Expt. c 

5d 3 

2r 

'7/2 

18 449 

20 920 

50.8 

(187.0) 


28.5 


5 d 3 

2 g 

U 7/2 

13 529 

15 282 

137.7 

(182.8) 


35.1 


5 d 3 

4 

' 7/2 

9 744 

10121 

-18.2 

(190.9) 

-19 

-19.3 

-20.9 

5d 2 6s 

G7/2 

6 466 

7 493 

-253.7 

(-189.7) 

-292 

113 

112 

5 d 2 6s 

2 C 

'7/2 

4 558 

5 659 

-170.7 

-197 

37.5 

40.3 




(81.9) 




5c/ 2 6s 

%/2 

0 

0 

411.3 

(507.9) 

463 

17.9 

16.1 

5 d 3 

2 «9/2 

14194 

15 704 

104 


47.2 





(125) 




5 d 3 

^9/2 

13019 

14 357 

69 


34.3 





(131) 




5 d 3 

4 f 

'9/2 

9626 

9 968 

-53 

-64 

-20.6 

-27.4 




(82) 




5d z 6s 

£*9/2 

5 798 

6694 

471 

560 

108 

128 




(432) 




5d z 6s 

4 f 9/2 

0 

0 

440 

490 

24.2 

31.5 




(451) 





a Experimental energy is taken from Martin et al. [16]. 
b This work; RCI. The values in parentheses are MCDF values. 

c From Childs and Nielsen [17]. All B’s except those for 5d 34 F are obtained using second-order HFS effects. 


agreement is quite good; and that the second-order 
effects associated with M l can be an appreciable 
fraction of the first-order E 2 effects. 


Discussion 

A cautious error estimate for unmeasured A's 
might be 20-25% for RCI values, except for those 
below 50 MHz in magnitude. Errors for these 


might be 50-100%. These estimates are based, in 
part, in how A changes as the calculation pro¬ 
ceeds. Although we have not emphasized, in de¬ 
tail, the importance of the relative positioning of 
d n ~ 1 s levels in this work, this is more thoroughly 
explored in the work on La II [9] and Cs II [19]. 

In general, uppermost levels exhibit large errors 
in energy positions and HFS constants. This is 
because these levels are closer in energy to "Ryd¬ 
berg" excited states than the lowest levels. Excita- 


TABLE X_ 

Experimental [17] results for HFS constants [B’s] exhibiting significant second-order effects. 

Level (cm- 1 ) Label A (MHz) B obs (MHz) B corrected (MHz) 


3 010.002 

5 d 2 6s 

4 f 

'5/2 

300.563 

10.873 

7.800 

9183.797 

CO 

CD 

OJ 

"O 

in 

2 n 

^5/2 

876.319 

-2.772 

-5.404 

9 919.821 

5c/ 2 6s 

2 r 

n U 9/2 

559.812 

202.638 

127.571 

9 960.904 

5d 2 6s 

2 g 7 , 2 

-292.267 

67.537 

111.512 
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TABLE XI_ 

La I 5 d 2 6s 2 G electric quadrupole results (MHz) including second-order effects. 8 


Transition 


F F' 

AEobs 

^^obs ^^obs 1 

^RCI 

- SA < 2 > 

^Th 

7-6 

-2016.965 

J = l A ob s= -292.267 MHz 
28.904 

46.44 

-12.9 

33.5 

6-5 

-1746.669 

6.933 

11.13 

0 

11.13 

5-4 

-1468.190 

-6.855 

-11.13 

7.80 

-3.33 

4-3 

-1182.851 

-13.783 

-22.08 

11.3 

-10.78 

3-2 

-891.992 

-15.191 

-24.37 

11.4 

-13.0 

2-1 

-596.977 

-12.443 

-20.3 

9.05 

-11.3 

1-0 

-299.185 

-6.918 

-10.4 

4.95 

-5.44 

8-7 

4555.752 

j = |A obs = 559.812 MHz 
77.256 

43.04 

31.6 

76.64 

7-6 

3943.948 

25.264 

14.07 

12.9 

27.0 

6-5 

3349.169 

-9.703 

-5.29 

0 

-5.29 

5-4 

2768.920 

-30.14 

-16.84 

-7.80 

-24.64 

4-3 

2200.726 

-38.522 

-21.5 

-11.3 

-32.8 


a Entries are explained in text. Column 2 constitutes the experimental prediction and the last column constitutes the theoretical 
prediction. 


tions from the core artificially pull all the (d Hb s) n 
levels away from these Rydberg levels, because 
they aren't given equivalent core correlation. 
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ABSTRACT: Haberlen and Rosch (HR) demonstrated [Chem. Phys. Lett. 199, 491 
(1992)] the feasibility of performing scalar-relativistic, density functional theory (DFT) 
linear combination of Gaussian-type orbitals-fitting function (LCGTO-FF) calculations on 
clusters of atoms using an "incomplete" Douglas-Kroll transformation. Some of the 
approximations used in their multiatom calculations have not yet been fully explored for 
isolated atoms, especially the neglect of matrix elements involving vector products of the 
momentum and the use of fitting functions. In this investigation, scalar-relativistic 
LCGTO calculations (without fitting functions) were carried out at four levels of 
approximation, including that employed by HR, for four atoms: Ce, Au, Pb, and Pu. The 
results of these calculations suggest that the HR approximation should be applicable to 
atoms through the light actinides, so long as spin-orbit effects are unimportant. A 
comparison of very large basis-set results with results obtained using ordinary-size basis 
sets indicates that the valence (bonding) states from scalar-relativistic LCGTO-FF 
calculations should be no more sensitive to the orbital basis-set size than are their 
nonrelativistic counterparts. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 565-574, 
1997 


Introduction 

T he linear combinations of Gaussian-type or¬ 
bitals-fitting function (LCGTO-FF) technique 
is now used routinely to perform all-electron, full- 
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the U.S. Government retains a nonexclusive, royalty-free license 
to publish or reproduce the published form of this contribution, 
or to allow others to do so, for U.S. Government purposes. 
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potential, density functional theory (DFT) elec¬ 
tronic structure calculations on a wide range of 
systems, including isolated clusters of atoms [1-3], 
ID periodic polymer chains [4-6], 2D periodic 
ultrathin films [7-9], and 3D periodic crystalline 
solids [10-12]. This ability to treat localized 
(molecular) and periodically extended (crystalline) 
systems on an equal footing is one of the primary 
advantages of the LCGTO-FF method over other 
existing DFT electronic structure techniques. For 
example, the various linearized techniques used in 
solid-state physics (FLAPW, FLMTO, etc.) have 

CCC 0020-7608 / 97 / 050565-10 
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not yet been demonstrated to be useful tools for 
studying isolated clusters of atoms. Thus, the 
LCGTOFF method is one of the few all-electron, 
full-potential methods that can be used to bridge 
the gap between quantum chemistry and solid- 
state physics. To this extent, the LCGTO-FF tech¬ 
nique may be viewed as a "universal" band-struc¬ 
ture technique. 

One limitation of the LCGTO-FF method has 
been the difficulty associated with applying any 
fixed-basis method to heavy-atom systems, due to 
the lack of a stable method for incorporating rela¬ 
tivistic corrections into such calculations [13]. 
Haberlen and Rosch [14-16] (HR) demonstrated 
the feasibility of performing scalar-relativistic 
LCGTO-FF calculations on clusters containing 
heavy atoms using an "incomplete" Douglas- 
Kroll [17] (DK) transformation (the HR approx¬ 
imation hereafter) that requires only a minimal 
increase in computational effort relative to the cor¬ 
responding nonrelativistic calculation. The HR 
scalar-relativistic LCGTO-FF method (LCGTO-DF 
in their notation) has since been successfully ap¬ 
plied to a number of heavy-atom systems [18-26]. 
This development is of great significance because 
it substantially increases the range of applicability 
for LCGTO-FF calculations. 

Three aspects of the HR scalar-relativistic 
LCGTO-FF method have not yet been fully ex¬ 
plored for isolated atoms: (1) the effect of neglect¬ 
ing all matrix elements involving vector products 
of the momentum; (2) the extent of sensitivity to 
the orbital basis-set size; and (3) the effect of using 
charge and exchange-correlation (XC) fitting func¬ 
tions in a scalar-relativistic calculation. In the cur¬ 
rent investigation, scalar-relativistic LCGTO calcu¬ 
lations (without fitting functions) are carried out 
for four atoms (Ce, Au, Pb, and Pu) in an attempt 
to resolve these issues. In the next section, the 
scalar-relativistic LCGTO method is reviewed and 
the four approximations considered here are de¬ 
fined, including the HR approximation. Results are 
presented in the third section. 


Scalar-Relativistic LCGTO DFT Method 

The formulation of scalar-relativistic DFT used 
by HR [14] begins with the four-component 
Dirac-Kohn-Sham (DKS) equations [27]: 


where 


V'ff = V n + + V xc (2) 

is the effective one-electron potential, composed of 
the nuclear potential v n , the classical electronic 
Coulomb potential v c , and the DFT XC potential 
v xc . The eigenvalues of the DKS equations are 
unbounded, above and below, since they include 
both electron and positron degrees of freedom. For 
this reason, any attempt to solve the DKS equa¬ 
tions variationally with a truncated basis set will 
lead to the so-called variational collapse problem, 
unless some specially designed collection of basis 
functions is used [13]. In principle, the electron 
and positron degrees of freedom in the DKS equa¬ 
tions can be decoupled exactly to produce two sets 
of two-component equations whose eigenvalues 
are bounded [28]. The resulting second-order 
equations are, however, nonlinear in e t and are 
not amenable to solution with a matrix method. 
Both of these difficulties can be circumvented by 
performing some unitary transformation on the 
DKS equations that approximately decouples the 
electron and positron degrees of freedom while 
maintaining linearity. This is the approach fol¬ 
lowed by HR [14-16]. 

REDUCTION TO TWO-COMPONENT FORM 

The approximate decoupling of the DKS equa¬ 
tions into two sets of two-component equations 
begins with a free-particle Foldy-Wouthuysen [29] 
(FW) transformation using the unitary transforma¬ 
tion operator 


U 0 = A p (l + fiRf), 

where 


A 


p 


E p + me 2 
2JEL 


-|l/2 


R< 4) = K p a • p, 

K p = c /{ E p + mc2 )' 
E p = c(p 2 + m 2 c 2 ) 1/2 . 


(3) 


(4) 

(5) 

( 6 ) 
(7) 


This transformation yields an alternative four- 
component equation 


h( DKS<Pi = [(cot • p + /3mc 2 ) + v eff ]if/, = e,iA„ (1) 


h\\ = [ pE p + S’j + ^ J (8) 
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where and ^ are even and odd operators with 
respect to commutation with (3: 

^=A p [v eff +R^v eff Rf>]A p (9) 

^ = /3A p [ Rfv eff - v eff Rf]A p . (10) 

Note that, up to this point, no approximations 
have been made beyond those inherent in the 
original DKS equation. 

It can be demonstrated that the even (odd) 
operators do not (do) couple the positive and neg¬ 
ative kinetic-energy states of the free-particle DKS 
equation [30]. Equation (8) can then be reduced to 
an approximate two-component form by simply 
dropping the odd operator to get 

h¥pp<t>i = e i 4>i t ( 11 ) 

hfpp = E p + A p [v efI + Rfv eff Rf]A p , 

where R (2) is the two-component equivalent of 

Rf = K p <r • p. (12) 

This approximation is known as the free-particle 
projector (FPP) approximation because it can also 
be obtained by surrounding h$ KS with free-par¬ 
ticle projection operators, thereby forcing the elec¬ 
tronic wave functions in the presence of v e ^ to be 
constructed from a superposition of positive-en¬ 
ergy free-particle states [30]. The FPP approxima¬ 
tion should be reasonable if the effective potential 
is only weakly dependent on the position, a re¬ 
quirement that is clearly not satisfied near a nu¬ 
cleus. 

The primary shortcoming of the FPP approxi¬ 
mation is its total neglect of the mixing of positive 
and negative energy free-particle states induced by 
the presence of the odd operator ^ in hf\ An 
improved two-component equation can be ob¬ 
tained by performing another unitary transforma¬ 
tion on /i ( x 4) that approximately eliminates To 
this end, Douglas and Kroll [17] observed that the 
operator 


llj = (1 + W 2 ) 1/2 + W (13) 

will be unitary if W is anti-Hermitian. W can then 
be selected such that the transformation Lf x elimi¬ 
nates &\ from /t ( ! 4) to first-order in W. Carrying out 
this procedure and expressing the integral opera- 
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tor W in momentum-space yields 

W„ = A n (R n - R„')A„,v' (14) 

r ' r r r r r r > r 

with 



p e //(p/p') 
E„ + E„, ' 

V V 


(15) 


where v e /f{ p,p0 is the momentum-space represen¬ 
tation of v e ff and R p is now assumed to take on its 
two- or four-component representation as needed. 
Applying U r to hf\ expanding the resulting oper¬ 
ator to second-order in W, and dropping all 
higher-order odd operators yields the second-order 
DK equation [17, 31, 32]: 




(E p W 2 + W z E p + 2WE p W) 


4>i 


e i4>i- 


(16) 


This approximation is often referred to as the ex¬ 
ternal-field projector (EFP) approximation because 
it effectively projects out the electronic solutions to 
the DKS equation for a particle moving in the 
external-field v e ^. 


SCALAR-RELATIVITY 

The approximate two-component equations pre¬ 
sented above [Eqs. (11) and (16)] are fully relativis¬ 
tic, in the sense that they include mass-velocity, 
Darwin, and spin-orbit coupling corrections. A 
scalar-relativistic formulation is desired here to 
allow a direct comparison with previous work. In 
the FPP equation [Eq. (11)], the spin-orbit correc¬ 
tions are all contained in the term A p R p vR p A p . 
Using the standard properties of the Pauli matrices 
( a ), this term can be rewritten as 

A p R p vR p A p 

— A p K p (p • vp + i(T * p X vp)K p A p . (17) 

The term involving p • vp is scalar-relativistic, 
whereas the term involving a • p X vp is a 
spin-orbit coupling term. Thus, the scalar-relativ¬ 
istic FPP approximation does not require matrix 
elements of p X vp. 

Separating out the spin-orbit coupling terms for 
the EFP equation is somewhat more complicated 
because of the second-order terms in Eq. (16). The 
basic properties of the external-field correction 
terms can all be illustrated with the operator W 2 . 
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Since the momentum-space operators in W com¬ 
mute, except for v', W 2 can be expressed as 

W 2 = (A p R p v f R p A p )(A p v f A p ) 

- ( A p v r A p )( p 2 K p )( A p v'A p ) 

+ ( A p v'A p )( A p R p v'R p A p ) 

- ( A p R p v'R p A p )( p 2 K 2 ) \A p R p v’R p A p ). 

(18) 

The second term in this expression is scalar-relativ¬ 
istic as written. The first and third terms can be 
easily split into Darwin and spin-orbit coupling 
terms using Eq. (17). The last term is more compli¬ 
cated because it behaves like 

(A p R p v'R p A p ) 2 

= A p K p (p • v'p + i<r • p X v'p)K p A p ] 2 . 

(19) 

Inspection of the rhs of Eq. (19) reveals that the 
squares of the individual factors in the parentheses 
are both scalar-relativistic, while their cross-prod¬ 
ucts are spin-orbit coupling terms. Thus, even the 
scalar-relativistic version of the EFP approxima¬ 
tion requires matrix elements of p X vp. Through¬ 
out the remainder of this work, it will be assumed 
that all spin-orbit coupling terms in hfp P and /? ( |^ P 
are neglected. 

INCOMPLETE TRANSFORMATIONS 

The FPP and EFP equations both involve the 
operators E p/ A p/ and K p . Although these opera¬ 
tors are easy to work with in momentum-space, 
analytical evaluation of the GTO matrix elements 
for these operators has thus far proven to be im¬ 
practical. This difficulty can be avoided by trans¬ 
forming the GTO basis set to an approximate mo¬ 
mentum-space representation by diagonalizing the 
nonrelativistic kinetic-energy (KE) matrix [33]. 
First, the matrix elements of p • vp and p X vp (if 
needed) are evaluated along with the usual nonrel¬ 
ativistic matrix elements. Next, the nonrelativistic 
KE matrix is diagonalized to obtain approximate 
eigenfunctions of p 2 and the matrices are trans¬ 
formed to this approximate momentum space. The 
operators E , A , and K p can then be constructed 
from the approximate p 2 eigenvalues. These basic 
components are used to form the complicated ma¬ 


trices, like A p R p v'R p A p/ which are then back- 
transformed to the original basis set. 

The approximate momentum-space representa¬ 
tion becomes exact in the limit of a complete basis. 
Conversely, as the basis set is reduced, the approx¬ 
imate momentum-space representation can be¬ 
come a source of serious error. Hess [31] tested 
this approximation by calculating relativistic 
Hartree-Fock total and one-electron energies for 
Br (Z = 35) and Ag (Z = 47) using fairly large 
primitive basis sets that were contracted to obtain 
a wide range of basis-set sizes. Hess concluded 
that the results were sufficiently sensitive to war¬ 
rant using the uncontracted basis sets whenever 
possible. (Jansen and Hess [32] carried out similar 
tests on Au [Z = 79] with contracted basis sets 
ranging from 13s9p6dl/ to 17sl2p8d4/, but only 
studied the total energy variation in detail.) The 
requirement of a large basis set to ensure a good 
momentum-space representation conflicts with the 
obvious need to use rather heavily contracted or¬ 
bital basis sets for heavy atoms to reduce the 
computational demands of such systems. 

HR overcame this paradox by only applying the 
relativistic transformations associated with the FPP 
and EFP approximations to the one-electron por¬ 
tions of the DKS equations, i.e., the kinetic energy 
and nuclear potential. These incomplete transfor¬ 
mations allow all of the relativistic matrix ele¬ 
ments to be initially evaluated using the primitive 
GTO basis set and then transformed to a much 
smaller contracted basis set prior to beginning the 
iterative SCF cycle [15]. The more numerous two- 
electron integrals associated with v c then only need 
to be evaluated for the smaller contracted basis set. 
These nuclear-only versions of the FPP and EFP 
approximations will be denoted as the FPPn and 
EFPn approximations, respectively. HR further re¬ 
duced the complexity of the EFPn approximation 
by dropping all of the terms involving p X vp; this 
then is the HR approximation. 

COMPUTATIONAL DETAILS 

To test the approximations discussed, an exist¬ 
ing program for performing nonrelativistic LCGTO 
DFT calculations on atoms, without the use of 
fitting functions, (ATOM [34]) has been rewritten 
as a scalar-relativistic code (SATOM). In SATOM, 
each term in the effective one-electron potential 
(v u , v c , and v xc ) can be treated nonrelativistically 
(NR) or with one of three scalar-relativistic approx¬ 
imations, FPP, EFP without vector product terms, 


568 


VOL 65, NO. 5 




SCALAR-RELATIVISTIC LCGTO DFT CALCULATIONS 


or EFP with vector product terms. The HR approx¬ 
imation then corresponds to treating v c and v xc 
nonrelativistically and treating v n with the EFP 
approximation without vector product terms. 

Four atoms, spanning a wide range of atomic 
numbers, were studied here: Ce (Z = 58), Au (Z 
= 79), Pb (Z — 82), and Pu (Z = 94). In each case, 
LCGTO results have been obtained nonrelativisti¬ 
cally and with four scalar-relativistic approxima¬ 
tions: full EFP, EFPn, HR, and FPPn. For Ce and 
Pb, all calculations were carried out using the Xa 
LDA model (a = 0.7) to allow direct comparison 
with previous LCGTO-FF and numerical results 
[16, 35]. For Au and Pu, the Hedin-Lundqvist [36] 
(HL) model was used to allow a comparison with 
numerical results obtained from a code written by 
Wills [37]. 

The present LCGTO calculations for Ce em¬ 
ployed a high-quality 2lsl6plld9f basis set de¬ 
scribed by Haberlen [16]. Unfortunately, no com¬ 
parable quality basis sets have been published 
for Au, Pb, and Pu. For this reason, large 
30s24pl8dl2/ even-tempered basis sets were used 
for the three larger atoms. The (minimum:maxi- 
mum) exponents of the s, p, d, and / parts of the 
large basis sets for Au and Pb were (0.02:1 X 10 8 ), 


(0.01:1 X 10 5 ), (0.05:2 X 10 4 ), and (0.1:2 X 10 3 ), re¬ 
spectively. For Pu, the limits for the d basis set 
were changed to (0.02:1 X 10 4 ). To study the ef¬ 
fects of basis-set size on the results, the calcula¬ 
tions for Au and Pb were repeated with smaller 
basis sets. The small basis used for Au was a 
20sl5plld6/ basis derived from Gropen's [38] 
19sl4pl0d5/ basis by adding one exponent for 
each symmetry: s, 0.18; p, 0.20; d, 0.12; and /, 1.4. 
The small basis for Pb was the 20sl7plld6f basis 
employed by Knappe and Rosch [35] in their pro¬ 
totype scalar-relativistic LCGTO-FF calculations. 


Results 

The rich basis-set results for the NR and scalar- 
relativistic one-electron energies and total energies 
for Ce, Au, Pb, and Pu are compared with results 
from numerical calculations in Tables I-IV. In each 
case, the NR results are in excellent agreement, 
with the deepest core levels differing by no more 
than 0.5 eV and the total energies differing by no 
more than 0.3 Ry, demonstrating the high quality 
of the large basis sets. A careful inspection of 
Tables I-IV also reveals that, in each case, the EFP 


TABLE I_ 

Numerical and LCGTO results for the negatives of the nonrelativistic one-electron eigenvalues (in eV) and total 
energies (in Ry) are compared for paramagnetic Ce. Scalar-relativistic corrections to the nonrelativistic 
results are listed for various approximations. The numerical results are from [16]. The LCGTO results were 
calculated here with a 21s16p11d9f basis. All calculations used the Xa model with a = 0.7. 


Nonrelativistic Scalar-relativistic corrections 



Num 

LCGTO 

Num 

EFP 

EFPn 

HR 

FPPn 

Is 

38286.20 

38285.95 

1932.93 

1866.18 

1838.04 

1835.17 

2723.95 

2 s 

5956.84 

5956.84 

485.09 

477.75 

474.55 

473.61 

565.31 

3s 

1281.39 

1281.41 

105.83 

104.38 

103.83 

103.54 

121.63 

4s 

256.50 

256.52 

23.40 

23.11 

23.01 

22.92 

26.99 

5s 

36.54 

36.57 

3.45 

3.42 

3.41 

3.39 

4.04 

6 s 

2.99 

3.00 

0.23 

0.22 

0.22 

0.22 

0.26 

2 P 

5637.49 

5637.49 

133.70 

133.25 

130.98 

136.38 

131.86 

3 P 

1142.79 

1142.81 

31.26 

31.30 

30.94 

31.95 

30.70 

4p 

204.75 

204.78 

5.51 

5.53 

5.48 

5.68 

5.29 

5p 

22.38 

22.42 

0.16 

0.19 

0.18 

0.21 

0.08 

3d 

883.62 

883.62 

-12.49 

-12.64 

-12.75 

-12.96 

-14.12 

4 d 

113.76 

113.77 

-3.71 

-3.69 

-3.69 

-3.75 

-4.07 

5d 

3.08 

3.07 

-0.64 

-0.62 

-0.62 

-0.62 

-0.69 

4 f 

8.58 

8.61 

-3.53 

-3.49 

-3.47 

-3.51 

-3.70 

E 

17134.59 

17134.55 

581.69 

571.60 

568.89 

572.54 

725.03 
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TABLE II_ 

Numerical and LCGTO results for the negatives of the nonrelativistic one-electron eigenvalues (in eV) and total 
energies (in Ry) are compared for paramagnetic Au. Scalar-relativistic corrections to the nonrelativistic results 
are listed for various approximations. The LCGTO results were calculated here with a 30s24p18cM2f basis. 

All calculations used the HL model. 


Nonrelativistic Scalar-relativistic corrections 



Num 

LCGTO 

Num 

EFP 

EFPn 

HR 

FPPn 

Is 

73021.73 

73021.79 

7617.36 

7288.75 

7200.23 

7185.95 

11173.46 

2 s 

12187.55 

12187.62 

2034.09 

1993.22 

1982.11 

1977.25 

2428.50 

3s 

2852.37 

2852.44 

504.21 

495.39 

493.16 

491.68 

589.78 

4s 

600.78 

600.81 

127.28 

125.13 

124.64 

124.21 

148.83 

5s 

84.79 

84.79 

23.95 

23.54 

23.46 

23.36 

28.23 

6 s 

4.51 

4.51 

1.67 

1.64 

1.63 

1.64 

2.00 

2 p 

11720.50 

11720.58 

572.48 

573.25 

567.06 

604.16 

589.10 

3 P 

2631.48 

2631.58 

166.33 

167.40 

166.14 

174.19 

170.19 

4 p 

505.55 

505.60 

39.34 

39.70 

39.47 

41.37 

40.23 

5 p 

54.55 

54.55 

5.50 

5.59 

5.56 

5.88 

5.62 

3d 

2217.99 

2218.08 

-16.42 

-17.31 

-17.91 

-18.78 

-22.61 

4 d 

330.12 

330.17 

-4.70 

4.83 

-4.88 

-5.11 

-6.07 

5 d 

8.37 

8.37 

-1.17 

-1.16 

-1.16 

-1.17 

-1.31 

4 f 

94.89 

94.89 

-13.22 

-13.18 

-13.14 

-13.33 

-13.96 

E 

35720.73 

35720.65 

2280.97 

2237.68 

2229.21 

2253.88 

2944.25 


TABLE III_ 

Numerical and LCGTO results for the negatives of the nonrelativistic one-electron eigenvalues (in eV) and total 
energies (in Ry) are compared for paramagnetic Pb. Scalar-relativistic corrections to the nonrelativistic results 
are listed for various approximations. The numerical results are from [35]. The LCGTO results were calculated 
here with a 30s24p18c/12f basis. All calculations used the Xa model with a = 0.7. 


Nonrelativistic Scalar-relativistic corrections 



Num 

LCGTO 

Num 

EFP 

EFPn 

HR 

FPPn 

Is 

78976.75 

78976.27 

8997.12 

8640.22 

8540.96 

8523.55 

13354.48 

2 s 

13312.12 

13312.11 

2421.16 

2377.64 

2365.02 

2358.97 

2913.73 

3s 

3173.88 

3173.94 

604.04 

594.37 

591.82 

589.92 

711.34 

4s 

701.38 

701.40 

153.40 

150.97 

150.40 

149.79 

180.62 

5s 

113.91 

113.91 

29.86 

29.34 

29.25 

29.07 

35.49 

6 s 

8.93 

8.93 

2.59 

2.54 

2.52 

2.50 

3.17 

2 p 

12824.30 

12824.31 

668.90 

677.78 

670.84 

717.56 

699.65 

3 P 

2940.72 

2940.82 

195.58 

198.37 

196.93 

207.12 

202.18 

4 p 

598.97 

599.03 

45.18 

46.29 

46.02 

48.44 

46.86 

5 p 

79.45 

79.45 

5.80 

5.97 

5.94 

6.35 

5.86 

6 P 

3.13 

3.13 

-0.15 

-0.14 

-0.14 

-0.13 

-0.20 

3d 

2503.90 

2503.99 

-20.25 

-21.07 

-21.71 

-22.82 

-27.58 

4 d 

409.46 

409.51 

-7.67 

-7.79 

-7.86 

-8.20 

-9.57 

5d 

23.86 

23.86 

-3.27 

-3.27 

-3.26 

-3.33 

-3.66 

4 f 

152.37 

152.36 

-18.88 

-18.91 

-18.85 

-19.18 

-20.16 

E 

39055.40 

39055.29 

2705.40 

2653.31 

2643.97 

2675.09 

3513.16 
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TABLE IV_ 

Numerical and LCGTO results for the negatives of the nonrelativistic one-electron eigenvalues (in eV) and total 
energies (in Ry) are compared for paramagnetic Pu. Scalar-relativistic corrections to the nonrelativistic results 
are listed for various approximations. The LCGTO results were calculated here with a 30s24p18c/12f basis. 

All calculations used the HL model. 


Nonrelativistic Scalar-relativistic corrections 



Num 

LCGTO 

Num 

EFP 

EFPn 

HR 

FPPn 

Is 

104982.08 

104982.29 

17168.80 

16343.22 

16165.23 

16128.81 

26240.88 

2 s 

18284.96 

18285.20 

4725.81 

4615.31 

4591.57 

4578.44 

5820.87 

3s 

4635.88 

4636.15 

1212.89 

1187.79 

1182.78 

1178.49 

1456.94 

4s 

1177.46 

1177.62 

329.99 

323.28 

322.05 

320.59 

395.44 

5s 

253.73 

253.77 

83.30 

81.56 

81.27 

80.82 

100.07 

6 s 

35.23 

35.24 

14.02 

13.65 

13.60 

13.50 

17.10 

7s 

3.42 

3.42 

0.70 

0.67 

0.67 

0.66 

0.86 

2 p 

17707.84 

17708.08 

1242.85 

1257.35 

1247.14 

1356.24 

1322.06 

3 P 

4351.35 

4351.64 

378.01 

383.73 

381.35 

405.97 

396.19 

4 p 

1045.15 

1045.35 

97.28 

98.98 

98.47 

104.78 

101.40 

5 p 

201.82 

201.88 

20.77 

21.19 

21.09 

22.58 

21.44 

6 P 

21.20 

21.20 

1.37 

1.43 

1.42 

1.63 

1.22 

3d 

3818.07 

3818.34 

-22.69 

-25.11 

-26.25 

-28.45 

-37.85 

Ad 

797.72 

797.90 

-9.62 

-10.11 

-10.27 

-11.03 

-14.03 

5 d 

110.79 

110.82 

-4.91 

-5.01 

-5.02 

-5.21 

-6.10 

4f 

453.88 

453.95 

-37.42 

-37.54 

-37.49 

-38.32 

-40.67 

5 f 

8.43 

8.41 

-6.20 

-6.22 

-6.20 

-6.29 

-6.77 

E 

54004.24 

54003.98 

5106.93 

4999.42 

4983.10 

5056.02 

6828.79 


approximation accounts for roughly 98% of the 
scalar-relativistic correction to the total energy and 
produces corrections to the core-state energies that 
are within 5% of the numerical results, with the 
exception of the Pu 3d energy. More importantly, 
the EFP approximation yields energies for the all- 
important valence states that are within a few 
hundredths of an eV of the values found in the 
numerical calculations. Using the simpler EFPn 
approximation in place of the EFP approximation 
produces a modest deterioration in the one-elec¬ 
tron results that is concentrated in the chemically 
inert core states. On the other hand, the FPPn 
approximation is clearly inadequate for all four of 
the atoms considered. These observations are all in 
agreement with the results of earlier work on Ce 
[16] and Pb [35]. 

Analysis of the results obtained with the HR 
approximation is not so straightforward. For sim¬ 
plicity, focus on the Pu results in Table IV, since 
they exhibit the trends most clearly. The HR ap¬ 
proximation has only a small effect on the rela¬ 
tivistic corrections to the total energy and the s 
and / one-electron energies. (In fact, the shift in 
the total energy due to the HR approximation 


cancels some of the error already existing in the 
more rigorous EFP approximation.) For the Estate 
corrections, the HR approximation results in sig¬ 
nificantly larger percentage errors than does the 
EFPn approximation, but those errors represent 
only a small fraction of the orbital energies them¬ 
selves. The most serious evidence for a potential 
problem with the HR approximation is in the rela¬ 
tivistic corrections for the p states, for which the 
HR approximation gives even poorer results than 
does the simple FPPn approximation. To under¬ 
stand this, consider the vector product terms that 
are neglected in the HR approximation. For a 
spherical potential, 

i dv 

p X vp = ——L. (20) 

r dr 

The radial factors in Eq. (20) blow up near the 
origin and, hence, will primarily affect states with 
small l values. At the same time, L will be zero for 
s states. Thus, the vector product terms, and the 
HR approximation, will primarily affect the p 
states, as observed. The HR approximation does 
not appear to have a significant effect on any of 
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the chemically active states for the four atoms 
under consideration. Given the similarity between 
the vector product terms being dropped in the HR 
approximation and the lowest-order spin-orbit 
coupling term in Eq. {17), the HR approximation 
should be valid for any system in which spin-orbit 
coupling is unimportant. At the same time, if 
spin-orbit coupling is required for a given system, 
there will be no advantage to using the HR ap¬ 
proximation anyway. 

The small basis set results for Au and Pb are 
given in Tables V and VI. Comparison of the large 
and small basis-set results for these two atoms 
reveals that the change in the basis set has a 
noticeable impact on the NR one-electron energies; 
note, in particular, the energy of the 5 p state for 
Au, which is shifted by about 0.4 eV. It is also clear 
that the relativistic corrections to the total energy 
and the one-electron eigenvalues of the core states 
are sensitive to the basis-set size. Nevertheless, for 
the chemically active valence states, there is rather 
good agreement between the relativistic correc¬ 
tions obtained with the large and small basis sets. 
This suggests that the physical properties obtained 
with scalar-relativistic LCGTO calculations should 


be no more sensitive to the orbital basis set size 
than are their NR counterparts. 

In Table VII, the current small basis LCGTO 
results for Pb are compared with previous 
LCGTO-FF calculations using the same orbital ba¬ 
sis set and LDA model [35], The LCGTO-FF work 
did not consider the full EFP approximation used 
here or the HR approximation; hence, results are 
only compared for the NR, EFPn, and FPPn ap¬ 
proximations. There is very good agreement be¬ 
tween the NR results, indicating that a good fitting 
function basis set was used in the earlier work. 
The differences between the scalar-relativistic 
LCGTO and LCGTO-FF results are significantly 
larger. Those differences are, however, once again 
concentrated in the chemically inert core states and 
should not pose any difficulties for multiatom cal¬ 
culations. 

The three questions posed in the Introduction 
can now be answered as follows: 

1. The neglect of matrix elements involving 
vector products of the momentum should not 
present any problems for systems in which 
the spin-orbit coupling is unimportant. 


TABLE V_ 

Numerical and LCGTO results for the negatives of the nonrelativistic one-electron eigenvalues (in eV) and total 
energies (in Ry) are compared for paramagnetic Au. Scalar-relativistic corrections to the nonrelativistic results 
are listed for various approximations. The LCGTO results were calculated here with a 20s15p11d6f basis. All 
calculations used the HL model. 


Nonrelativistic Scalar-relativistic corrections 



Num 

LCGTO 

Num 

EFP 

EFPn 

HR 

FPPn 

Is 

73021.73 

73022.30 

7617.36 

7281.25 

7192.75 

7178.54 

11117.51 

2 s 

12187.55 

12188.15 

2034.09 

1991.51 

1980.41 

1975.56 

2419.90 

3s 

2852.37 

2852.98 

504.21 

494.85 

492.62 

491.15 

587.68 

4s 

600.78 

601.40 

127.28 

124.87 

124.38 

123.95 

148.15 

5s 

84.79 

85.26 

23.95 

23.49 

23.41 

23.31 

28.10 

6 s 

4.51 

4.53 

1.67 

1.65 

1.64 

1.64 

2.00 

2 p 

11720.50 

11721.21 

572.48 

573.21 

567.02 

603.99 

589.17 

3 p 

2631.48 

2632.12 

166.33 

167.29 

166.03 

174.04 

170.10 

4 P 

505.55 

506.17 

39.34 

39.57 

39.34 

41.23 

40.09 

5 p 

54.55 

54.95 

5.50 

5.54 

5.51 

5.83 

5.56 

3d 

2217.99 

2218.55 

-16.42 

-17.47 

-18.02 

-18.89 

-22.68 

4 d 

330.12 

330.54 

-4.70 

-4.94 

-4.99 

-5.22 

-6.19 

5 d 

8.37 

8.39 

-1.17 

-1.19 

-1.18 

-1.19 

-1.33 

4 f 

94.89 

95.19 

— 13.22 

-13.31 

-13.26 

-13.46 

-14.10 

E 

35720.73 

35719.90 

2280.97 

2236.13 

2227.66 

2252.24 

2933.86 
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TABLE VI_ 

Numerical and LCGTO results for the negatives of the nonrelativistic one-electron eigenvalues (in eV) and total 
energies (in Ry) are compared for paramagnetic Pb. Scalar-relativistic corrections to the nonrelativistic results 
are listed for various approximations. The numerical results are from [35]. The LCGTO results were calculated 
here with a 20s17p11c/6f basis. All calculations used the Xa model with a = 0.7. 


Nonrelativistic Scalar-relativistic corrections 



Num 

LCGTO 

Num 

EFP 

EFPn 

HR 

FPPn 

Is 

78976.75 

78976.79 

8997.12 

8631.51 

8532.30 

8515.90 

13281.58 

2 s 

13312.12 

13312.67 

2421.16 

2376.22 

2363.60 

2357.86 

2903.21 

3s 

3173.88 

3174.49 

604.04 

593.93 

591.38 

589.58 

708.83 

4s 

701.38 

701.85 

153.40 

150.86 

150.29 

149.71 

179.97 

5s 

113.91 

114.25 

29.86 

29.37 

29.27 

29.10 

35.42 

6 s 

8.93 

8.97 

2.59 

2.54 

2.54 

2.50 

3.17 

2 P 

12824.30 

12824.82 

668.90 

675.69 

668.75 

713.45 

695.88 

3 P 

2940.72 

2941.33 

195.58 

197.84 

196.40 

206.07 

201.22 

4 P 

598.97 

599.44 

45.18 

46.17 

45.89 

48.18 

46.63 

5 P 

79.45 

79.76 

5.80 

5.96 

5.93 

6.32 

5.84 

6 P 

3.13 

3.12 

— 0.15 

-0.14 

-0.14 

-0.13 

-0.20 

3d 

2503.90 

2504.49 

-20.25 

-20.86 

-21.50 

-22.54 

-27.20 

4 d 

409.46 

409.79 

-7.67 

-7.74 

-7.80 

-8.12 

-9.47 

5 d 

23.86 

23.92 

-3.27 

-3.23 

-3.23 

-3.29 

-3.61 

4 f 

152.37 

152.61 

-18.88 

-18.88 

-18.83 

-19.13 

-20.11 

E 

39055.40 

39054.66 

2705.40 

2649.61 

2640.28 

2670.00 

3496.23 


TABLE VII_ 

LCGTO and LCGTO-FF results for the negatives of the nonrelativistic one-electron eigenvalues (in eV) and total 
energies (in Ry) are compared for paramagnetic Pb. Scalar-relativistic corrections to the nonrelativistic results 
are listed for two approximations. The LCGTO-FF results are from [35] and are indicated by FF. All 
calculations used the 20s17p11 c/67 basis and the Xa model with a = 0.7. 


Nonrelativistic Scalar-relativistic corrections 



LCGTO 

LCGTO-FF 

EFPn 

EFPn-FF 

FPPn 

FPPn-FF 

Is 

78976.79 

78977.31 

8532.30 

8531.04 

13281.58 

13279.32 

2 s 

13312.67 

13312.75 

2363.60 

2363.22 

2903.21 

2902.52 

3s 

3174.49 

3174.50 

591.38 

591.27 

708.83 

708.62 

4s 

701.85 

701.84 

150.29 

150.26 

179.97 

179.91 

5s 

114.25 

114.25 

29.27 

29.26 

35.42 

35.39 

6 s 

8.97 

8.97 

2.54 

2.53 

3.17 

3.16 

2 p 

12824.82 

12825.03 

668.75 

671.07 

695.88 

700.14 

3 p 

2941.33 

2941.38 

196.40 

197.03 

201.22 

202.36 

4p 

599.44 

599.45 

45.89 

46.05 

46.63 

46.91 

5 P 

79.76 

79.75 

5.93 

5.87 

5.84 

5.90 

6 p 

3.12 

3.12 

-0.14 

-0.15 

-0.20 

-0.20 

3d 

2504.49 

2504.50 

-21.50 

-21.60 

-27.20 

-27.38 

4 d 

409.79 

409.79 

-7.80 

-7.83 

-9.47 

-9.52 

3d 

23.92 

23.92 

-3.23 

-3.24 

-3.61 

-3.62 

4 f 

152.61 

152.61 

-18.83 

-18.85 

-20.11 

-20.14 

E 

39054.66 

39054.74 

2640.28 

2641.92 

3496.23 

3499.20 
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2. The valence (bonding) states from scalar-rela¬ 
tivistic LCGTO calculations should be no 
more sensitive to orbital basis set size than 
are their NR counterparts. 

3. Although the use of fitting functions has a 
somewhat larger impact on scalar-relativistic 
calculations than on NR calculations, this ef¬ 
fect is confined to the deepest core levels and 
should not affect the properties that are of 
interest in most multiatom calculations. 
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ABSTRACT: The aim of the present study is to extend the multiconfiguration 
Dirac-Fock approach to include spin polarization in order to study magnetically ordered 
systems. Routines handling time-dependent ionization phenomena adapted to multilevel 
and multichannel autoionization processes have been developed and used to describe the 
giant resonance in the Gd 3+ absorption spectrum. In our calculation of the complete 
energy structure of 4d 9 4f 8 configuration it is discovered that the majority of the 
oscillator strength associated with the 4d-4f transitions is limited to a relatively narrow 
spectral width, creating a giant resonance. Our calculations take account of the most 
important correlation effects, and the agreement with the experimentally observed giant 
resonance in gadolinium is quite good. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 
575-584, 1997 


I n this study, we demonstrate the applicability 
of the spin-polarized multiconfiguration Di¬ 
rac-Fock (SPMCDF) formalism for an accurate de¬ 
scription of magnetic dichroism in rare-earth 
atoms. The motivation for applying the SPMCDF 
method is our desire to use the exact form of the 
exchange potential essential for a correct calcula¬ 
tion of the core spin polarization and ferromag¬ 
netism in general. As far as we know, our work is 
the first attempt to apply the Dirac-Fock formal¬ 
ism in combination with multiconfiguration effects 
[1] to describe magnetic dichroism on an atomic 
level. A multiconfiguration description enables one 
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to introduce the multiple structure of relativistic 
terms with different total angular momenta. More¬ 
over, this approach is designed to treat spin polar¬ 
ization and relativistic effects on the same footing 
by including the exchange potential dependence 
on the orientation of the electron spin. Magnetic 
dichroism in its circular and linear forms is caused 
by the reduced symmetry, compared to the para¬ 
magnetic state, due to the simultaneous effects of 
spin-orbit and spin polarization. 

Besides being valuable in the study of ferro¬ 
magnetic materials, magnetic dichroism also gives 
spectroscopic information about the electronic 
structure of free atoms and ions embedded in a 
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crystal. We have found that it is instructive to 
begin the study of magnetic dichroism assuming a 
free atom or ion. Moreover, there is experimental 
evidence that the total absorption cross section of 
rare-earth metals for photon energies near the 4 d 
thresholds [2] is nearly identical with the corre¬ 
sponding quantity measured in the gas phase [3]. 
This motivates one to consider magnetic dichroism 
in an atomic model. 

A number of relativistic theories have been de¬ 
veloped recently to describe magnetic dichroism in 
magnetic solids [4-8] using the local density ap¬ 
proximation. In particular, the L 2 3 edges of solid 
Gd have been studied within a multiple scattering 
formalism which includes finite lifetime effects for 
the excited states [9]. 

Here we consider a free Gd 3+ ion, which has 
been obtained by removing the three valence elec¬ 
trons (5d6s 2 ) from the Gd atom. The remaining 
ion has a high-spin 4/ 7 state as the ground config¬ 
uration. Spin polarization of the 4/ orbital and as 
well as a large spin-orbit splitting of the 4d 3/2 and 
4d 5/2 core hole states are essential ingredients for 
predicting strong magnetic dichroism in the Gd 3+ 
spectrum. We wish to describe theoretically the 

transitions in Gd 3+ from the ground 4d 10 4/ 7 8 S 2/2 
to the upper 4d 9 4/ 8 states. Experiment [10, 11] 
suggests that these transitions create the key fea¬ 
tures in the rare-earth atomic spectra: giant reso¬ 
nances. We will show that indeed the 4d-4/ tran¬ 
sitions of core electrons in the Gd 3+ ion display 
resonance profiles similar to those obtained experi¬ 
mentally. Hence, for both rare-earth atoms and 
metals the strong overlap of the 4/ and 4rf orbitals 
makes the giant resonances an important proving 
ground for many-electron effects. 

A decade ago considerable progress was made 
toward understanding the essence of giant reso¬ 
nances in absorption spectra of rare-earth ele¬ 
ments. A compilation of review articles can be 
found [12]. In many cases, relativistic time-depen¬ 
dent local-density approximation (RTDLDA) [13] 
as well as random-phase approximation with ex¬ 
change (RPAE) [14] and many-body perturbation 
theory (MBPT) [15] were applied successfully. 

We focus particular attention on reproducing 
the Gd 3+ total cross section in the 140- to 180-eV 
energy range. We want to demonstrate the forma¬ 
tion of the giant resonance from the series of asym¬ 
metric absorption lines, all determined by 
4d 10 4/ 7 -4d 9 4/ 8 excitation, or otherwise mainly by 


4d 9 4/ 8 -4d 9 4/ 7 £/, 4:d 10 4:f 6 sl autoionizing decays. 
The SPMCDF matrix elements and the resonance 
energies are used to solve time-dependent Floquet 
equations simulating the Gd 3+ absorption cross 
section. This approach is a generalization of the 
theories of Fano [16] and Mies [17]. 

The study is organized as follows. In the first 
section a description of the SPMCDF theory is 
given. For the sake of completeness, a brief deriva¬ 
tion of the relativistic formalism used to calculate 
autoionization and photoionization transition ma¬ 
trix elements is presented in the second section. 
The formalism of the time-dependent Floquet the¬ 
ory describing the appearance of the Fano-type 
resonances in absorption spectra is given in the 
third section. The absorption cross section and for 
circular and linear magnetic dichroism in Gd 3 + 
spectrum are presented in the fourth section to¬ 
gether with a comparison with experimental re¬ 
sults [11]. 


Spin-Polarized Dirac-Fock Theory 

The theory is modeled on the basis of the 
Dirac-Fock approximation with an exact descrip¬ 
tion of the exchange potential. We have con- 
structed the central-field spin-polarized Dirac-Fock 
(SPDF) equations by selecting all relativistic atomic 
orbitals in two groups with opposite directions of 
electron spin. The total energies have been taken 
as the center of gravity of relativistic terms. Aver¬ 
aging energies helps to break the // coupling and 
partly implement the LS coupling. We also use the 
fractional occupation of open relativistic subor¬ 
bitals, so that the SPDF solutions can be reduced 
to the spin-polarized Hartree-Fock solutions in the 
limit of c -> oo. 

In the spin-polarized relativistic approximation, 
the radial parts P(r ) and Q(r) of wave function 
become dependent on both the projections of the 
total angular momentum ] 7 (with quantum num¬ 
ber m ) and the electron spin /x. The solutions of 
these equations can be identified by the mixture of 
the quantum numbers m and /x, so it is hard to 
distinguish solutions related to a particular spin 
projection. To avoid construction of a large num¬ 
ber of equations and to focus more on the spin- 
polarization effects, we assumed the radial func¬ 
tions P(r) and Q(r) to be independent of the 
quantum number m. This is justified because the 
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m splitting is much smaller than that of spin polar- 
ization. 

The spin-polarization interaction mixes states 
with different total angular momenta j and creates 
new states, described by the linear combinations of 
;*! = l + \ and j 2 = l - \ [20] for a given /x. We 
obtain from two relativistic orbitals, j x and ] 2 , two 
linear combinations by a unitary transformation. 
Coefficients in the linear combinations are found 
from the minimization of the total energy. 

In this approximation, the atomic wave function 
is a four-component Dirac spinor 

P A r ^X K ,m^ + \ 

iQA(r)x- K , m (V + iQ^X.+Umd') J 

( 1 ) 



dQ 


A 


_f^ + Z ~ Y M P _I X Q 

dr r A \ 1c cr I A r A 


d_h 

dr 


K — 1 


sx Z - Y* 
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2 c — — + 
2 c 


cr 
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-x p 

a A / 

■y 


dQ 
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— K — 1 


dr 


qa - 1 ^ + 


z - y 


A 


cr 


JL _ 

--xp. 


(3) 


where we have 


with A = (k, /x), and A ~ k — 1, /x); m (r) is a 
two-component spin-angular function (i.e., a Pauli 
spinor). P A and Q A are the large and small compo¬ 
nents of the wave function, respectively. 

The quantum numbers k (with k — ±1, ± 
2 ,...) are related to the orbital and total angular 
momentum quantum numbers l and j by 


*A = 


E (*?A' ^AA'^XvA' 

A' 

+ E<i+ «,*>£ 7 ^*. 

A s>0 


xF = 


^AA' 


1 ^ ^ ^ 

- E E^XvaQa'-^ E 

C A'#A s ?A A'^ A ^ CC lA 


j -\k\- l = / + s k /2, s k = #c/|#c|. (2) 


The set k = 1,2,3,... is sometimes denoted as 
1P\ / 2 / ^ 3 / 2 r fb / 2 / * * • and k 1, 2, 3,... corre¬ 

sponds to S 1/2 , p 3 / 2 / d $/ 2 / . ♦ . 

The trial functions are then constructed from 
the wave functions ip(r) so that they belong to a 

given configuration (w^i/x^ ••• (n a l a fJL a ) q . The 
former of these means that the set of one-electron 
functions ( 1 ) is made up of functions belonging to 
the spin-polarized subshells, where are 

numbers of electrons with a specified direction of 
spin, belonging to the subshells 1 ,..., a. 

Once the class of functions to be varied is cho¬ 
sen, the subsequent calculations are performed in 
two stages. In the first stage, /x-dependent expres¬ 
sions for the total energy are constructed for the 
centers of gravity of relativistic configurations. The 
second stage of the calculation involves a search 
for the extremals of the energy functional, which 
leads to a system of coupled differential equations 
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cr 



x a q 
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= r j “7TT [ A.A' + QaQaA dr 1 , 
J T > 


(4) 


and Z is the nuclear charge. Here, atomic units are 
used, i.e., h, Planck's constant, the electron mass, 
the charge are numerically equal to 1. Indices A 
and A label different atomic subshells in the SPDF 
approximation. The parameters s and A are the 
Lagrange multipliers when the normalization and 
orthogonalization conditions are taken into ac¬ 
count. The functions f s and g s are defined by the 
electron orbital [18]. To solve Eq. (3) we apply the 
self-consistent field approximation. The ideas un¬ 
derlying the SPDF self-consistent field theory are 
similar to those in the Dirac-Fock (DF) theory. A 
numerical solution was performed by transform¬ 
ing the differential equations to a logarithmic scale 
p = In r. A uniform grid in p is taken between 
p min and p max , corresponding with r min = 0.1 X 
e~ A /Z and r max = 80 a.u. 

Since the interaction between an atom and the 
electromagnetic field is represented by a one-par¬ 
ticle operator, the corresponding matrix element 
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can be written in the one-electron form 


v kk = <^in t l%> 


^A' Xk', rri 

in fc l 

P\ Xk , m ^ 

^ iQk X- k', rri 

^Qa X - k , m i 

P\ f X— k' — 1, m' 

in*l 

P\ X— k — 1, m 

i'Qk Xk 1 + 1, m r 

iQk Xk + 1 ,m 

P% X— k’ —1, rri 

in,l 

P 

^ A Xk , m 

^ Qa' Xk 1 + 1, rri 

^Qa X- k , m j 

X k ', rri 

inj 

P* v 

1 A a - k — 1, m 

iQjri X-k', rri 

Xk +1, m J 


where we have U k = j k (co/c, r)Y kAm . The terms in 
the above expression can be written as radial and 
angular parts, for example, 


^A' Xk', w! 
1 Qk X~k', rri 


n 


Xk, m 
1 Qa X-k, 


m 


I 


(2k + 1)(2 j K + 1) 
(2 /a + i) 


X E (R A (y)\j k \R A (y»' 

y= ± 1 


Cjyml,,k Am Q A o] j* 1 /2 

( 6 ) 


where Cj]Z\, k a m are Clebsch-Gordan coefficients; 
,R(y) is the radial part of a wave function, defined 
through the large P A (r) and small Q A (r) compo¬ 
nents of the wave function 


<R A (y)|/*|R A ,(y)> 


I P A (r) if y = 1 
| Q a ( r ) if y = — 1, 


with the condition that (l A + l A + k) is even, and 
j k is a spherical Bessel function. 

The total matrix element can be represented by 
a linear combination of the matrix elements in Eq. 
(5). 

V ab = E T,C A C A .V AX , (8) 

A A' m 


where the matrix elements v AA are generated by a 
one-electron dipole transition between states A, A 
and A', A, included as components in the initial 


and final levels, respectively. The coefficients c A 
and c A reflect the multiconfiguration character of 
terms and are determined by solving the multicon¬ 
figuration problem with the Dirac-Fock basis set 
(MCDF) [1]. In SPMCDF, the Dirac-Fock spin- 
polarized orbitals in Eq. (1) were used as the basis 
functions in Eqs. (5) and (8). The dipole selection 
rules for the different polarizations of external 
field are determined by the Clebsch-Gordan coef¬ 
ficients. 


Transitions to the Continuum 

The formalism for the calculation of photoion¬ 
ization matrix elements is similar to that between 
discrete levels. For the description of the contin¬ 
uum state we have limited ourselves to a single 
configuration as the correlation interaction be¬ 
tween the continuum states is assumed to be negli¬ 
gible. The outer (free) electron with a well-defined 
orbital momentum and kinetic energy is consid¬ 
ered to be moving in a core potential. The core 
potential is found by a SPDF calculation of the 
core orbitals in either the ground or excited state. 
A numerical integration leads to a continuum 
two-component wave function $ with the correct 
phase behavior. We matched the wave function <£ 
to Coulomb functions between 10 and 20 a.u., with 
a charge appropriate for the remaining ion to nor¬ 
malize the wave functions. The Coulomb regular F 
and irregular G functions were calculated using 
continued fractions [19]. 

The many-body photoionization matrix ele¬ 
ments of Eq. (8), assuming a single configuration 
for the continuum, reduces to 

Vmi-EEv*,.,. (9) 

A m 

where a labels the initial discrete state, and i 
labels the core orbital. Here the label s combines 
both the kinetic energy and the orbital angular 
momentum of the free electron. The one-electron 
matrix element v A ei is described by Eq. (5), but 
now calculated with continuum wave functions. 

The autoionization matrix elements describe the 
transition between an autoionizing atomic state a 
and an ion, which need not be in the ground 
configuration, plus a free electron. This process 
involves separating two electrons in the outer shell 
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and is mediated by the Coulomb repulsion be¬ 
tween those electrons. Within the Dirac-Fock 
framework each electron can be assigned an inde¬ 
pendent wave function. Calculating the many-elec- 
tron autoionization matrix elements therefore re¬ 
duces to the evaluation of two-electron matrix 
elements. If the electrons participating in the tran¬ 
sition are labeled A and B for the atomic state a 
and C and D for the continuum configuration, 
where D defines the escaping electron, then the 
matrix element is given by 


< AB\l/r u \CD) 


= JoTa + D(2; b + 1X2 ic + 1X2 ; D + D 
l 


pO 


2 C j A -l/2,j c l/2 


xE- 

v (2v + 1 ) 

xCJ- b °_ 1/2Jd1/2 R”(A,B,C,D) 

X E E E^ F (-D mc+ ”' D+1 

m A ,m B m c ,m D a 

X C v ( m A~ m c) rHm B -m D ) 

jA m A Ac~ m C jB m B'jD- m D' 


( 10 ) 


where the c^ F 's describe the many-electron Dirac- 
Fock wave function in terms of Slater determi¬ 
nants. The radial integrals have the usual form 

R V (A, B,C, D) 



X v+l T Qe( r 2^Qo(^2 

(ID 


The configuration interaction (Cl) matrix element 
for autoionization is then subsequently found as a 
linear combination of these two electron matrix 
elements with coefficients given by the Cl calcula¬ 
tion for the a level. 


Calculation of Absorption 
Cross Sections 


correlated to and interacts with its own contin¬ 
uum. The total wave function of such a system 
evolving under influence of a light field of fre¬ 
quency o) is represented as a linear combination of 
atomic wave functions, as calculated in the two 
preceding sections, which are either discrete if/ k or 
continuum ip ei wave functions. Note that, in this 
section the subscripts k, n = 0,1,2,... label dis¬ 
crete states, while the subscript i labels the various 
continua, i.e., the various core orbitals, and s is 
the kinetic energy of the free electron. Therefore, 
we have 


^ = c 0 (t)ijj 0 e l<t>t + c k (t)$ k 

k* 0 

+ E J ds c ei (t)ifj ej , (12) 

ie continua 


where the coefficients c describe the light-induced 
mixing of states. The dynamics of the wave func¬ 
tion is given by 


i 


dV 

dt 




(13) 


where Hf includes the atomic and relativistic 
atom-field interaction 


H /= H 0 - E«yAy(0/ (14) 

) 

A -(f) is the relativistic vector potential of the ex¬ 
ternal field, a ■ is a Pauli matrix, and the sum j is 
taken over all electrons. Inserting Eq. (12) into Eq. 
(13), integrating over all electron coordinates r, 
and using the rotating wave or resonant approxi¬ 
mation to the time-dependent differential equa¬ 
tions, the coefficients c are given by 

d 

i J t c„(0 - E n c n (t ) 

= L c k (t)v kn + E jdec £i (t)V siin , 

k^n i g continua 

(15) 


The nature of giant resonances is complicated 
due to the simultaneous presence of a multiplet 
structure and discrete and continuum contribu¬ 
tions. Any model for the frequency dependence of 
the resonances must have a discrete initial state ifj Q 
and a number of final states embedded in the 
autoionizing region. Moreover, each final state is 


d ^ 

i^c E ,(0 - E ei c ej (t) = T,c k (t)V k , e ,, 

Jc 


where the index n denotes either the initial or final 
state and the energies E are the atomic energies 
except for the initial state where E 0 equals the 
initial atomic energy plus the photon energy co. 
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The matrix elements V denote the coupling 
strength between the various levels. In fact, the 
V si k lead to field-induced photoionization (k = 0) 
or autoionization (k > 0). 

For the transition between the initial and upper 
state it follows that we have V k=0n = d k=0 n F f 
where the d 0n are the dipole transition matrix 
elements and F the field strength. Similarly, for 
the field-induced photoionization, we have V ei k=Q 
= d ei k=0 F / where the d ei0 denote dipole transi¬ 
tion matrix elements into the continuum. On the 
other hand, the nonzero autoionizing matrix ele¬ 
ments V sik (for k > 0) are independent of the laser 
field and arise from CI. Moreover, the matrix ele¬ 
ments V kn for k,n > 0 can in principle be nonzero. 
This is due to the configuration interaction be¬ 
tween two excited states. However, our level 
structure calculation has already taken those ef¬ 
fects into account and thus we have V kn = 0 for 
k,n> 0. Finally, as correlation between continua is 
weak, the matrix elements V £ie > r are negligibly 
small and therefore have been omitted from Eq. 
(15). 

The major difficulty in solving Eq. (15) by 
straightforward integration over time t is caused 
by the kinetic energy s . In fact, Eq. (15) forms an 
infinite set of coupled differential equations, which 
are complicated by the fact that the matrix ele¬ 
ments V si k have a weak dependence on s such 
that any introduction of an energy cut-off will lead 
to a theory with poor convergence characteristics. 
Therefore we applied the Floquet approach which 
transforms the differential equations into a set of 
integral equations with the property that the inte¬ 
gration over s can be solved analytically. In the 
Floquet method the Laplace transformation 

-CO 

c ( p) = / dtc (t)e pt , (16) 

•'o 

where p is either k or s if reduces Eq. (15) to the 
form, 

[ip - E n \c„(p) - Ec*(p)V tll 

k^n 

- Y, f d£C ei V ei,n = *C„(0), 
i e continua 

{'P ~ e Jc £ ,- - Ec t (p)V tie , = ic ei ( 0), (17) 

k 

where c n ( 0) and c /;/ (0) describe the population am¬ 
plitudes of all levels at time t = 0, and are given 


by 

c 0 ( 0) = 1; c k ( 0) = c ei ( 0) = 0 (18) 


for k > 0. In deriving Eq. (17), it is assumed that 
the autoionizing and CI matrix elements V are 
time-independent. 

Equation (17) can be rewritten by eliminating 
c ei . The second equation of Eq. (17) gives 


c,i( p) = 


S* Ck(p)Vk, si 

*P ~ E ei 


(19) 


and is inserted into the first equation of Eq. (17). 
The integral over e can now be solved with Dirac's 
formula to give 


W V • 

k, ei v ei, n 
¥ - E ei 


P - ™Vk,,y,i.nr (20) 


where V k> si is a matrix element at the point s = 
~E th/i + ip, where we have E ei = E th i + s, and 
E thfi is the atomic energy at threshold. Because 
V eif n has weak energy dependence, the principal 
part P is small and will be neglected henceforth. 
Moreover, this weak dependence allows one to 
replace V si n with its value at resonance when 
E ei = E n . Therefore we get, for the discrete levels, 

lip - £„ + iir £ I K h „\ 2 )c n (p) 

' /e continua ' 

- L (v kn - ITT E Vk.siKi.n) = »C„(0). 

k^n' i‘e continua ' 

( 21 ) 


For each p Eq. (21) forms a finite set of algebraic 
equations of form (ip - A)c(p) = ic( 0) over the 
discrete levels n, where —A denotes the matrix 
formed by the terms in Eq. (21), which are inde¬ 
pendent of p. In general, the solutions of the 
algebraic equations are given by 


c «(p) = LMox/p - a)J, 


A V-1 _ D nk(ip) 

(ip — A) nk — — 


( 22 ) 


where D kn (ip) and D are the minors and the 
determinant of the matrix (ip — A), respectively. 
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The inverse Laplace transformation of Eq. (22) gives 
us the time-dependent solutions of Eq. (21) and 
after some algebra we find 


c,,(£) = *'E 


D n0 (X t )e x *‘ 




(23) 


From the population P n (t ) = |c n (OI of a level, the 
partial photoabsorbtion cross section is found as 


cr n (co) = — P n (r )/(tF 2 ), (24) 

wc 


where r is the pulse duration and F the field 
strength. The method was originally developed for 
strong field applications [21]. 


Results 

The absorption cross section of Gd 3+ as a func¬ 
tion of the photon energy is calculated using the 
resonant energies and the dipole matrix elements 
of transitions, from the ground state of the 4 d 10 4/' 7 
configuration to the states of the 4d 9 4/ 8 configura¬ 
tion, which lie in the range of photon energies 
between 139 and 180 eV. The Gd 3+ ground state is 
a high-spin state with seven aligned 4/ electrons 
and zero orbital angular momentum, i.e., 8 S 7/2 . 
The absorption selection rule A/ = ±1,0 permits 
transitions to states with total angular momentum 

Tf = 5 7 9 

J 2' 2 1 2’ 

The MCDF method [1] was applied to calculate 
initial and final states in the intermediate-coupling 
regime. The Cl procedure included 7 relativistic 
configurations, belonging to the 4 d 10 4/ 7 configu¬ 
ration and describing the initial states and 14 rela¬ 
tivistic configurations describing the final states of 
the 4d 9 4/ 8 configuration where seven and eight 
electrons were divided in all possible ways over 
the 4 f 5/2 and the 4/ 7/2 relativistic orbitals. Table I 
presents the resonant energies of the strongest 
lines in the Gd 3+ spectrum which contribute to the 
giant resonance and correspond to transitions from 
the lowest level of the 4d 10 4/ 7 configuration. Other 
lines (530 lines in the range between 140 and 160 
eV) have strengths smaller by a factor of 2 or more. 

The total absorption cross section is summed 
from three partial cross sections found for transi¬ 
tions to states with total angular momenta quan¬ 
tum numbers } f = §, §, § of the 4d 9 4/ 8 configura¬ 
tion. Because there is no interference between 


TABLE I_ 

Resonant energies of absorption lines used to 
construct the giant resonance in the Gd 3+ spectrum. 


J' of final state 

Resonant energy in eV 

9 

2 

140.444 

7 

2 

141.856 

9 

2 

144.424 

7 

2 

144.537 

5 

2 

144.622 

9 

2 

144.787 

5 

2 

153.990 

5 

2 

154.028 

5 

2 

154.092 

5 

2 

154.129 

7 

2 

154.972 

7 

2 

155.052 


channels belonging to different /', we solved the 
multichannel equations (21) for each }' separately. 

For complex atomic systems, most of the au- 
toionizing states have different core states and 
therefore decay to different continua. This plays an 
important role in the formation of the absorption 
spectrum of the Gd 3+ ion. 

Figure 1 presents the theoretical results for the 
absorption resonances induced by transitions to 
the three levels with /' = §, while Figure 2 shows 
the absorption lines of transitions from the ground 
state to states with /' = \ (four levels) and }' = § 
states (five levels). The later transitions comprise 
considerably more oscillator strength than the /' 


0.6 



~i- r -f- 1 i- 1 -i- 1 -r 

140 142 144 146 148 

Photon energy (eV) 


FIGURE 1 . MCDF absorption cross section for J - \ 
to J r = | resonant transitions between the ground state 
and states of the 4d 9 4f 8 configuration. 
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o 


Photon energy (eV) 

FIGURE 2. MCDF absorption cross section for J = \ 
to J' = f, \ resonant transitions between the ground 
state and states of the 4d 9 4f 8 configuration. 

= | lines. The main reason for the significant 
differences in the cross sections and its strong 
dependence on /' can be found in the multiconfig- 
urational distribution of the oscillator strengths 
among the levels of the 4d 9 4/ 8 configuration. 

In the calculation, the full set of final states with 
the same total angular momentum /' are mixed; 
the results are plotted in Figure 3. The cross sec¬ 
tions turn out to be given by the sums of the 
individual lines (cross sections are plotted in Figs. 
1 and 2) to within 1%. The total cross section in 
Figure 3 determines the observable giant reso¬ 
nance. The maximum of this resonance lies at a 
higher photon energy (6.5 eV higher) than the 
experimental peak resonance at 148.86 eV [10, 11]. 
Figure 3 also displays several weak absorption 


lines preceding the giant resonance in the range 
from 140 to 146 eV. These satellites contain a small 
fraction of the oscillator strengths, but they are 
very characteristic features of the absorption spec¬ 
trum, observed in core transitions of atomic 
gadolinium [10] and in solid gadolinium [11]. 

In order to compare the theoretical profile of 
Figure 3 (solid line) with the experimental reso¬ 
nance from [11], the theoretical cross section was 
shifted toward lower photon energies by 6.5 eV. 
The result is presented in Figure 4 together with 
the experimental profile. Since the experimental 
absorption cross section is given in relative units, 
we normalized it to our absolute theoretical total 
cross section to achieve the best comparison. Fig¬ 
ure 4 shows that the MCDF calculation (solid line) 
describes the experimental spectrum (round dots) 
quite well. The giant resonance in both cases has a 
single Fano-type profile with almost the same 
width and asymmetry parameter q. The results 
indicate that the theoretical approach of the MCDF 
method is quite appropriate for an accurate de¬ 
scription of the Gd absorption spectrum. 

The MCDF results discussed above yield good 
agreement with the experiment, but do not include 
correlations due to spin polarization. However, 
this correlation is important in calculating mag¬ 
netic dichroism. In order to introduce spin polar¬ 
ization we included in the model the alignment of 
all electron spins in the 4/ 7 orbital (4/ 5 5 /2 T T T T T 

T 4 / 7 / 2 1) of the ground state of Gd 3+ . The spin 
alignment of orbitals in the upper configuration 

4d 9 4/ 8 is introduced as ^d\ /2 T4/ 5 6 /2 T T T T T T 
4 / 7/2 t i • Spin polarization of the 4/ orbital and 




Photon energy (eV) 


FIGURE 3. MCDF total absorption cross section for 
J = \ to J' — §, f, f resonant transitions summed over 
final states (dashed lines) and the total absorption cross 
section (solid line). 



Photon energy (eV) 


FIGURE 4. Theoretical (solid line) and experimental 
(open circles) total absorption cross section in Gd 3+ . 
The theoretical cross section is shifted by 6.5 eV toward 
smaller photon energy. 
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the spin-orbit splittings of both 4 d and 4/ orbitals 
lead to the strong magnetic dichroism in Gd. In 
other words, magnetic dichroism appears when 
one can observe multiplet structure within the 
resonant profile. Our results show that the multi¬ 
plet structure in Gd 3+ is due to the fine splitting of 
final states. 

In the spin-aligned model we consider transi¬ 
tions from the ground state with J = \ and nij = \. 
First, we assume that the external field is either 
left- or right-circular-polarized. The selection rule 
A m — +1 for circular-polarized light allows tran¬ 
sitions from the ground state to /' = § levels only 
for left-circular-polarized light and to /' = f, f 
levels with right-circular-polarized light. The pro¬ 
nounced polarization dependence of the calculated 
spectrum for left- and right-circular-polarized radi¬ 
ation can be seen in Figure 5. 

The dashed lines in Figure 5 describe the total 
absorption cross section for left- and right- 
circular-polarization. The cross section for right- 
circular-polarized light is much higher than for 
left-circular-polarized light. The solid line in Fig¬ 
ure 5 is the difference between the two cross sec¬ 
tions and defines circular dichroism. 

Linear dichroism, defined as the difference of 
the cross section for x-linear and z-linear polar¬ 
ized light, gives complementary information about 
Gd 3+ . The cross section for x-linear polarized light 
is half of the sum of cross sections for left- and 
right-circular polarized transitions while the cross 
section for z-linear polarized light, with selection 
rule Am = 0, allows transitions to J' = \ , f, but 
not to /' = f levels. The resulting spectra are 
shown in Figure 6. 



FIGURE 5. Total absorption cross section for left and 
right circular polarization (dashed lines), magnetic 
circular dichroism (solid line). 



FIGURE 6. Total absorption cross section for x- and 
z-linear polarization (dashed lines), magnetic linear 
dichroism (solid line). 


Conclusion 

We have found from MCDF calculations that 
the giant resonance in the Gd 3+ spectrum between 
135 and 160 eV has multiplet structure. This struc¬ 
ture is described by the spin-orbit splittings of the 
4d 9 4/ 8 final configuration. Moreover, partial ab¬ 
sorption asymmetric lines of different total angular 
momentum differ significantly. This allows the 
calculation of the dependence of the cross section 
on atomic and light polarization and thus the cal¬ 
culation of magnetic dichroism in Gd 3+ . 

Comparison of the theoretical and experimental 
results shows that we describe the resonance shape 
accurately, whereas the position of the resonance 
is obtained with a 4% accuracy. We believe that 
the distribution of the oscillator strengths will im¬ 
prove when additional correlation beyond Cl with 
only the 4d 10 4/ 7 and 4d 9 4/ 8 configurations is 
introduced. 

The SPMCDF calculation of the absorption cross 
section in Gd 3+ shows a strong dependence of the 
cross section on the polarization of light. The mag¬ 
netic dichroism for the circular polarization is 
nearly maximal. 
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ABSTRACT: The differential scattering cross section (DCS) for electrons scattered 
elastically by neon is studied using a model potential. The short-range part of the 
effective potential is represented by the Chebyshev approximation and the long-range 
polarization potential is represented by an energy-dependent function. The computed 
differential cross section obtained using the approximate effective interaction potential 
for electrons scattered by neon atoms in their ground state in the 50-100 eV energy range 
is compared with available published results. In the present study, a nonrelativistic 
Hartree-Fock wave function of the target atom is used to compute the short-range static 
part of the interaction potential and the parameters contained in the energy-dependent 
function are determined by the minimization of the DCS with respect to the angle and 
the incident energy. The resulting DCS in the angular range 2° < 0 < 178° is found to be 
in good agreement with the available experimental and theoretical results in the 
intermediate energy range. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 585-590,1997 


Introduction 

T he differential cross section of electrons scat¬ 
tered by atoms and selected molecules has 
been a topic of considerable interest as evidenced 
by a large number of experimental and theoretical 
work published over the last decade. The theoreti¬ 
cal work has progressed mostly along the applica¬ 
tion of the R-matrix theory [1], while the semiem- 

Contract grant sponsor: Grants and Research Funding Com¬ 
mittee of the Southeast Missouri State University. 

International Journal of Quantum Chemistry, Vol. 65, 585-590 (1997) 
© 1997 John Wiley & Sons, Inc. 


pirical calculations based on the optical potential 
models [2] with or without adjustable parameters 
have increased over the last few years owing to 
their simplicity and the ease of computations lead¬ 
ing to the differential scattering cross section (DCS) 
over a wide range of incident energy. In the pres¬ 
ent study, the model potential is based on the 
dipole polarizability a d of the target atom and two 
parameters contained in the energy-dependent po¬ 
larization potential determined by minimizing the 
DCS at the critical angle [3] with the condition that 
the computed integral cross section agrees with 
the experimental value of the integral cross section 

CCC 0020-7608 / 97 / 050585-06 





PAIKEDAY 


for two selected incident energies. The significance 
of the DCS minimum at the critical angle can be 
understood on the basis of the phenomenon of 
low-energy electron diffraction associated with the 
interaction between the incident electron and the 
atomic target. While the reported differential 
cross-section data for a given target atom varied 
from 5 to 13% depending on the incident energy, 
the integral cross section is known to have less 
uncertainty for most closed-shell atoms over the 
intermediate energy range. In the present study, 
the critical minima and the integral cross section of 
the computed DCS are shown to depend on the 
values of the parameters so that one can vary them 
to find the set that minimizes the DCS at the 
critical angle and corresponds to the correct inte¬ 
gral cross section within the uncertainty of the 
experimental data. Using this set of parameters, 
the differential cross section for the target atom is 
computed over the energy range 20 < E < 500 eV 
and the angular range 2° < © < 180°. The com¬ 
puted DCS are in good agreement with recently 
reported experimental and theoretical cross-section 
data for neon atoms over the intermediate energy 
range. 


Approximation to the Effective 
Potential 

The effective potential is constructed from the 
Hartree-Fock orbitals [4] of the target atom and is 
of the form given by 

V e = V ex ( r) + V p (E,r) 

+ (2Z/r)Y,HA k P tI (r)exp(-B k r), (1) 

k n 

where V ex (r) and V p (E,r) are the exchange and 
polarization potentials, respectively, and the val¬ 
ues of A k and B k (k = 1,2,..., M, M < 10) are 
determined from the Hartree-Fock wave function 
of the target atom. The approximate function rep¬ 
resenting the short-range part of the potential is 
then determined using the Chebyshev polynomials 
T„(x) given by 

T 0 (x ) = 1, T^x) = x, T 2 (x) = 2x 2 - 1. 

(2a) 


The higher-order functions can be generated with 
the recursion relation 

T n+1 (x) = 2 xT n (x) - T n _,(x). (2b) 

The coefficients C ; for the representation of the 
effective potential is computed from the following 
equation: 

Cj = (2/N)J^Log e {(r k /2Z)(<& 0 \V(r k ) l<F 0 >} 

k 

XT j _ 1 (x k ) / (2c) 

where x k = cos{ir(k - \)/N }. The Chebyshev rep¬ 
resentation truncated to order M (M < N) is then 
given by (k = 1 to M in the sum) 

V 0 (r) =(2Z/r)^P n (r)expl^C k T k _ 1 (r) -CJ 2 }. 

n ' k ' 

(3) 

In the approximating equation, several terms con¬ 
taining exponential and polynomial terms in the 
original function are reduced to fewer exponential 
terms of a sum of polynomials. This speeds up the 
DCS minimization code used in the present calcu¬ 
lations. The operator V(r k ) contained in the equa¬ 
tion for Cj represents the Coulomb interaction 
between the incident electron and the target atom. 
In the present study, the Log(base e) of the origi¬ 
nal function is used due to its exponential varia¬ 
tion for the Chebyshev representation using a value 
of N = 30 and the DCS is computed using the 
approximate function with M = 9. The exponen¬ 
tial function is then used to reverse the basis 
representation of the Log of the original function 
for the computation of DCS. These coefficients are 
determined with the condition that the DCS com¬ 
puted from the original interaction agrees with the 
DCS obtained from the approximated potential 
within 1%. This has been found to be sufficiently 
accurate for the DCS minimization algorithm used 
in the present study. 

Thus, the effective potential used in the present 
study is of the form 

V e (E, r ) = V ex (E, r) + V p (E, r) + V 0 (r). (4) 


Computation of Differential Cross 
Section 

The effective interaction potential used in the 
present study is the sum of the short-range and the 
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energy-dependent long-range polarization terms 
given by 

V(r) = <<J> 0 |V 0 (r,r,)|<I> 0 > 

+ a a /[r +f(A, B,E)? + V ex> (5) 

where <F 0 is the ground-state Hartree-Fock [4] 
wave function of the target atom and V 0 contains 
the electron-electron and electron-nuclear poten¬ 
tials, a d is the dipole polarizability of the target 
atom, and the parameters A and B contained in 
/(A, jB, E) are to be determined for the selected 
incident energy E (eV) of the electron. The form of 
the exchange potential V ex is similar to the one 
used in an earlier calculation [5]. The differential 
scattering cross section Kk,&) is then computed 
from the scattering amplitude /(/c, 0) as a function 
of the wave number k and scattering angle © 
given by 

/(A:,©) = (1/2z*)£(2Z + 1) 

/ 

X[exp(2z§ / ) - llP/Ccos©), (6) 

where the phase shifts 6; are computed by solving 
the radial part of the Schrodinger equation for the 
angular momentum in the range 0 < l < 12 using 
the Numerov algorithm [6] from which the zeros 
of the wave functions are used in the argument of 
Bessel and Neumann functions to compute the 
phase shifts accurate to four decimal places. For 
higher partial waves, the Born approximation is 
used for l in the range 11 < / < 30 with a 32-point 
Gauss quadrature method for the numerical inte¬ 
gration. The phase shifts for higher partial waves 
are computed using an analytical expression [7] for 
the phase shift in terms of integrals involving 
Bessel functions of fractional order for values of / 
up to 200. It has been found that for scattering 
angles > 2° not more than 200 partial waves are 
needed for the DCS to converge to an accuracy of 
±0.1%. Flowever, for angles less than 0.1°, it was 
found that more than 1000 partial waves were 
needed for convergence to the listed accuracy. All 
the computations were performed in double preci¬ 
sion using an IBM4381. For scattering angles in 
the small-angle range, it was found that a faster 
algorithm was required to efficiently execute the 
minimization routine. For this, the electron-atom 
interaction was represented by a truncated least- 
square-fitted function containing fewer terms 
which substantially reduced the computing time 
of the forward scattering amplitude and the result¬ 


ing integral cross section. For the higher partial 
waves, the approximation to the short-range inter¬ 
action did not significantly affect the final results 
since the phase shifts depend mostly on the polar¬ 
ization potential. This enabled the determination 
of the approximate values of parameters A and B 
for helium and neon atoms that satisfy the speci¬ 
fied conditions with a few minutes of computer 
time for each incident energy for a given target 
atom. 


Discussion of Results 

The variation of computed DCS with the pa¬ 
rameters contained in the polarization potential 
/(A, B, E) is shown in Figure 1. The plots num¬ 
bered 1,2,...6 are the DCS corresponding to the 
energy-dependent polarization potential with two 
parameters for an incident energy of 60 eV. For 
neon, the minimum of DCS occurs near 105° as 
seen in plot #1 and for smaller angles as the value 
of / is increased. The corresponding values of the 
computed integral cross section are shown in Table 
I. For the approximate determination of parame¬ 
ters A and B, the experimental values [8] of a at 
60 and 100 eV were used. Comparison of cr with 
other experimental values is shown in Figures 2 



SCATTERING ANGLE (deg.) 


FIGURE 1 . Plots numbered 1-6 showing the variation 
of computed DCS for e-neon scattering at 60 eV using 
the model potential for various values of f(A, B, E) with 
B = 0.00476 and A = 0.7-1.5 in steps of 0.1 (DCS in 
units of a^/ster). 
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TABLE I_ 

Variation of integral cross section a with f(E) for 
neon at the incident energy E - 60 eV; experimental 
value 3 of a = 2.98 + 0.06 (10" 16 cm 2 /ster). 


Plot# 

f(E) 

cr 

1 

0.986 

3.181 

2 

0.820 

3.172 

3 

0.920 

3.150 

4 

1.020 

3.131 

5 

1.120 

3.100 

6 

1.220 

3.071 


3 Experimental data from [8]. 


and 3. The final values of A and B are determined 
by minimizing the DCS surface as shown in Figure 
4 in the energy range 50 < E < 90 eV and angular 
range 90 < 6 < 110°. The contour plot shown in 
Figure 5 indicates that the critical values of the 
energy and angle are E = 66 eV and 6 = 98°. These 
are in good agreement with the experimental val¬ 
ues of 65 eV and 101°, respectively. The values of 
the parameters are determined to be A = 1.11 and 
B = 0.0036. The computed values of cr for various 
incident energies are compared with the experi¬ 
mental values of Register et al. [8] and the R-ma- 
trix calculations of Fon et al. [1] as shown in Table 
II. The comparison between the present results and 
other theoretical and available experimental data 



ENERGY (ev) 

FIGURE 2. Comparison of the computed integral cross 
section for e-neon scattering for incident energy in the 
range 30-200: (□) results of this work; (■) experimental 
data; (k) fl-matrix theory. 


is shown in Table III (50 eV) and Table IV (100 eV). 
The computed integral cross section for e-neon 
scattering for the incident energy of 30-200 eV and 
its comparison with experimental data are shown 
in Figure 2 and the computed differential cross 
sections at 100 eV compared with recent experi¬ 
mental and theoretical results for neon are shown 
in Figure 3. 

The computed values of the DCS (open squares) 
are seen to be in good agreement with the experi¬ 
mental data and R-matrix calculations for scatter¬ 
ing angles up to 140°. For larger angles, the model 
potential results seem to differ somewhat from the 
available theoretical data. For different target 
atoms, the present model calculations for the DCS 
are in good agreement with available published 
data except near the critical angles. It is possible 
that there is more uncertainty in the DCS near 
this region compared to those near the forward 
scattering angles. A more general function for 
f(A, B,C, E ) is being tested to determine the DCS 
near the critical angle for other atoms. The value of 
the computed critical angle has been found to be 
in good agreement with published data. The pres¬ 
ent model is being extended to argon, krypton, 
and xenon to determine the limitations of the ef¬ 
fective interaction potential for the computation of 
the DCS in the intermediate energy range. In con¬ 
clusion, the usefulness of the model potential used 
in the present study is for intermediate energies 
where a large number of partial waves are needed 



SCATTERING ANGLE (deg) 

FIGURE 3. Comparison of the computed DCS (in units 
of cm 2 / s) for e-neon scattering at an incident energy of 
100 eV. Markers are same as in Figure 2. 
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TABLE II_ 

Comparison of the computed cross section ct 
with experiment and ft-matrix theory at various 
incident energies for neon atom; <r in units 
of (10 -16 cm 2 /ster). 


E 

(J 3 

a b 

cr c 

50 

3.45 

3.13 

3.08 

60 

3.12 

2.98 

2.86 

70 

2.85 

2.82 

2.70 

80 

2.62 

2.60 

— 

100 

2.27 

2.23 

2.25 

200 

— 


1.45 


a Present results. 
b Experimental values from [8]. 
theoretical values from [f ]. 


TABLE III 


Comparison of the differential cross section with 
experimental and other theoretical results for neon 
at 50 eV (units in / ster). 

Angle 0 

Theory 

Present 

[9] 

Experiment 

[8] 

0 

6.354 

6.8582 


10 

4.731 

4.6057 


15 

3.959 

3.8401 

4.182 

20 

3.329 

3.1967 

3.239 

30 

2.425 

2.2424 

2.100 

40 

1.838 

1.6455 

1.468 

50 

1.407 

1.2717 

1.139 

60 

1.031 

0.9910 

0.889 

70 

0.673 

0.7209 

0.643 

80 

0.352 

0.4426 

0.400 

90 

0.116 

0.1919 

0.168 

100 

0.016 

0.0345 

0.035 

120 

0.329 

0.2424 

0.268 

140 

1.193 

1.1916 

1.103 

160 

2.105 

2.3315 


180 

2.502 

2.8406 



to obtain converged values of the DCS. For low 
energies, where a fewer number of partial waves 
will be sufficient for the convergence of the DCS, 
the close-coupling calculations have the advantage 
of being more accurate at all angles. 


TABLE IV_ 

Comparison of the differential cross section with 
experimental and other theoretical results for neon 
at 100 eV (units in a^/ster). 


Angle 0 

Theory 

(Present) 

Experiment 

[8] 

10 

6.515 

7.605 

20 

3.630 

3.699 

30 

1.969 

1.843 

40 

1.104 

0.968 

50 

0.661 

0.553 

60 

0.405 

0.324 

70 

0.228 

0.199 

80 

0.105 

0.100 

90 

0.042 

0.042 

100 

0.051 

0.039 

120 

0.299 

0.229 

140 

0.758 

0.700 

160 

1.195 


180 

1.385 
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ABSTRACT: The Schrodinger equation for the scattering of an electron by a hydrogen 
molecule is solved by the finite element method, in spherical coordinates, using fifth-order 
Hermite interpolating polynomials. The computational method is quite similar to the 
work of Shertzer and Botero [Phys. Rev. A 49, 3673 (1994), and references therein]. 
However, to study large systems, an effective one-particle dynamical equation is defined, 
unlike the procedure of Shertzer and Botero. To illustrate the basic computational 
procedure, a model electron-H 2 interaction potential (static + exchange + polarization) 
is constructed and the X-matrix is calculated. A novel feature of the present method is 
the procedure for extracting the partial-wave amplitudes at a value of r, the size of 
which is fixed by the range of nonlocal potentials in the problem, and then propagating 
the scattering amplitudes out to an effective infinity where the converged fC-matrix is 
determined. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 591-600, 1997 


Introduction 

L ow-energy electron-molecule scattering is a 
fundamental problem in molecular physics. 
An excellent review of the status of this field in 
1980 can be found in the article by Lane [1]. An 
idea of the current status of the field can be ob- 
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Contract grant sponsor: FAMU-Army High Performance 
Computing Research Center. 

Contract grant number: DAAH04-95-C-0008. 


tained from a collection of articles appearing in a 
recent book [2]. In that book, a number of different 
methods are described for the solution of the low- 
energy electron-molecule scattering problem. As 
described in the article by Temkin and Weather¬ 
ford [3], when the interaction is highly nonspheri- 
cal, the use of angular momentum partial-waves is 
problematical in the region of close encounter be¬ 
tween the electron and the molecule. At a large 
separation, partial-waves are more appropriate. 
Thus, Temkin and Weatherford, in a series of arti¬ 
cles [4-7], described a solution method whereby 
the two-dimensional Schrodinger equation is di¬ 
rectly discretized using finite differences. This was 
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done within the context of time-independent scat¬ 
tering theory [1], and, thus, the partial differential 
equation was elliptic. They were considering elec¬ 
tron-^ collisions where the cylindrical symmetry 
allows for the separation of the (/>-variable, reduc¬ 
ing the effective one-electron scattering equation to 
two coupled variables (r and 9). The scattering 
problem was solved in the frame of fixed nuclei 
(body frame). Thus, r describes the distance to the 
scattering electron and 6 describes the angle of the 
scattering electron position vector with respect to 
the z axis (see Fig. 1). 

In an independent development, Shertzer and 
co-workers [8-12] used the method of finite ele¬ 
ments to discretize the Schrodinger equation for a 
number of important problems. In each case, they 
solved the Schrodinger equation explicitly as a 
few-body problem. The present work differs in 
that we are solving an effective one-body problem 
using finite elements in two dimensions (r, 6). 
This allows for the possibility of solving for the 
solution of electron scattering from much more 
complicated targets [1]. 

The K-matrix is a concise and computationally 
simple means of presenting the results of a scatter¬ 
ing calculation using the method of partial differ¬ 
ential equations. In the present work, the K-matrix 
is calculated for a model e — H 2 interaction poten¬ 
tial, in spherical coordinates, which includes the 
static, exchange, and polarization potentials. The 



r (bohr) 


FIGURE 1. First quadrant of the r - 9 plane deformed 
into a rectangle (PDE solution space). 


cylindrical symmetry is used to factor out the <£ 
dependence, resulting in a two-dimensional ellip¬ 
tic partial differential equation. The scattering re¬ 
gion is bounded by a quarter circle (see Fig. 1, 
where the scattering region is shown as a coordi¬ 
nate rectangle) at a value of r = 10 a 0 where atomic 
units are used. The partial-wave amplitudes are 
extracted at that value of r = p and then the cou¬ 
pled ordinary differential scattering equations in r 
are propagated to an effective infinity where the 
K-matrix is determined [4,5]. 

In the next section, we describe the basic ideas 
of the finite element method that we are using. 
The third section describes the extraction of the 
partial-wave amplitudes and the propagation pro¬ 
cedure. The fourth section describes the model H 2 
potential. Finally, the fifth section gives the results 
and conclusions. 


Finite Element Method 


A short outline of the finite element method is 
presented below for the simple case of elastic scat¬ 
tering with local potentials—the generalization to 
inelastic multichannel scattering is straightforward 
but complicated: The time-independent Schro¬ 
dinger equation for e -molecule scattering using 
the fixed nuclei approximation (FNA) in 3D, using 
spherical coordinates (in atomic units), is 

’ d 2 is id 2 cote d 

- 1 - 1 - 1 - 

dr 2 r dr r 2 dd 2 r 2 d0 


+ 


1 d 2 
r 2 sin 2 0 d(fi 2 


+ fc 2 


^(r, 0,<£) 


= 2 V(r,0,0Wr,0,0). (1) 


The homonuclear diatomic molecule is the sim¬ 
plest system for which nonspherical terms con¬ 
tribute to the scattering potential, so it will serve 
as the physical anchor for an otherwise computa¬ 
tional study. When the energy of the incident 
electron is between 1 and 10 electron volts, the 
nonspherical structure of the molecule is particu¬ 
larly important. In such a situation, we use the 
axial symmetry of the molecule to reduce the 
Schrbdinger equation to a two-dimensional partial 
differential equation. We assume for simplicity 
that the collision does not excite vibrational states 
of the molecule. Rotational excitations may be dealt 
with at the cross-section stage of the calculation in 
the usual adiabatic-nuclei manner [1]. 
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The target molecule may be assumed to be 
oriented in a particular direction in space so we 
use the axis of symmetry of the molecule as the 
z-axis of our coordinate system (Fig. 2) and define 
a modified wave function which is related to 
the true wave function 'F for the projectile by 

'FCr, 9, 4>) = 6)e im<f> . (2) 


Note that we are using the nonreduced equation. 
The equation for 9) in atomic units is 


2 # (m) 1 


+ - 
r 


dip 


On) 


2 , /,('«) 


dr A r dr 

-Oik 1 - 2 V(r,6) - 


H-^ < cot (6 )-h 




nr 


r 2 sin(d) 


39 36 2 

ip {m) = 0, (3) 


where V(r, 6) is the scattering potential and k is 
the momentum of the projectile. A finite element 
approximation to this equation is solved on the 
domain 0 < r < p, 0 < 9 < tt/2. The outer limit p 
is chosen to be sufficiently large that the scattered 
particle sees a nearly spherical potential at that 
distance and, for cases with a nonlocal potential, is 
greater than the range of the nonlocality. The range 
of 0 was cut in half to take advantage of the 
symmetry of the molecule under reflections in the 
x-y plane. 

No radial boundary condition is set at r = 0. If 
we were solving the reduced equation, we would 
require that ijs = 0. We do set the derivative of ijs 
with respect to 6 equal to zero at the origin 
(dip/ 39 | r « 0 — 0)* Even though we are solving an 
elliptic equation, which requires a specification of 
a boundary condition on all boundaries, we may 



FIGURE 2. Spherical coordinate system for scattering 
from a diatomic molecule. 


avoid setting a boundary condition at individual 
points on the boundary as long as they are iso¬ 
lated. At 6 = 0, there are two possibilities: If m, 
the z-component of the orbital angular momen¬ 
tum, is nonzero, then ip — 0; otherwise, 3ijj/36 — 0. 
At 6 — tt/2, the parity of the system dictates ei¬ 
ther that u — 0 or 3u/30 = 0. At r = p, we impose 
the requirement that the incoming electron be in a 
state of well-defined orbital angular momentum, 
so that 


*A/ (m) ( P,0) = Pi m (cos6). (4) 

The parity of the associated Legendre function, 
( —1) /+ m , determines the choice of boundary condi¬ 
tion at 0 — tt/2. Although this parity is conserved 
by the scattered wave function, the value of l for 
the incident wave is not. Outgoing waves therefore 
contain a superposition of /, / ± 2, l ± 4, and so 
forth. The finite element discretization results in a 
matrix algebra problem: 


A X X Jm — B lm , (5) 

where A is the coefficient matrix; X lm , the solution 
vector at the grid points; and B lm , the vector of 
boundary values. Note that the solution and 
right-hand-sides are indexed by lm, the Legendre 
parameters from Eq. (4). Once the solution has 
been found, information about the scattering is 
extracted by matching it to the known asymptotic 
form [4] at r t < P : 


ik/ n) (r t , 0) = £P rm (cos0) 
v 


X 


1 1 

Si’(kr t )J rm im + C r (kr t )N Vm lm 

r t r t 


( 6 ) 


and 


— 9) 

3r 

= Y,p,. m (cose ) 

V 


17 l d l 


"4 1 

| 

*"4 

ZT* 

1 

-4 1 

K> j 

Jl’m, lm 

id i \ 

* 

C v (kr t ) , C,,(kr t ) 

r t r t / 

l m, lm ' 


(7) 


where l and m are the index and order of the 
Legendre function used in the boundary condition 
at large r = p, and S and C are related to spherical 
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Bessel functions by S, = k 1/2 rj t and C } = ~fc 1//2 rn / . 
Thus, ji and n l are the regular and irregular spher¬ 
ical Bessel function [13]. The coefficients J and N 
form matrices on the space of allowed values of 
orbital angular momentum. Note that J and N are 
diagonal in m because of the cylindrical symme¬ 
try. The validity of Eq. (7) can be established 
because of the Wronskian relation S,CJ — SJC Z = 
— 1 (where the prime indicates differentiation with 
respect to r) and Leibnitz's rule with respect to the 
parametric differentiation of an integral. This latter 
point concerning parametric differentiation is a 
somewhat subtle point that is explainable only 
when viewed in light of the noniterative integral 
equation (NIEM) solution method [14,15]. We will 
not elaborate on the point now, but only say that it 
is well established [16]. 

The orthogonality of the Legendre functions may 
be used to project out a particular value of L When 
this is done, the linear independence of the Bessel 
functions allows extraction of the unknown coeffi¬ 
cients J and N from Eqs. (6) and (7) by a matrix 
inversion. This process must be done for each 
allowed value of Z, e.g., each right-hand side of Eq. 
(5). However, the matrix A does not have to be 
redetermined since it is independent of the p 
boundary condition. The result is two matrices 
describing the transformation of an angular mo¬ 
mentum eigenstate into a scattered wave. In prin¬ 
ciple, the dimension of the matrices is infinite, but, 
in practice, the finite range of the nonspherical 
potentials provides a cutoff. For an H 2 molecule, 
with a projectile energy below 10 eV, values of Z 
less than 5 are all that are required. Hence, the 
matrices are at most 3x3. The K-matrix is de¬ 
fined by the equation 

i kj' m Kr / 6) 

1 

= -£[S r Ur) + C v {kr)K n ]P Vm {cosQ) f (8) 
r v 

so K = NJ-\ and the influence of the artificial 
imposition of an initial orbital state is removed. 
The diagonal elements of K are the tangents of the 
elastic-phase shifts, and the off-diagonal elements 
describe the coupling of states of different orbital 
angular momentum by the potential. 

The method of finite elements is well described 
in [9,17] and references therein. We will give a 
short outline of the procedure. The procedure is 

1. Divide solution space into nonoverlapping 
domains called "finite elements." 


2. Define local basis functions within each ele¬ 
ment—in the present case, fifth-order Her- 
mite polynomials. 

3. Discretize in each element by the Galerkin 
method and implicitly form a global element 
matrix. 

4. Form a global connected matrix by enforcing 
continuity of the solution and its deriva¬ 
tives across element boundaries. 

5. Form a boundary reduced^ connected matrix 
and boundary vector; A; B ^ 

A X = B. (9) 

6. Solve simultaneous equations by lower-up¬ 
per factorization (LU) decomposition [17,18]: 

X = A _1 B. (10) 


[9]: 


The method of Hermite interpolation is used 


-1 


0 


+ 1 


- ^ 

> r 


•(e) 


r< e) + \h\ e) 


r { 0 e) + h[ e) 


In terms of local variables Xf. {-1,0, +l}f =1 , 

6 

ip (e) (x) = Yj 
i=i 

where 


(ID 


-r 

H e) 


^ (c) (*i) 

d 

— (e) U 

dx 

lf/ <e) (x 2 ) 

d 

— 

dX 

ik ie) (x 3 ) 
d 


dX 




and x) is determined from 

6 

p«(*)“ I ]gii* hl 
/=i 

Pj(xj) = 8jj fori = 1,3,5 
and / e {x l , x 2 , x 3 } 


( 12 ) 


(13) 


(14) 


594 


VOL 65, NO. 5 




ELECTRON SCATTERING FROM A MODEL H, POTENTIAL 


— p ; ( x)| x . = S if for i = 2,4, 6 

and j e {x u x 2 , x s }. (15) 

Let us write Eqs. (13) and (14) explicitly: 

Pi( x) = g n + g 12 x + g 13 x 2 + g u x 3 + g 15 x 4 

+ gl6 x5 
d 

— = &22 + 2#23* + 3#24 + 4g 25 X 3 

+ 5 #26 x4 

P 3 (*> = $31 + £32* + g 33 X 2 + g 34 X 3 + g 35 X 4 

+ g 36 X 5 

d 

— p 4 (^> = ^42 + 2g 43 x + 3g 44 x 2 + 4g 45 x 3 

+ 5g 46 x 4 

p 5 ( x) = g 51 + g 52 X + g 53 x 2 + g 54 x 3 + g 55 x 4 

+ g 5 6 x5 

d 

— p 6 (^> = <?62 + 2^ 63 X + 3g 64 x 2 + 4 g 65 X 3 

+ ^g^X 4 . 

We now implement conditions (14) and (15) and 
construct a matrix equation to be solved for g-: 


$n 

$12 

$13 

$14 

$15 

$16 

$21 

$22 

$23 

$24 

$25 

$26 

$31 

$32 

$33 

$34 

$35 

$36 

$41 

$42 

$43 

$44 

$45 

$46 

$51 

$52 

$53 

$54 

$55 

$56 

$61 

$62 

$63 

$64 

$65 

$61 


10 10 
0 111 
0 0 12 
0 0 13 

0 0 14 

0 0 15 


= 1. (16) 


The expression for i/h e) in terms of global variables 
can now be written using 


r = r ( Q e> + h[ e) 


(1 + x) 
2 


“ _1 + Tm . 




(17) 


To go to a global representation of (11), ijj (e) (r), it 
is not necessary to let x -» r in pfx). The global 
operators can be transformed to local operators 
which operate only on the piecewise continuous 
polynomials within each element: 


d dx d 2d 
dr dr dx h\ e) dx 

A2 \ o n 2 ,2 


/zl e) dx 2 * 


(18) 


(19) 


So, we can simply say that 


4 ^ (e) (r) = Y ^ e) Pi(x). 

i = i 


( 20 ) 


However, the problem with (17) is that for 
/ = 2,4,6, represents derivatives of ^L (e) (r) with 
respect to the local variable x and we need it to 
represent derivatives with respect to the global 
variable r : 

dif/^Kx) dr £?i/f (e) (r) 1 dip (e) (r) 

- --= -h\ e) -. (21) 

dx dx dr 2 dr 


Thus, 


y (e) (r) = Y e) Pi(x ) 

* = 1,3,5 


_ 1 / <?i// (e) (r) 

+ E ^ h \ e) --- Pi(x). (22) 

*■=2,4,6 2 


Let us now write 


¥ (e) (r) - £^L>pW(x), 

i ~ 1 


(23) 


such that 


\|/(<0 = 


/or / = 1,3,5 

dijj^Kr) I 


(24) 


(l + x,) 

dr r j = h[ l ^ —-— 


for i = 2,4,6 


and 


pi' Kx) ,h ix) /»'•■ = C3,5l 
' U«p,.(x) fori = 2,4,6’ 
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Thus, the solution and its derivatives [see Eq. (12)] 
are extracted at each grid point in the solution 
space. 


Partial-wave Amplitude Extraction and 
Propagation Procedure 

The matrices J and N can be obtained from Eqs. 
(6) and (7), at r = r t , as follows: 

' J0»> (r( ) ' 

N (, " ) (r ( ) 


— S(r,) 


-S'(q) - -jS(r ( ) 


— C(r,) 
r. 


-C (r t ) - -C(r ( ) 


F (m) (r f ) 

F' ( '")(r,) 


(26) 


Note that the prime indicates differentiation with 
respect to r. Also, S and C indicate diagonal matri¬ 
ces. Note that the F and F' are the terms within the 
large square brackets of Eqs. (6) and (7). Rewriting 
Eqs. (6) and (7) in terms of F and using the prime 
to indicate differentiation with respect to r (except 
when used on /'), 

tf m Kr lr 6) = Y J P rm (cose)Ff,fKr t ) (27) 
= £P rm (cos0)F'j.7 ) (r f ). (28) 


Then, using the orthogonality of the Legendre 
polynomials, 

, w x (/' - + 1/2) 

^ - «■ + ,)■ — 2 

X f / sin(d) d6 P rm (cos 6) (29) 

J n 


F'W(r t ) = 


(/' - m)l(r + 1 / 2 ) 


(/' + m)\ 


X /* 7r sin(O) dd P rm (cos , 6). (30) 

Let us drop the m label and switch to the 
reduced function / defined by f = rF. Let us also 


drop the l and V label and switch to i and j. We 
thus have the matrix of partial-wave continuum 
orbitals f at r = r t . Thus, from [14-16], we may 
write 


f(r) = S(r)J(r) + C(r)N(r), 


(31) 


where 


Jijir) = 8 tj + 2 fdxC i (x)R ij (x) (32a) 
■'0 

Njj(r) = 2 f dxS i (x)R ij (x), (32b) 


where 


Rn(r)=EV ik (x)f ik (x). 


(33) 


We are working with the unnormalized solution in 
the sense defined in [14]. Then, by using a trape¬ 
zoidal integration rule with step size A r, it can be 
shown that [16] 


I(r y ) - 'P(r i )I(r i _ 1 ) f 


(34) 


where 


I(r) = 


J (r) 

N(r) 


(35) 


and 


P(r) = 1 + Ar M(r) 


(36) 


and 


M(r) = 2 


+ C(r)V(r)S(r) +C(r)V(r)C(r) 


- S(r)V(r)S(r) 


S(r)V(r)C(r) J 
(37) 


Thus, we may form 


I(rJ = P X I(r f ), 
where, if r, = r 0 , the origin, then 


Kr n ) s 


(38) 


(39) 


Otherwise, the value of I(r,) is picked up from the 
partial differential equation (PDE) calculation and 
constructed using Eq. (35). Thus, we may propa¬ 
gate from the surface of the PDE sphere, out to an 
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approximate infinity. This is what we do in the 
present work. Thus, 

P - P(r f ) X P (r [f _ X] ) X — X P(r f ) (40) 

and P (r f ) = P(r M ). Then, 

K(rJ = N(rJxj- 1 (rJ. (41) 

All the scattering information is contained in K(r rx ). 
It is a real, symmetric matrix. 

It should be pointed out that his propagation 
procedure is very similar to the procedure of the 
NIEM [14]. The NIEM propagated f instead of I 
and thus required unnecessary matrix multiplica¬ 
tions. Therefore, this propagation is a very general 
solution algorithm and, in principle, could be used 
all the way into the origin. For the present problem 
under consideration, this works fine and is, in fact, 
used to check the accuracy of the results. For more 
general problems with highly nonspherical poten¬ 
tials, accuracy is lost because of the large number 
of partial-waves required near the origin and be¬ 
cause of the large magnitude of the irregular 
spherical Bessel function n t (r) near the origin as 
compared with the regular spherical spherical 
Bessel function j } (r). We have therefore defined a 
computational procedure similar to the R-matrix 
method [19]. The difference is that we are breaking 
the R-matrix sphere up into elements and span¬ 
ning each element with Hermite interpolating 
polynomials instead of basis function spanning the 
entire R-matrix sphere. 


The Potentials 

The potential energy function V in Eq. (3) repre¬ 
sents the interaction of the projectile with the tar¬ 
get molecule and is the source of the breaking of 
spherical symmetry. In principle, this interaction is 
highly complicated. A molecule is by no means a 
static, rigid body in the presence of an incoming 
electron, so a potential model is not strictly correct, 
but the practical usefulness of potentials in calcu¬ 
lations is great enough to excuse the gross simpli¬ 
fication of the scattering problem that they imply. 
The potentials involved in scattering from hydro¬ 
gen molecules are approximations to three effects: 
the static electric moments of the molecule; the 
polarization of the molecule by the electric field of 
the projectile; and the substitution of an electron 
from the molecule for the projectile in the outgoing 


state, known as “exchange." An H 2 model was 
adopted in the present work, which mimics to 
some extent, the radial structure of the physical 
potentials and has angular properties which are 
expandable in the low-order Legendre polynomi¬ 
als of even parity. Actually, only three polynomi¬ 
als are needed because the potential, constructed 
below, contains only l = 0,2,4. 

Because the hydrogen molecule is composed of 
identical atoms, the lowest-order multipole field 
seen by the projectile is the quadrupole term 

Vg(r, 9) = -~P 2 (cos 9), (42) 

where Q = 0.49 in atomic units. This form is valid 
for vales of r which are outside the molecule for 
all 0 . At shorter ranges, the quadrupole potential 
is cut off by the usual exponential term (1 — 
exp[—(r/a) 6 ]) as a multiplier. The adjustable pa¬ 
rameter a is set to 1, since no effort is made in the 
present work to fit experimental data. The static 
potential inside the molecule is represented by a 
Yukawa function of unit range and arbitrary 
strength, which is adjusted as necessary to give the 
correct order of magnitude for the K-matrix. 

Polarization of the target is well known to in¬ 
clude terms of both spherical and P 2 character [1]. 
The form used here is 

1 

Vp(r, e) = --4 (1 - e ( ' Otflo + a 2 P 2 icos 6)], 

(43) 

with a 0 = 5.5, and a 2 — 1.4. 

The exchange potential used here is a crude 
form of the free-electron gas model. The basic idea 
here is that the electron in the target may be 
approximated by a degenerate Fermi gas whose 
density varies with position [1]. The exchange po¬ 
tential is proportional to the Fermi momentum at 
any point in space, giving 

V x (f) = -(A) [ 377 2 /3 ( r )] . (44) 

The electron density is approximated by a 

simple sum, 

1 

j8(r,0) = — e~ r [l + 1.3P 2 (cos0)], (45) 

477 

which has contours of constant density which re¬ 
semble those of the sum of two hydrogen atoms 
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separated by 0.7 angstroms. The resulting potential 
can be approximated by a linear combination of 
Legendre polynomials, which is 

V x (r, 0) = —0.8<r r/3 [0.97 + 0.43P 2 (cos 6) 

+ 0.1 P 4 (cos 0)]. (46) 

The "realistic" potential is a sum of these three 
terms. 

The long-range quadrupole potential is the most 
important term in real scattering problems, but it 
does not vanish until a distance of 14 bohr radii 
from the molecule's center is reached. This was 
determined a posteriori, by searching for a region 
in which the K-matrix does depend on small 
changes in r mnx . Such a long range requires a large 
number of radial points and, consequently, a long 
running time for the computer code. Although the 
convergence properties of the angular approxima¬ 
tion are roughly independent of the quality of the 
radial approximation, this independence is valid 
only when the radial mesh is on the order of 
magnitude of the mesh which gives decent radial 
convergence. The P 2 terms in the polarization and 
exchange potentials are, in any case, expected to 
yield the same requirements on the angular mesh. 


Results and Conclusions 

Convergence of the K-matrix is determined in 
two way: The first takes advantage of the invari¬ 
ance of the K-matrix under time reversal to re¬ 
quire that the K-matrix be symmetric. The second 
is that the value of an element of the of the 
P-matrix change with changes in the number of 6 
points in a manner which points clearly to an 
asymptotic limit. Now, in the present calculation. 


the radius of the PDE solution region was set at 10 
bohr. The partial-wave amplitudes were extracted 
at the first grid point of the last element whose last 
grid point is p. The solution was then propagated 
to 70 bohr where the K-matrix was determined. 

The matrices are not strictly symmetrical. When 
the scattering potential is spherical, the wave func¬ 
tion should have the angular properties of the 
large-r boundary condition, all the way to the 
origin. This is true only when the boundary condi¬ 
tion is for l = 0. When higher waves are used, 
each contains a contaminant of lower l values. 
This contamination decreases with each step in r 
away from the origin. 

The second criterion makes an accurate graphi¬ 
cal depiction of the convergence difficult. The con¬ 
vergence of almost all elements is monotonic, so 
the choice of a specific limit is arbitrary, yet it has 
a great impact on the perceived quality of the 
convergence. A systematic study of the conver¬ 
gence has not been done as a part of the present 
work. We set grid parameters which ensured accu¬ 
racy to at least five significant figure in all of the 
K-matrix elements for k = .1, .5, and 1.0 au. The 
PDE r-grid used A r = 0.1 bohr. The PDE 0-grid 
used A 0 — 10°. The propagation step size was Ar 
= 0.01 bohr. The PDE (with propagation from 10 
to 70 bohr) results are displayed in Tables I—III. 
The answers were checked by the propagation 
algorithm starting at the origin and going out to 70 
bohr again using Ar = 0.01 bohr. The propagation 
algorithm actually produced K-matrix accuracy to 
12 significant figures. All calculations were done 
using 64-bit arithmetic. Table IV shows the con¬ 
verged (p = 70 bohr) ^-matrices as a function of 
the incident electron moment (fc). 

Figure 3, as well as the X u contribution, com¬ 
pares the present X g elastic cross sections, calcu- 


K-matrix elements for k = 0.1 au in X g symmetry; p is the value, in units of bohr, of r where the K-matrix was 
determined symmetric to the number of figures indicated. 


k = 0.1 au 


p 

Mi, i) 

Mi, 2) 

Ml, 3) 


M2, 2) 


M2, 3) 


M 3,3) 

10 

- .39023E + 1 

— .18534E + 0 

— .43518E - 

2 

— .41619E - 

2 

— .10151E - 

3 

.47213E - 4 

30 

— .10282E 4- 1 

— .71393E - 1 

— .22522E - 

2 

.36462E - 

1 

.14096E - 

2 

.24640E - 2 

50 

— .10267E + 1 

— .72195E - 1 

— .22252E — 

2 

.36821E - 

1 

.98354E — 

3 

.24326E - 2 

70 

— .10267E + 1 

- .72271E - 1 

— .22363E - 

2 

.36769E - 

1 

.96741E - 

3 

.22941 E - 2 
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TABLE II 


Same as Table 1 except for k = 0.5 au. 

P 



k = 0.5 au 



K( 1,1) 

K( 1,2) 

K( 1,3) 

K( 2,2) 

K( 2,3) 

K( 3,3) 

10 

.25061E + 1 

.61816E 4 0 

.77306E - 1 

.16084E 4 1 

.23790E + 0 

.63817E + 0 

30 

.34973E + 1 

.10642E + 1 

.12152E + 0 

.21885E 4 1 

.30192E + 0 

.76876E 4 0 

50 

.34979E + 1 

.10645E + 1 

.12158E + 0 

.21890E 4 1 

.30203E 4 0 

.76885E 4 0 

70 

.34978E + 1 

.10645E + 1 

.12158E 4- 0 

.21889E 4 1 

.30203E + 0 

.76883E 4 0 


lated using [1] 

4lT „ 

^=TtEI T u .\ 2 , (47) 

K l r V 

where 

T = 1 - (1 + iK)(l - z'K)“\ (48) 

and the total elastic cross-section calculation of 
Jain and Baluja [20] as well as the total experimen¬ 
tal cross section of Hoffman et al. [21] It must be 


emphasized that the present calculation employs a 
model e~- H 2 potential and, thus, the interesting 
agreement with the experimental results of Hoff¬ 
man et al. is fortuitous. These results simply indi¬ 
cate that the model for the interaction potential has 
a realistic aspect to it. 

The finite element method has been shown to 
be an effective numerical method for the solution 
of the time-independent electron-molecule scatter¬ 
ing problem. The next step is to generalize to 
nonlocal potentials and coupled equations in order 
to treat real molecular targets realistically. 


TABLE III_ 

Same as Table I except for #c = 1.0 au. 


k = 1.0 au 


P 

K(1,1) 

K( 1,2) 

K(1,3) 

K{ 2, 2) 

K(2, 3) 

K( 3,3) 

10 

.67237E + 0 

.11579E 4 0 

.85839E - 2 

.92080E 4 0 

.11500E 4 0 

.72282E + 0 

30 

.72882E 4 0 

.12633E 4 0 

.81894E - 2 

.10158E 4 1 

.12908E + 0 

.80166E 4 0 

50 

.72888E + 0 

.12632E 4 0 

.81858E - 2 

.10159E 4 1 

.12908E 4 0 

.80172E 4 0 

70 

.72887E 4 0 

.12632E 4 0 

.81852E - 2 

.10159E 4 1 

.12908E 4 0 

.80171E + 0 


TABLE IV 


Converged K-matrix elements (p = 70 bohr) in X g symmetry. 


k 

K(1,1) 


K(1,2) 


K( 1,3) 


K( 2, 2) 


K( 2,3) 

K( 3,3) 

0.10 

— .10267E 4 1 

- .72271E 

- 1 

- .22363E 

- 2 

.36769E 

— 

1 

.96741E - 3 

.22941E - 2 

0.25 

- .3530E + 1 

- .88189E 

4 0 

- .68481E 

- 1 

.79764E 

4 

0 

.80772E - 1 

.19074E + 0 

0.50 

.34978E 

4 1 

.10645E 

4 1 

.12158E 

4 0 

.21889E 

4 

1 

.30203E 4 0 

.76883E + 0 

0.75 

.12721E 

4 1 

.27909E 

4 0 

.26470E 

- 1 

.14367E 

4 

1 

.20023E 4 0 

.88056E 4 0 

1.00 

.72887E 

4 0 

.12632E 

4 0 

.81852E 

- 2 

.10159E 

4 

1 

.12908E 4 0 

.80171E 4 0 

1.25 

.45211E 

4 0 

.67314E 

- 1 

.26203E 

- 2 

.75040E 

4 

0 

.86545E - 1 

.68998E + 0 

1.50 

.27464E 

4 0 

.36056E 

- 1 

.52522E 

- 3 

.56448E 

4 

0 

.59900E - 1 

.58408E + 0 

1.75 

.15123E 

4 0 

.15902E 

- 1 

- .45278E 

- 3 

.42410E 

4 

0 

.41872E - 1 

.49046E 4 0 

2.00 

.65783E 

- 1 

.13066E 

- 2 

- .10867E 

- 2 

.31309E 

4 

0 

.28755E - 1 

.40838E 4 0 

2.25 

.11581E 

- 1 

— .96914E 

- 2 

— .16741E 

- 2 

.22331E 

4 

0 

.18616E - 1 

.33608E 4 0 

2.50 

— .15535E 

- 1 

— .17612E 

- 1 

- .23247E 

- 2 

.15070E 

4 

0 

.10418E - 1 

.27204E 4 0 

2.75 

- .20396E 

- 1 

- .22432E 

- 1 

- .30457E 

- 2 

.93306E 

— 

1 

.36233E - 2 

.21529E 4 0 

3.00 

- .95865E 

- 2 

-.24141E 

- 1 

- .37803E 

- 2 

.50201E 

— 

1 

- .20086E - 2 

.16531E 4 0 
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FIGURE 3. Cross section (•••) present elastic 2 g + X u 
cross section for H 2 model potential; (a a a) calculation 
of elastic cross section (Jain and Baluja [20]); (OOO) 
experimental total cross section (Hoffman et al. [21]). 
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ABSTRACT: Some preliminary numerical studies of scattering by a potential separable 
in a general ellipsoidal coordinate system are presented. The associated scattering or 
far-field amplitudes are evaluated using expansions in radial and angle Lame wave 
functions. Simple representations of the scattering matrix and the far-field amplitude are 
given as well as some preliminary figures of scattering data and related potential level 
surfaces. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 601-608, 1997 


Introduction 

W hen separating the variables in terms of 
ellipsoidal coordinates (see Fig. 1), particu¬ 
lar solutions of the Helmholtz equation are formed 
as products of the so-called angle and radial ellip¬ 
soidal (or Lame) wave functions. For comparison, 
in case of a sphere, the products of the angle 
functions are the spherical harmonics, while the 
radial ones are the Bessel and Hankel functions. 

A potentially very useful method to study, cal¬ 
culate, and analyze these ellipsoidal wave func- 
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tions was presented, during the last 7-8 years, by 
the powerful Abramov School at the Department 
of Numerical Methods at the Computing Center of 
the Russian Academy of Sciences [1-3]. This work 
was, to a large extent, inspired by M. V. Fedoryuk, 
who investigated the fundamental properties of 
the ellipsoidal wave functions (see [4,5] and the 
literature cited therein). Furthermore, his articles 
[6,7] on acoustic scattering by a triaxial ellipsoid 
are also of great importance here, as he was the 
first to obtain the explicit form of the series expan¬ 
sions of scattering data in terms of ellipsoidal 
wave functions. 

It should be noted in this context that we are 
not simply interested in the ellipsoidal wave func¬ 
tions as such, but we have, rather, a more general 
viewpoint in mind. For instance, one may apply 
the Fourier technique to solve the Schrodinger 
equation, thus arriving at a system of problems 
very similar to those arising in the study of the 
Helmholtz equation. In the particular case of the 
potential plotted in Figure 2, the angular functions 

CCC 0020-7608 / 97 / 050601 -08 
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FIGURE 1. Examples of ellipsoidal coordinate surfaces. The grid is produced by the intersection of the coordinate 
surfaces. For more details, see text. 



FIGURE 2. Equipotential surfaces of the potential V{£ v f 2 , £ 3 ) = y 0 l [(£ 3 - € 2 )U; 3 - ^)]. Two adjacent octants are 
displayed to show the mirror symmetry of the potential. 
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will coincide with the angular ellipsoidal wave 
functions. The methods developed in [ 1 - 3 ] allow 
us to compute these perturbed ellipsoidal wave 
functions for an arbitrary potential separable in 
ellipsoidal coordinates and vanishing sufficiently 
fast at infinity and, more importantly, to represent 
scattering data in them. The present technique 
should also lead to many important applications to 
general problems in physics and chemistry, where 
the standard partial wave analysis may extend 
into the regime of thousands and more waves— 
obviously, by trying to manage a description of a 
system far from spherical symmetry. 

In a recent contribution [8], we examined the 
shapes of the partial waves in an ellipsoidal repre¬ 
sentation using so-called pcu-diagrams, where 
p 2 = ( a 2 — c 2 )/(a 2 — b 2 ) and oj 2 = k 2 ia 2 — b 2 ) are 
defined [see Eqs. ( 1 )—( 4 ) below]. It is easy to see 
that the flexibility offered by the ellipsoidal coordi¬ 
nate representation is obviously absent in the 
standard choices of coordinate systems. 

We will here consider the scattering of particles 
by a potential which enables a separation of the 
variables in ellipsoidal coordinates, i.e., we look 
for the solution to the Schrodinger equation 

AT' + (k 2 - V(r))V = 0 , re I 3 ( 1 ) 
defined at infinity by 

M/'fr, k) ~ expz(k, r) 

exp iikr) 

+ fin, n 0 ) -, (r —» oo) (2) 

r 

where fin, ft 0 ) is the far-field or scattering ampli¬ 
tude, n = r/r and n 0 = k/k. Our aim was to ex¬ 
pand it in terms of the eigenfunctions of the 
Sturm-Liouville problem arising as a result of 
separation of variables in (1). In what follows, the 
variables introduced, in e.g., [ 1 - 3 ] are used. 


Separation of Variables 

Given a > b > c > 0 , the ellipsoidal coordinate 
system is defined by 

x 2 /(a 2 + 17) + y 2 /(b 2 + 17) 4 - z 2 /(c 2 + 7]) = 1 , 
or, going over to the dimensionless variables £ = 


(77 + a 2 )/{a 2 — b 2 ), being used further below, by 

x 2 y 2 

( a 2 - b 2 )i; + (a 2 - b 2 )i £— 1) 

z 2 

+ ia 2 ~b 2 )U-p 2 ) 

Here, we have denoted 

p 2 = ( a 2 - c 2 )/ia 2 - b 2 ) > 1. 

This equation defines a family of confocal sur¬ 
faces specified by the variable To obtain a deeper 
understanding of the ellipsoidal coordinate sys¬ 
tem, one should contrast Figure 1 with Eq. ( 3 ). For 
each point ( x , y, z), xyz ¥* 0, the latter exhibits 
exactly three different roots, ( — A, £?, £,, where 
£ e/, = (0; 1), & e I 2 = (1; p 2 ), and £ 3 e I 3 = 
ip 2 ; 00). The equation £ 3 = £ 3 ° e I 3 defines an el¬ 
lipsoid with the semiaxes ia 2 - £> 2 )£ 3 ° > ia 2 - 
b 2 )ig 3 ° — 1 ) > ia 2 - fr 2 )(£ 3 ° - p 2 ). As £ 3 tends to 
p 2 + 0, ellipsoids degenerate into the part of the 
plane z = 0 bound by the focal ellipse x 2 /ia 2 - 
c 2 ) + y 2 /ib 2 - c 2 ) = 1 . Setting £ 2 = p 2 , we obtain 
the part of the plane z = 0 complement to £ 3 = p 2 . 
As £ 2 diminishes from p 2 to 1 , the equation 
£2 = ^2° specifies a hyperboloid of one sheet. As £ 2 
tends to 1 + 0, hyperboloids degenerate to the part 
of the plane y — 0, enclosed between the branches 
of the focal hyperbola x 2 /ia 1 — b 2 ) — z 2 /ib 2 — 
c 2 ) — 1 . The remaining part of the plane, which 
consists of two disjoined regions, containing foci of 
the hyperbola, is given by = 1. Starting from 

= 1, a family of hyperboloids of two sheets, 
each described by ^ tends to the plane x = 

0, which coincides with the surface ^ = 0. 

The above three families of confocal surfaces 
form an orthogonal curvilinear coordinate system 
related to Cartesian coordinates within each octant 
(e.g., x > 0, y > 0, z > 0) by the one-to-one corre¬ 
spondence: 

x 2 = (« 2 -& 2 )^ 2 £ 3/ y, 

y 2 = -(a 2 - b 2 )(£i - 1)(£ 2 - 1 ) 

x (£3 1 )/(p 2 - 1), 

Z 2 = ( a 2 - b 2 )( - p 2 )(^ 2 - p 2 ) 

X(f 3 -P 2 )/[(P 2 -!)P 2 ]- 

In Figure 1 , one can see the ellipsoidal coordinate 
surfaces of all possible types. Here, we cut off a 
one-eighth part of the ellipsoid to show the hyper¬ 
boloids of both one and two sheets. It is readily 
seen that there is a point of intersection of all the 
plotted surfaces in each octant. 
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We now proceed as follows: Denote 

L i = ]/f ( & ^}' 

7 = 1 / 3 , with /(£) = g(g - 1)(£ - p 2 ), 
ai 2 = k 2 (a 2 — b 2 ). 

Then, in the newly introduced coordinates, the 
Lame coefficients (see below) are ft? = — 

f 2 Xfi - £ 3 )(fl 2 - ft 2 )/( 4 /(£i)) (ft 2 and h 3 result 
from the above formula by a cyclic permutation of 
the indexes), with Eq. ( 1 ) taking the form 


By substituting a particular solution T'(r) = 
A (1) (£i)A (2) (£ 2 )A (3) (£ 3 ) into (1), we obtain three 
equations similar to the Lame wave equations: 



+ it <?( £i/ h,l, (o) — a( ^)] A (1) — 0 , 

0 < < 1 , ( 6 ) 



^ “ £(£ 2 )]A (2) = 0 / 


{[(& - + Ui- € 3 )Li + (& - ^i)i 3 ] 

+ ?(£l _ &)(& ” £ 3 )(£ 3 ~ £l) 

x[a> 2 - V(f 1/ f 2 ,f 3 )]}M(r) = 0 . ( 4 ) 

In ( 4 ), we will admit a separable potential of the 


y(fi/ f 2 / £3) - [(^ 3 _ £ 2 ) a (£i) + (£1 _ £3)P(f 2 ) 
+ (£ 2 - £i)y(£ 3 )n(£i _ £ 2 ) 

( 5 ) 

Here a, j8, and y are arbitrary functions. This is, 
in fact, the most general expression for such a 
potential [ 9 ], 


1 < f 2 < P 2 , ( 7 ) 

_ d I _ _ dA (3) 

3> 

+ iU(f 3 /M,«) - y(^)]A (3) = o, 

p 2 < < °°, (8) 

where ft, Z are the separation constants, 

qU;, h, l, a>) = ftp 2 - Ip 2 ^ + <w 2 £ 2 . 

Taking the symmetry of the problem into ac¬ 
count, a particular solution of (1) must be either 
odd or even with respect to each coordinate plane. 
Since coordinate planes correspond to the ends of 
the intervals of the ellipsoidal coordinate varia¬ 
tions, we find that 



either lim = 0, 

f i - +o v ' 

either lim Ujil ,) A'jC^)) = 0, 

either lim (V-/(£>) A' 2 ( £ 2 )) = 0 , 
f 2 ^i + o v “ > 

either lim (\/-/(f 2 ) A' 2 (£ 2 )j = 0 , 
£ 2 p 2 ~ o 

either lim [Jf( £ 3 ) A' 3 ( £ 3 )) = 0 , 

f 3 ^p 2 +0 V 


or lim Ajf^j) = 0, 

+0 

or lim A 1 (£ 1 ) = 0 , 

fi^l-0 

or lim A 2 ( f 2 ) = 0, 

£2^1 + 0 

or lim A 2 (£ 2 ) = 0, 

p 2 ~ 0 

or lim A 3 (^ 3 ) = 0. 

£ 3 ^ p 2 + o 


( 9 ) 


Here, we note that, in order to keep the same 
parity with respect to a coordinate plane, it is 
necessary that the boundary conditions for ^ -» 
1 — 0 and for £ 2 —> 1 + 0 as well as for £ 2 -> p 2 — 0 
and for £ 3 -> p 2 + 0 are identical. As in the case of 
ellipsoidal wave functions, the ends of the inter¬ 
vals are singular points for (6)-(8). By a proper 


change of variables, one can, however, avoid these 
singularities [1, 3 ]. 

Thus, we obtain eight boundary value prob¬ 
lems, each specified by a triplet of binary indexes 
(i x , i XJ , i z ), where i- } = 0 (7 = x, y, z ) for a particu¬ 
lar solution T'tr) of even parity with respect to the 
plane 7 = 0 [see the left column of ( 9 )], while 
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ij = 1 (; = x, y, z) corresponds to T'tr) of odd 
parity with respect to j = 0 [the right column of 

( 9 ) ]. Each particular solution can be extended 
smoothly over the whole space according to its 
parity properties [i.e., in accordance with 
(i x , i y , i z )\. Given the set of boundary conditions 
represented by the triplet (i x , i , i z ) and consider¬ 
ing Eqs. (6) and ( 7 ) on the intervals <e l x and 
£ 2 e hf we w iH arrive at the following two- 
parameter Sturm-Liouville self-adjoint problem: 
We look for such a couple of values for { h, 1 } that 
nontrivial solutions of (6) and ( 7 ) exist simultane¬ 
ously on both intervals, satisfying the above- 
mentioned conditions corresponding to the triplet 

Let a and (3 be continuous. It can be proved 
that for any multi-index (m, n) there exists pre¬ 
cisely one eigenvalue (h, /)/ such that the corre¬ 
sponding solution to (6) oscillates m times within 
I x , while the solution to ( 7 ) has (n — m) zeroes 
inside J 2 . Besides, this two-parameter problem is 
left-definite with respect to the vector |ut = (0,1); 
hence, the eigenfunctions 

H?(fi,£ 2 ) = A (1) U,)A (2) U 2 ) 

form a complete system on J? 2 (II, a), where El = 
I, X l 2 and 

or(g u & = Kfc - fl)["/(fi)/(f 2 )]" 1/2 

[ 10 ] . In passing, we note that the functions 

E”/^, f 2 ), associated with distinct points of the 
spectrum, are orthogonal, with respect to the 
weight £ 2 ), on U. Extended according to the 

triplet ( i x , i , i z ) and normalized by 

/ / [E”( g lf € 2 )] 2 <r(€ u f 2 ) d& d% 2 = 1/8, (10) 

all the eigenfunctions for all sets (i x , i , i z ) thus 
constitute an orthonormal basis in J? 2 (S, a), where 
S is an arbitrary ellipsoid defined by £ 3 = £ 3 * e I 3 . 
The functions f 2 ) specified in this way are 

determined uniquely up to a sign. Any two func¬ 
tions H”, E”', with m ¥= m' and/or n =4 n', are 
obviously orthogonal (if they are subject to the 
same boundary conditions, they are orthogonal on 
each part of S situated within a coordinate octant; 
otherwise, their inner product vanishes on S be¬ 
cause they are of different parity). 

The completeness of the function system, de¬ 
fined above, follows from the fact that each func¬ 
tion from Jz f 2 (S) may be represented as a sum 
of eight terms of distinct parity with respect to 


coordinate planes: 

u = v 0 + xv j + yv 2 + zv 3 + xyv 4 
+ xzv 5 + yzv 6 4 - xyzv 7 


[here, = v t (x 2 f y 2 , z 2 )]. One can therefore ex¬ 
pand each term in the above equality as a series in 
the functions 5 J”( Cu £ 2) °f the corresponding par¬ 
ity due to their completeness on Jzf 2 (Il, a). The 
expansion holds for all coordinate octants and the 
sum of all the eight series yields then the expan¬ 
sion of the original function in terms of E™(£i/ £ 2 ) 
(for more details, see also [ 3 ]). 

In [8], a natural way to classify these surface 
functions in relation to the conventional classifica¬ 
tion in the spherical case was proposed. The so- 
called pco-diagrams utilize the property that for 
small co the shapes of the partial angular distribu¬ 
tion are almost independent of the geometry of the 
problem and thus that the orbitals are close to the 
spherical case (see also [11]). 

On I 3 , a nontrivial solution to (8), subject to 
"the i z boundary condition/' exists for any h and 
L If y(£ 3 ) ~ as £ 3 -» °°, with q < 1 (for sim¬ 
plicity), then (cf. the case of radial ellipsoidal func¬ 
tions) the behavior of such a solution at infinity is 
given by 


a <3) (£ 3 ) 


-7= Sin( 0) ifis + A(/, h )) 
V b 3 

l 1 \ 

+ O — , £ 3 -^°°, 

-Ur 53 


(id 


where D is of an arbitrary constant value (see 
below) and the phase A(Z, h ) expresses the influ¬ 
ence of the potential on the scattered particle mo¬ 
tion: If, e.g., a(^) = j8(^ 2 ) = y{ f 3 ) = 0 , then the 
functions n (% x ), A® „(£,), and A® „(£ 3 ) coin- 
cide with the corresponding ellipsoidal wave func¬ 
tions with the phase A(//, Zz/) given by 

A" 7 = W™,K) = -7 r\i x + L + i z \/2 (mod 7r). 

( 12 ) 


It is interesting to note that the last relation has not 
yet been proven thus far. Since we have only 
observed it in practical calculations [ 3 ], it still 
remains as a conjecture. 

As for the asymptotical relation (11), the proof 
is analogous to the one used in [2] for the radial 
ellipsoidal wave functions. Calculations of the 
functions A ( j£ „( £ x ), A (2) „(£ 2 ), and A ( ^ n (£ 3 ) are 
carried out by applying previously developed 
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techniques to calculate ellipsoidal wave functions 
(see [ 1 , 2 ]). The method for calculating the radial 
function [ 2 ] exploits the asymptotic behavior dis¬ 
played in Eq. (11). Instead of computing the solu¬ 
tion to Eq. ( 8 ), we integrated an auxiliary equation 
for a modified Priifer angle, a modification equat¬ 
ing the latter to A(Z, h ) at infinity. However, to 
speed up the calculations and to diminish the 
errors due to ODE numerical integration, we use 
additional terms of the radial function asymp¬ 
totics, allowing us to reduce the integration inter¬ 
val (see [12,13]). 


Scattering Data Expansions 

Following [14], we can now find the connection 
between the functions E(£ lr £ 2 ) anc * eigenfunc¬ 
tions of the scattering operator S = 1 4- (ik/lrr)/. 
We will further use the asymptotic representation 
of the particular solution in the form, putting 

D = 1/ v^" [see (11)], 

~ S nUi,£ 2 )^{ex P [i(kr + A™)] 
-exp[ -i(kr + A'”)]}, (r -> °°). (13) 

Note that as £ 3 tends to infinity the ellipsoidal 
coordinates as mentioned above approach the 
spherical ones, i.e., 

£ 3 (a 1 2 — b 2 ) ~ r 2 (r —> oo), (14) 

and (under the tradition of acoustics, 0 is meas¬ 
ured in relation to the large semiaxis which here 
happens to be the X-axis) 

£i£> = p 2 cos 2 e, 

(£] - 1 )( - 1 ) = -( p 2 - l)sin 2 e cos 2 <p, 

- p 2 )(£ 2 - P 2 ) = P 2 ( P 2 - l)sin 2 6 sin 2 <p 

(15) 

(see [ 6 ,7] for more details). 

The behavior of an arbitrary solution to (1) can 
now be expressed with the aid of the scattering 
operator S (defined on the unit sphere): 

1 r 

^(r, k) ~ —— jexp(zfcr)S(n, « 0 )F(-n 0 ) 

2 ikr K 

— exp( —ikr ) F ()}, (r -» oo), (16) 

where F(n 0 ) depends on the angles cp°, 6° of the 
incidence direction. 


Introducing the inversion (with respect to the 
origin) operator /, 

IF(h) = F( —«), 
we will, instead of (16), obtain 

1 

^(r, fc) ~ ——{exp (ikr)S(n,n Q )I 
2 ikr 1 

— exp(— ikr)}F(n 0 ), (r -» o°). ( 17 ) 

Rewriting (13) as 
1 

¥“(r) ~ —- {exp(ZAr) exp(2zA”0 - exp (—ikr)} 

2 ikr 

X[exp(-zA , ”)E^(f 1 / f 2 )]/ °°)/ 

and comparing it with (17), we find that 
F(w 0 ) = exp(-*A”)E”(? 1 °, f 2 °), 

where ^ £ 2 ° are connected with the angles <p°, 0 ° 
by (15) and with E"'(£i, f 2 ) being the eigenfunc¬ 
tion of SI: 

SfS;r(^°, f 2 °) - exp(2fA“)S?(f 1 ° / f 2 °). 

In view of Eq. (14) and since cr(^, £ 2 ) = 
lim^^oc [^^/(fl 2 - & 2 )£ 3 ], we have 

f 27T f [E| ”(0, cp)] 2 sin 6 d6 dip 
J o J o 

= lim —7 f f[SJ"(ra)] 2 dS 

= lim 72 / r 2 /[srcfwfa )] 2 

r "“ 

X Zq/z 2 ^£1 d £2 

= 8 / /[S”(f 1 ,f 2 )] 2 <r(f 1 ,f 2 )df 1 d £ 2 
•'n J 

(18) 

(here, S r is a sphere of radius r). Thus, the matrix 
elements of the scattering operator in terms of the 
functions E™(fi, £ 2 ) are ( as Ihe latter constitute an 
orthonormal system) 

(in, n\ S \m f , n’) 

= exp(2fA”J) (m , ?z| J |m', n') 

= (-i)' x+l 'j + ‘ z exp(2zA'JJ) (m, n \ m', n') 

= (— 1) 1 r++ * 2 exp (2 f A™) 

X ^m / m , ^n f n'\i xf i y ,i z )(i , xr i ' y , i' z ) 
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SCATTERING IN ELLIPSOIDAL COORDINATES 


or, written differently. 


oo n 


S(n,n 0 )= E EE [(-l) , * + '» + '*exp(2«A™)B“(f 1 ,f 2 )Br(^ 0 ,f 2 °)] (irf<i/ii) . 


O x ,i,j,i z ) " = 0 m -0 


Tli ere fori', the far-field or scattering amplitude is given by 


2tt 


co n 


f(„, n 0 ) = — E E E {[(-l)' I+,!,+!z exp(2iA';;) - ^ 2 )S^(^ 0 , | 2 °) 


W "° 777 = 0 




As a check, we note that for 

a(£i) = /3(£ 2 ) = y(£ 3 ) = 0, 
taking (12) into account, we get f(n, n 0 ) = 0. 


Examples and Discussion 

In Figure 2, we displayed a scattering potential, 
separable in ellipsoidal coordinates, given by the 
formula V (£ 1( | 2 , £ 3 ) = [y 0 /(£ 3 - £ 2 )(f 3 ~ ^i)l- 
As the potential goes to infinity, the corresponding 
equipotential surfaces approach the focal ellips (cf. 
the description of the ellipsoidal coordinate sys¬ 
tem). Far from the origin, the potential vanishes as 
r“ 4 and the corresponding equipotential surfaces 
approach a spherical shape. Generally, the shape 
of the equipotential surfaces does not depend on 


the value y 0 . We have here only plotted a quarter 
of each surface for three typical cases. 

The corresponding numerical results are dis¬ 
played in Figure 3. One can see here the examples 
of scattering diagrams, i.e., the plots k \f(n, n Q )\ for 
different incident directions: parallel to axis OX, 
parallel to axis OY, and parallel to axis OZ, from 
left to right The parameter values are p 2 = 2.0, 
oj == 10.0, and y 0 p 2 = -0.1, the accuracy of the 
auxiliary ODE calculations is s = 10“ 6 , and the 
maximum n = 15. We have not displayed the scat¬ 
tering amplitude for positive y 0 , since it will not 
give a large visual difference. In this particular 
case, the potential will only perturb the phase shift 
—a perturbation effect which is approximately lin¬ 
ear. From the series expansion of f(n, n 0 ), it fol¬ 
lows that its absolute value does not depend on 
the sign of the potential. This means that within 
the linear domain the amplitudes in Figure 3 would 



6=0 (p=0 0=jr/2 (p=0 0=7i/2 (p=rc/2 

FIGURE 3. Examples of k \f(n , n 0 ) I for the potential displayed in Figure 2. Various incident directions are shown from 
left to right. 
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naturally scale with the potential so that their 
shapes are preserved. 
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ABSTRACT: Electronic structure calculations, carried out at the HF/6-311G*, 
MP2/6-311G*, and BLYP/6-311G* levels of theory, reveal an unexpectedly large 
influence of electron correlation on bonding in the octasulfur dication Sg + . Whereas a 
structure with a strong transannular bond linking two sulfur atoms with localized 
positive charges is predicted within the Hartree-Fock (HF) approximation, second-order 
Moller-Plesset (MP2) and Becke-Lee-Yang-Parr density functional (BLYP) calculations 
produce delocalized dications in which the transannular interactions are completely 
missing. Rigorous analysis of the computed electronic wave functions explains this 
phenomenon, which is also observed in the HF/TZVP and MP2/TZVP optimized 
geometries of Se 8 + . In agreement with the experimental data, the endo-exo conformers 
are found to possess the lowest energies for the octachalcogen dications, in contrast to the 
corresponding neutral species in which steric interactions determine the relative energies, 
favoring the exo-exo ring conformations. None of the three theoretical approaches is 
capable of reproducing the experimental solid-state geometries of Sg + and Seg + , which 
are almost certainly distorted to a large degree by the crystal packing and counterion 
effects. The results of the present study underscore the need for a concerted experimental 
and theoretical effort to provide the definitive answer to the question of transannular 
interactions in chalcogen compounds. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 
609-616, 1997 


Introduction 

W ith its propensity to catenation (formation 
of homocyclic chains), sulfur occupies a 
unique place among the known elements. Com- 
Corresponden.ee to: J. Cioslowski. 
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bined with flexible packing in solid state, the large 
number of homocyclic compounds of sulfur gives 
rise to a multitude of allotropic modifications. So 
far, no fewer than 17 allotropes of sulfur, namely 
S 6 , S 7 (4 modifications), S 8 (3 modifications), S 9 (2 
modifications), S 10 , S n , S 12 , S 13 , S 18 (2 conformers), 
and S 20 , have been isolated in pure, crystalline 
form [1]. In fact, sulfur remains the element with 
the largest number of known allotropes, although 
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this may change in the near future with the 
prospective discoveries of new fullerenes [2]. 

The most common form of sulfur is the octasul- 
fur S 8 [1], which exists in three crystalline modifi¬ 
cations, each composed of eight-membered rings 
possessing the exo-exo conformation [Fig. 1(a)] 
[3—5]. The monoclinic a-selenium and y-selenium 
have analogous crystal structures [6, 7]. In con¬ 
trast, salts containing the S 8 + and Se 8 + dications 
form crystals in which the chalcogen homocycles 
assume the less symmetrical endo-exo conforma¬ 
tion [Fig. 1(b)] [8-11]. These salts are prepared by 
the oxidation of S 8 with a variety of reagents such 



c 


FIGURE 1. Three conformations of the eight-membered 
homocycles of sulfur and selenium: (a) exo-exo, (b) 
endo-exo } (c) endo-endo. 


as AsF 5 or SbF 5 [12]: 

S 8 + 3XF S 

-» (S 8 + )(XF 6 ") 2 + XF 3 , X = As or Sb. (1) 

The salts of the Se 8 + dication can be obtained in a 
similar way [13]. 

The short intraring S 3 -S 7 distance [Fig 1(b)] 
constitutes a prominent structural feature of the 
experimentally determined solid-state geometries 
of the S 8 + dication. The proximity of these two 
atoms, which are found to be spaced by only ca. 
2.9 A [8, 9], has given rise to the speculation of 
transannular interactions operating across the sul¬ 
fur octagons. Early theoretical studies have seemed 
to confirm the presence of such interactions. A 
short (R = 2.22 A) transannular bond has been 
found in HF/STO-3G calculations on S 8 + [14]. 
Although the properties of this bond have been 
analyzed, the discrepancy between the experimen¬ 
tal and theoretical bond lengths has been neither 
explained nor even discussed. An even shorter 
bond (R = 2.060 A) has been predicted by the 
MNDO method [15]. INDO calculations, carried 
out at the experimental geometries of the dications 
in question, have produced localized molecular 
orbitals spanning transannular bonds in S 8 + and 
Se 8 + [16]. For the latter species, no ab initio elec¬ 
tronic structure studies have ever been reported. 

The aforedescribed results clearly demonstrate 
the tendency of the Hartree-Fock (HF) method to 
grossly exaggerate the binding between the 
transannular atoms in the octachalcogen dications. 
It is therefore obvious that approximate treatments 
of electron correlation have to be invoked to pro¬ 
vide a balanced description of bonding in these 
systems. Results of such a study are reported in 
this work. In particular, relative stabilities and 
geometries of the three conformers of the S 8 + and 
Se 8 + dications are predicted. The failures of the 
earlier calculations and the difficulties encoun¬ 
tered in the present study are traced down to the 
unusual electronic structures of these species. The 
implications of the new predictions to the future 
experimental and theoretical work are discussed. 


Details of Calculations 

All the calculations described in this study were 
carried out with the GAUSSIAN G92/DFT system 
of programs [17] running on an SP2 cluster located 
at FSU/SCRI. For the sulfur homocycles, the basis 
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set of McLean and Chandler [18] augmented with 
polarization functions (as implemented under the 
name 6-311G* in GAUSSIAN 92/DFT) was used. 
Optimized geometries and vibrational frequencies 
were computed at the HF/6-311G*, MP2/6-311G* 
(frozen core), and BLYP/6-311G* levels of theory. 
Less extensive calculations were performed for the 
three conformers of Se 8 and Se 8 + , geometries of 
which were optimized at the HF/TZVP and 
MP2/TZVP (frozen core) levels of theory [19]. The 
properties of atoms in the systems under study 
were determined with the algorithm described 
elsewhere [20]. The covalent bond orders [21] and 
the AOM-localized natural orbitals [22] were com¬ 
puted from the second-order Moller-Plesset (MP2) 
one-electron density matrices. 


Results 

Analysis of the data compiled in Table I leads to 
the conclusion that the stabilities of the individual 
conformers of S 8 and Se 8 are determined mostly 
by steric effects. This conclusion is borne out by 
two observations. First, the ordering of the relative 
energies follows that of the increasing crowding 
between the nonbonded chalcogen atoms, the 
exo-exo conformers being found the most stable 
and the endo-endo ones the least. Second, the com¬ 
puted energies barely change upon the inclusion of 
electron correlation effects, implying the predomi¬ 
nance of interactions that are not specific to the 
electronic properties of any particular conformer. 
In contrast, the relative stabilities of the S 8 + and 
Se 8 + dications do not appear to be governed by 
steric crowding and are strongly affected by elec¬ 
tron correlation, the endo-exo octacycles having 
the lowest energies at all levels of theory but the 
HF energy differences being too small to account 
for the experimental failure to observe the other 
two conformers. 

The presence of unusual bonding in the dicat¬ 
ions under study is also hinted at by the relative 
magnitudes of the second adiabatic ionization po¬ 
tentials of the parent species (Table II). The com¬ 
puted MP2 and BLYP ionization potentials are 
significantly lower than their FiF counterparts, de¬ 
spite the fact that the electron correlation effects 
tend to increase in importance with the number of 
electrons, in general stabilizing the neutral systems 
more than the doubly charged ones. As expected, 
Se 8 is predicted to undergo oxidation somewhat 
more readily than S 8 . 


TRANSANNULAR INTERACTIONS IN S* + AND Se* + 


TABLE I_ 

Relative energies (kcal / mol) of the eight-membered 
homocycles of sulfur and selenium. 3 



Conformation 


Method 


Species 

(symmetry) 

HF 

MP2 

BLYP 

$8 

exo-exo (D 4d ) 

0.0 

0.0 

0.0 


endo-exo (C s ) 

8.8 

8.4 

9.0 


endo-endo (D 2d ) 

20.5 

19.1 

19.1 

Si + 

exo-exo (C 2v ) 

6.4 

11.8 

9.1 


endo-exo (C s ) 

0.0 

0.0 

0.0 


endo-endo ( C 2v ) 

1.3 

9.3 

5.8 

Se 8 

exo-exo (D 4d ) 

0.0 

0.0 

n/a 


endo-exo (C s ) 

5.3 

5.4 

n/a 


endo-endo (D 2d ) 

12.8 

11.9 

n/a 

Se| + 

exo-exo ( C 2v ) 

2.9 

9.7 

n/a 


endo-exo (C $ ) 

0.0 

0.0 

n/a 


endo-endo ( C 2v ) 

1.5 

3.5 

n/a 


a Zero-point energies not included. 


The optimized geometries of the exo-exo con¬ 
formers of the neutral species are in good agree¬ 
ment with the experimental solid-state data, 
though the bond lengths are consistently overesti¬ 
mated (Table III). This overestimation is especially 
pronounced at the BLYP/6-311G* level of theory 
(which also appears to slightly overestimate the 
bond angles) and is worse for Se 8 than for S 8 . With 
better data lacking at present, these slight discrep¬ 
ancies can be attributed with equal probability to 
the basis-set incompleteness, crystal packing ef¬ 
fects, and/or experimental errors. 

The removal of two electrons from the exo-exo 
conformers of S 8 and Se 8 results in symmetry 
lowering from D 4d to C 2v . The distances between 
the formally nonbonded atoms 3 and 7 [Fig. 1(a)] 
become shorter, whereas those involving the 1-5 
pairs increase (Table III). The ensuing unequiva- 

TABLE II_ 

Second adiabatic potentials (eV) of the S 8 and 
Se 8 molecules. 


Method 


Species 

Conformation 

HF 

MP2 

BLYP 

S 8 

exo-exo 

22.35 

21.05 

21.44 

endo -exo 

21.69 

20.18 

20.65 


endo-endo 

21.24 

20.12 

20.47 

CO 

CD 

CO 

exo -exo 

20.80 

19.39 

n/a 


endo - exo 

20.44 

18.73 

n/a 


endo -endo 

20.19 

18.60 

n/a 
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TABLE III_ 

Geometries of the exo-exo conformers of the eight-membered homocycles of sulfur and selenium. 


Interatomic distances 3 (A) Bond angles 3 (°) 


Species 

Method 

1 -2 

2-3 

3-7 

1 -5 

1 -2-3 

2-3 -4 

4-5 -6 

S 8 

HF 

2.069 

2.069 

4.719 

4.719 

107.4 

107.4 

107.4 


MP2 

2.075 

2.075 

4.763 

4.763 

108.5 

108.5 

108.5 


BLYP 

2.146 

2.146 

4.967 

4.967 

109.8 

109.8 

109.8 


exp. b 

2.051 

2.051 

n/a 

n/a 

108.2 

108.2 

108.2 


exp. c 

2.050 

2.050 

n/a 

n/a 

107.9 

107.9 

107.9 


exp. d 

2.045 

2.045 

n/a 

n/a 

107.7 

107.7 

107.7 

sr 

HF 

2.064 

2.146 

2.225 

5.685 

94.8 

103.1 

96.9 


MP2 

2.093 

2.081 

3.328 

5.423 

99.8 

105.6 

92.7 


BLYP 

2.154 

2.160 

3.134 

5.670 

98.1 

106.7 

95.3 

Se 8 

HF 

2.391 

2.391 

5.406 

5.406 

106.1 

106.1 

106.1 


MP2 

2.413 

2.413 

5.498 

5.498 

107.3 

107.3 

107.3 


exp. 6 

2.336 

2.336 

n/a 

n / a 

105.7 

105.7 

105.7 


exp! 

2.334 

2.334 

n/a 

n/a 

105.8 

105.8 

105.8 

Se§ + 

HF 

2.394 

2.497 

2.492 

6.634 

95.2 

101.4 

96.5 


MP2 

2.443 

2.447 

4.000 

6.292 

99.7 

103.9 

91.3 


a See Fig. 1 (a) for atom numbering. 

b Symmetry-averaged X-ray structure of orthorhombic a-sulfur at 100 K [3]. 
c Symmetry-averaged X-ray structure of monoclinic /3-sulfur at 113 K [4]. 
d Symmetry-averaged X-ray structure of monoclinic y-sulfur [5]. 
e Symmetry-averaged X-ray structure of monoclinic a-selenium [6]. 
f Symmetry-averaged X-ray structure of monoclinic y-selenium [7]. 


lence of chalcogen atoms produces three distinct 
bond angles and a pattern of short and long bonds 
that interleave in pairs. The computed geometries 
are extremely sensitive to the electron correlation 
effects, which equalize interatomic distances (in¬ 
cluding bond lengths) but appear to have some¬ 
what less influence on the bond angles. Particu¬ 
larly drastic changes are observed for the 3-7 
distances. Without the inclusion of electron corre¬ 
lation, these distances are predicted short enough 
to suggest the presence of strong transannular 
bonds possessing lengths comparable to those of 
the ordinary S-S and Se-Se links. On the other 
hand, both the MP2 and BLYP data seem to rule 
out such a possibility. 

The shortening of distances between the non- 
bonded atoms (compare the 3-7 and 2-5 columns 
in Tables III and IV) observed upon going from the 
exo-exo to the endo~exo conformations of S 8 and 
Se 8 produces an increase in steric repulsions, which 
in turn results in a slight lengthening of bonds on 
average. Due to the lack of experimental data, 
neither this lengthening nor the concomitant bond 
alternation can be verified at present. On the other 
hand, experimentally determined geometries are 
available for the corresponding dications [8-11]. 


Examination of these geometries reveals the very 
poor performance of the Hartree-Fock method in 
predicting the strengths of the transannular inter¬ 
actions between the atoms in positions 3 and 7 in 
the endo-exo conformers of S 8 + and Se 8 + . Unfortu¬ 
nately, the approximate inclusion of electron corre¬ 
lation effects at the MP2 and BLYP levels of theory 
does not appear to bring the predicted bond lengths 
and angles into a closer agreement with their ex¬ 
perimental counterparts. Although the computed 
MP2/6-311G* and BLYP/6-311G* S 3 -S 7 dis¬ 
tances in the endo-exo conformer of S 8 + turn out 
to be very similar, they nevertheless deviate from 
the corresponding experimental value by roughly 
the same amount as the HF/6-311G* prediction 
does, the former deviation being positive and the 
latter negative. 

The optimized geometries of the experimentally 
unknown endo-endo conformers [Fig. 1(c)] are 
listed in Table V. As mentioned above, the non- 
bonded pairs of atoms are positioned much closer 
in those conformers than in the exo-exo ones. In 
fact, the close proximity of the flag-pole atoms 
turns the D 2d endo-endo conformer of S 8 into a 
transition state at the HF/6-311G* and MP2/6- 
311G* levels of theory, although the correspond- 
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TABLE IV 


Geometries of the endo -exo conformers of the elght-membered homocycles of sulfur and selenium. 


Species 
and method 


Interatomic distances 6 

'(A) 



Bond angles 3 

o 


1 -2 

2-3 

rf 

1 

CO 

4-5 

3-7 

2-5 

1 -2-3 

< 3 - 

i 

CO 

1 

CM 

3-4-5 

4-5 -6 

00 

1 

1 

ro 

S 8 

HF 

2.073 

2.075 

2.103 

2.056 

3.991 

3.609 

103.7 

105.8 

107.1 

111.4 

110.7 

MP2 

2.081 

2.072 

2.125 

2.056 

4.147 

3.583 

104.2 

105.5 

106.7 

113.2 

112.1 


BLYP 

2.166 

2.105 

2.240 

2.115 

4.623 

3.775 

106.5 

107.8 

108.1 

116.7 

113.0 

q2 + 
°8 

HF 

2.060 

2.140 

2.106 

2.069 

2.235 

3.816 

95.1 

107.5 

98.3 

95.8 

96.3 

MP2 

2.086 

2.059 

2.097 

2.073 

3.660 

3.510 

103.1 

106.3 

106.2 

97.9 

95.9 


BLYP 

2.146 

2.136 

2.146 

2.141 

3.669 

3.709 

102.6 

107.7 

106.8 

98.5 

96.7 


exp. b 

2.055 

2.033 

2.022 

2.043 

2.860 

n/a 

99.3 

104.2 

100.7 

90.6 

95.1 


exp. c 

2.037 

2.005 

2.046 

2.038 

2.906 

n /a 

98.3 

104.0 

101.6 

94.0 

93.1 

Se 8 

HF 

2.394 

2.397 

2.418 

2.378 

4.515 

4.062 

102.5 

104.2 

105.6 

109.8 

109.5 

MP2 

2.416 

2.415 

2.455 

2.396 

4.767 

4.019 

103.1 

103.3 

104.8 

112.0 

111.2 

Se§ + 

HF 

2.393 

2.486 

2.475 

2.395 

2.490 

4.384 

95.5 

105.1 

97.4 

96.1 

96.0 

MP2 

2.433 

2.428 

2.477 

2.425 

4.763 

3.938 

104.4 

103.6 

105.8 

102.3 

96.0 


exp. d 

2.32 

2.31 

2.35 

2.30 

2.84 

n/a 

95.2 

101.3 

98.2 

93.3 

89.8 


exp. e 

2.300 

2.295 

2.319 

2.312 

2.905 

n/a 

94.6 

102.4 

99.1 

92.7 

90.0 


a See Fig. 1(b) for atom numbering. 
b Symmetry-averaged X-ray structure of (S| + ) (AsF 6 ~) 2 [8]. 
c Symmetry-averaged X-ray structure of (S^) (SbgF^) (SbF 6 “) [9]. 
d Symmetry-averaged X-ray structure of (Se^*) (AICI^) 2 [10]. 
e Symmetry-averaged X-ray structure of (Teg + ) (Se| + ) (AsF 6 ~) 6 (S0 2 ) [11]. 


ing imaginary frequencies of 16/ and 33/ cm -1 , 
respectively, are too small to rule out the possibil¬ 
ity of this system being actually a true minimum 
on the corresponding potential energy hypersur¬ 
face. An analogous phenomenon is not observed 
for the endo-endo conformer of Se 8 . At the MP2/6- 
311G* level of theory, the C 2v endo-endo con- 
former of Sg + is also a transition state with a small 
imaginary frequency of 41/ cm -1 . Again, large 


differences between the HF and MP2 geometry 
predictions are found for S 8 + and Se 8 + . 

In summary, all the three theoretical methods 
employed in this study are successful in predicting 
both the energetics and geometries of the neutral 
octachalcogen species. For the corresponding dicat¬ 
ions, the inclusion of electron correlation within 
the MP2 and BLYP approximations suffices to pro¬ 
duce correct relative stabilities but fails to yield 


TABLE V_ 

Geometries of the endo -endo conformers of the eight-membered homocycles of sulfur and selenium. 


Species 

Method 


Interatomic distances 3 (A) 


Bond angles 3 (° 

) 

1 -2 

CO 

i 

CM 

3-7 

1 -5 

1 -2-3 

"3- 

i 

CO 

i 

CM 

4-5-6 

^8 

HF 

2.083 

2.083 

3.464 

3.464 

107.7 

112.8 

112.8 


MP2 

2.091 

2.091 

3.459 

3.459 

106.7 

113.3 

113.3 


BLYP 

2.162 

2.162 

3.686 

3.686 

107.2 

115.6 

115.6 

q2 + 

°8 

HF 

2.065 

2.118 

2.201 

3.665 

100.3 

117.0 

96.1 


MP2 

2.086 

2.065 

3.475 

3.466 

110.1 

120.7 

99.6 


BLYP 

2.143 

2.138 

3.223 

3.797 

108.8 

123.0 

97.7 

Se 8 

HF 

2.402 

2.402 

3.869 

3.869 

106.4 

114.4 

114.4 


MP2 

2.428 

2.428 

3.866 

3.866 

104.9 

112.0 

112.0 

Se| + 

HF 

2.394 

2.477 

2.470 

4.142 

99.2 

114.3 

96.4 


MP2 

2.411 

2.437 

6.418 

4.804 

116.5 

120.6 

115.4 

a See Fig. 

1 (c) for atom numbering. 
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accurate geometries. This failure is particularly 
conspicuous for the distances between the atoms 
supposedly engaged in transannular interactions, 
for which the discrepancies between the "best" 
theoretical predictions and the experimental val¬ 
ues are as large as 0.7 A (Table IV)! These trouble- 
somely large errors stem from a quite unusual 
coincidence of several unfavorable factors. First of 
all, the available experimental geometries [8-11] 
are of inferior quality by the contemporary stan¬ 
dards. Even more importantly, the potential en¬ 
ergy hypersurfaces of the dications in question are 
very flat in the directions corresponding to a lat¬ 
eral squashing of the rings. For example, in the 
endo-exo conformer of S 8 + , the frequency of the A 
vibration that involves predominantly the S 3 -S 7 
stretch is predicted to be equal 363 and 75 cm" 1 at 
the HF/6-311G* and MP2/6-311G* levels of the¬ 
ory, respectively. Consequently, the across-the-ring 
distances are expected to be quite sensitive to 
small perturbations. With respect to the experi¬ 
mental data, this sensitivity implies a substantial 
influence of the crystal packing and counterion 
effects on the measured solid-state geometries. At 
the same time, relatively small errors in the com¬ 
puted energies and energy gradients are bound to 
translate into large deviations in the optimized 
intraring distances. 

Rigorous analysis of the computed electronic 
wave functions elucidates the reasons behind the 
observed dramatic effects of electron correlation 
on bonding in the octachalcogen dications. The 
inspection of the AOM-localized natural orbitals 
(LNOs) in the endo-endo conformer of Se 8 reveals 
the presence of eight equivalent LNOs describing 
the Se-Se bonds (LNOs 129-136 in Table VI) in 
addition to two unequivalent lone-pair LNOs per 
each atom (LNOs 113-128). All of these LNOs are 
almost doubly occupied. The covalent bond orders 
equal 1.186 for the bonded pairs of atoms and 
0.083 for the Se 3 -Se 7 and Se 1 -Se 5 nonbonded pairs. 


At the MP2/TZVP level of theory, the removal of 
two electrons from this system produces a delocal¬ 
ized dication, as indicated by the computed atomic 
charges of 0.1842 (Sej and Se 5 ), 0.2486 (Se 2 , Se 4 , 
Se 6 , and Se 8 ), and 0.3186 (Se 3 and Se 7 ). The bond 
alternation observed in the optimized geometries 
(Table V) is reflected in the covalent bond orders 
of 1.229 (Se 1 -Se 2 , etc.) and 1.164 (Se 2 -Se 3 , etc.). At 
the same time, the covalent bond orders of 0.031 
and 0.124, obtained for the Se 1 -Se 5 and Se 3 -Se 7 
pairs of atoms, respectively, rule out the presence 
of any significant transannular interactions. 

In principle, two qualitatively distinct types of 
bonding in the dications under study are possible. 
On one hand, the removal of two electrons from 
two lone-pair LNOs located across the ring may 
yield an orbital bonding of two chalcogen atoms 
with localized positive charges (as in structure 1). 
Such bonding can easily overcome the strong elec¬ 
trostatic repulsion between the charges (the He 2 + 
dication with R He -He = 0-741 A being a good ex¬ 
ample) [23]. On the other hand, the two electrons 
may come from several lone-pair LNOs, producing 
a dication with a delocalized positive charge (as in 
structure 2). 


S + s 

c T c 

v V 

s + s 


s—s 
2 + 


s—s 

2 


S 

S 


In light of the above discussion, the exaggera¬ 
tion of transannular interactions observed within 
the Hartree-Fock approximation is attributable to 
an excessive localization of the positive charge. 
This spurious localization is readily explainable by 
the fact that the occupancies of the Hartree-Fock 
localized orbitals are restricted to either zero or 
two. In other words, the removal of two electrons 
from the lone-pair orbitals must produce either a 


TABLE VI 


MP2/TZVP valence localized orbitals in the endo-endo conformer of Se 8 . 


Orbitals 

Occup. 

Loc. Num. 

-T( a.u.) 

Description 8 

113-116 

1.982 

1.027 

-3.713 

Lone pairs of atoms 2, 4, 6, and 8 (98.7% loc.) 

117-120 

1.983 

1.030 

-3.682 

Lone pairs of atoms 1,3, 5, and 7 (98.5% loc.) 

121 -124 

1.981 

1.171 

-2.175 

Lone pairs of atoms 1,3, 5, and 7 (92.3% loc.) 

125-128 

1.981 

1.163 

-2.151 

Lone pairs of atoms 2, 4, 6, and 8 (92.6% loc.) 

129-136 

1.957 

2.072 

-2.112 

Covalent Se-Se bonds (98.2% loc.) 

a See Fig. 1(c) for atom numbering. 
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new bond or multicentric bonding orbitals, the 
latter possibility (which would be compatible with 
the structure 2) being highly unlikely because of 
the angular misalignment of the lone pairs. On the 
other hand, fractional occupancies of LNOs are 
possible once electron correlation is turned on, 
favoring the delocalized picture of bonding. Such 
fractional occupancies are indeed observed in the 
MP2/TZVP LNOs of the endo-endo conformer of 
Seg + (Table VII). 

Almost certainly, the actual bonding in the Sg + 
and Seg + dications is closer to that represented by 
structure 2 than to that portrayed by structure 1. 
Consequently, because of the relatively low occu¬ 
pancies of the LNOs, the actual ground-state wave 
functions of these species are expected to possess 
considerable multiconfigurational character, neces¬ 
sitating the use of high-level treatments such as 
CAS SCF or CCSD(T) for quantitative predictions 
of electronic properties and geometries. Because of 
the computer hardware limitations, such calcula¬ 
tions are not feasible at present. One should also 
note in passing that the electron correlation effects 
described in this work are quite unique, differing 
in their nature from the dispersive effects encoun¬ 
tered in the description of the attractive closed-shell 
interactions between chalcogen atoms [24]. 


Conclusions 

Transannular interactions in the octachalcogen 
dications pose a severe challenge to the modern 
techniques of computational quantum chemistry. 
The combination of flat potential energy hypersur¬ 
faces and unexpectedly large electron correlation 
effects renders the approximate approaches such 


as MP2 and BLYP inadequate for the prediction of 
the equilibrium geometries of these species. On the 
other hand, the relatively large size and low sym¬ 
metry of the dications in question preclude the use 
of more sophisticated methods such as CAS SCF or 
CCSD(T) at present. 

It is clear from the results reported in this study 
that the definitive answer to the question whether 
the transannular interactions in S§ + and Seg + are 
indeed reality or just an artifact will ultimately 
require a coalescence of experimental and theoreti¬ 
cal research. On the experimental front, precise 
gas-phase geometry determinations will be neces¬ 
sary in order to eliminate the ring distortions 
caused by the crystal packing effects and counteri¬ 
ons. At the same time, large-scale ab initio elec¬ 
tronic structure calculations involving compar¬ 
isons among several high-level electron correlation 
treatments will have to be carried out. 

The present work uncovers the reasons behind 
the failures of the previously published calcula¬ 
tions to correctly predict the transannular chalco- 
gen-chalcogen distances in the Sg + and Seg + dica¬ 
tions. The descriptions of bonding in these systems 
are completely different at the HF and correlated 
levels of theory, the latter appearing to be closer to 
reality. If present at all, the transannular interac¬ 
tion are weak and, stemming from a delicate bal¬ 
ance between charge delocalization and localized 
bonding, certainly distinct from conventional 
bonds. 
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TABLE VII_ 

MP2/TZVP valence localized orbitals in the endo-endo conformer of Se% + . 


Orbitals 

Occup. 

Loc. Num. 

- T (a.u.) 

Description 3 

113-114 

1.979 

1.036 

-3.714 

Lone pairs of atoms 3 and 7 (98.3% loc.) 

115-118 

1.979 

1.038 

-3.681 

Lone pairs of atoms 2, 4, 6, and 8 (98.2% loc.) 

119-120 

1.975 

1.034 

-3.671 

Lone pairs of atoms 1 and 5 (98.4% loc.) 

121 -122 

1.988 

1.278 

-2.181 

Lone pairs of atoms 1 and 5 (88.3% loc.) 

123-126 

1.929 

2.081 

-2.158 

2-3 (etc.) bonds (10.0% ion., 97.5% loc.) 

127-130 

1.926 

2.094 

-2.142 

1 -2 (etc.) bonds (6.1% ion., 97.5% loc.) 

131 -134 

1.818 

1.292 

-2.178 

Lone pairs of atoms 2, 4, 6, and 8 (87.8% loc.) 

135-136 

1.389 

1.313 

-2.243 

Lone pairs of atoms 3 and 7 (87.0% loc.) 


a See Fig. 1(c) for atom numbering. 
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ABSTRACT: Angular-resolved kinetic energy distributions of fragments resulting 
from dissociation induced by intense, short, linearly polarized laser pulses are calculated 
using an accurate three-dimensional Fourier transform method in spherical coordinates. 
The rotational excitation of the molecule leads, in general, to an alignment of the 
photofragments with respect to the field polarization vector. But, unexpectedly, increasing 
the field strength may also produce fewer aligned fragments at the higher kinetic 
energies of the multiphoton above the threshold dissociation spectrum. 
photodissociating by interaction with an Nd:YAG laser at A = 532 nm and for intensities 
of 10-50 TW/cm 2 is taken as an illustrative example, for which some angular-resolved 
experimental spectra are available. A comprehensive interpretation is provided within 
the field-dressed Floquet picture by referring to two strong-field mechanisms, namely, 
the potential barrier lowering (also responsible for bond softening) and the nonadiabatic 
transitions (also responsible for the vibrational trapping). © 1997 John Wiley & Sons, Inc. 
Int J Quant Chem 65: 617-624, 1997 


Introduction 

P rogress in laser technology producing very 
intense and short pulses has motivated a 
completely new formulation for the description of 
multiphoton dissociation dynamics of molecules 
[1]. Very complicated molecular behavior is ob¬ 
served as a response to the electromagnetic field. 

Contract grant sponsor: Institut du Developpement et des 
Ressources en Informatique Scientifique I.D.R.I.S. 

Contract grant number: 940425. 

International Journal of Quantum Chemistry, Vol. 65, 617-624 (1997) 
© 1997 John Wiley & Sons, Inc. 


Repeated stimulated photon absorption and emis¬ 
sion mechanisms lead to above-threshold dissocia¬ 
tion (ATD), evidenced by the observation, in the 
fragment kinetic energy distributions, of peaks 
spaced by one quantum of the photon energy [2, 
3]. In addition to scalar observables such as total 
or partial dissociation rates or the energy spectra 
of fragments, vectorial observables such as frag¬ 
ment angular distributions provide complemen¬ 
tary views for a detailed understanding of the 
strong laser-induced dissociation dynamics. Two 
possibilities have so far been invoked for the inter¬ 
pretation of aligned fragments with respect to the 

CCC 0020-7608 / 97 / 050617-08 
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field polarization vector, namely, an enhancement 
of the dissociation cross section for initially aligned 
molecules [2, 4, 5] or a dynamical alignment occur¬ 
ring during the fragmentation process as a conse¬ 
quence of the torque that the laser field exerts on 
the molecule [6-8], The extensive use of one-di¬ 
mensional models, where the fragmentation of 
diatomic molecules is merely described by a single 
degree of freedom, relies upon the first assump¬ 
tion [2, 9, 10]. The possibility of a dynamical align¬ 
ment motivates the development of models where 
rotational degrees of freedom are fully included 
[11-14]. 

The most popular conviction so far experimen¬ 
tally supported by different observations is that, 
whatever the mechanism is, molecules subjected to 
an intense, linearly polarized laser radiation disso¬ 
ciate into fragments aligned with the polarization 
axis. There is, however, experimental evidence, on 
H 2 multiphoton dissociation, that protons are 
emitted along the polarization axis in narrow dis¬ 
tributions, becoming broader as intensity increases 
[6]. The bond-softening (BS) mechanism resulting 
from laser-induced lowering of a potential energy 
barrier supports the interpretative scheme of the 
alignment process. More recently, we, conversely, 
pointed out that the vibrational trapping (VT) 
mechanism may be responsible for fragmentations 
occurring in a plane nearly orthogonal to the polar¬ 
ization axis [14]. This surprising prediction has 
been confirmed by quantum calculations on Hj 
with an excitation wavelength of A = 80 nm, pro¬ 
ducing a curve-crossing situation in a field-dressed 
molecule picture occurring at the left of the equi¬ 
librium position. The detailed understanding of 
such a mechanism involves very efficient trapping 
in all spatial directions except a direction quasi¬ 
perpendicular to the applied field where fragmen¬ 
tation may take place. 

The aim of the present work was to understand 
another somewhat unexpected observation in Hj 
of less aligned photofragments for stronger fields 
at a wavelength of A = 532 nm [6], by calculating 
the angular-resolved kinetic energy spectra and 
interpreting them in terms of the interplay of 
field-induced chemical bond softening and harden¬ 
ing (trapping) mechanisms. The article is orga¬ 
nized as follows: The next section provides a brief 
theoretical survey of a time-dependent quantum 
model where an initial (vibrational-rotational) 
state of Hj is propagated on its two lowest elec¬ 
tronic states, namely, 2 2* and 2 2/. The linearly 
polarized laser imposes a privileged direction in 


the laboratory frame with respect to which the 
molecular axis is positioned. For short internuclear 
distances defining an inner region, a Fourier trans¬ 
form spectral method is used in spherical coordi¬ 
nates by combining a contact transformation with 
Dateo and Metiu's [13] grid approach for angular 
degrees of freedom. For the large distances of the 
outer region, an analytic approach, which is a 
three-dimensional generalization of Volkov-type 
solutions of [15], is adopted. The third section is 
devoted to the interpretation of the angular-re¬ 
solved fragment kinetic energy spectra of FI J irra¬ 
diated by an intense pulse (I = 10-50 TW/cm 2 ) of 
A = 532 nm delivered by a Nd:YAG laser source. 
Special attention is directed toward the under¬ 
standing of the alignment properties of the experi¬ 
mental spectra of [6], in terms of multiphoton 
processes involving adiabatic potential energy bar¬ 
riers (BS situation) and nonadiabatic transitions 
(VT situations) affecting molecular dynamics dur¬ 
ing dissociation. 


Theory 


Our purpose was to solve the nuclear time-de¬ 
pendent Schrodinger equation on a Born-Op- 
penheimer level of approximation involving two 
electronic states labeled g (ground, 2 2*) and u 
(excited, 2 2/) following, for convenience, the stan¬ 
dard symmetry notations of H 2 : 



/ 7 ' 


H 


/ 7 \ 

% 

\k, 


(i) 


where the total molecule-plus-field Hamiltonian is 
H = l{f R + f 0 + f^) + W(R, 0; f), (2) 


1 being the identity matrix, and R , 6 , and cp, the 
spherical coordinates of the internuclear vector R 
in the laboratory frame [14]. Kinetic terms are 
given by 



h 2 d 
~ ImR 2 ~dR 
h 2 1 d 

ImR 2 sin 6 dO 


R 2 


d 

dR 





ft 2 1 d 2 

^ ImR 2 sin 2 6 dcp 2 ' 


(3) 

(4) 

( 5 ) 
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where m represents the reduced mass. The radia¬ 
tive coupling written in the length gauge involves 
the scalar product of the dipole moment Jl (paral¬ 
lel to R for a £ -> 2 transition, as is the case of 
Hj) and the electric-field vector s which is taken 
along the laboratory Z-axis: 


the cosine transform, which, in principle, assumes 
even parity, may lead to some numerical unstabili¬ 
ties and inaccuracies in angular-resolved spectra 
(due to the nonzero derivatives of the M N = ± 1 
wave function at 6 = 0, 77 ) [17]. The contact trans¬ 
formation, defined by 


/2 • s = fJi(R) cos 6s(t) cos cot. ( 6 ) 

The laser pulse carrier-wave frequency is co and its 
envelope s(t) has a Gaussian shape: 

e(t) = e 0 e- {t -^ )2/T2 (7) 

of half-width r. The potential part contains the 
field-free molecular potentials V as diagonal ele¬ 
ments and the matter-field couplings as nondiago¬ 
nal terms: 

~ / V g (R) ~fi(R)cos 0s(t)cos cot \ 

\ —/jl(R)c os Os(t )cos cot V u (R) ) 

( 8 ) 

As clear from Eq. ( 8 ), the motion associated with 
the azimuthal angle <p remains separated under 
the action of the (^-independent potential term and 
M n , the Z-projection of the total rotational angu¬ 
lar momentum N f is a good quantum number 
describing the invariance through rotation about 
the field-polarization vector. The potential-energy 
curves as well as the electronic transition moment 
are taken from Bunkin and Tugov's fit [16]. The 
initial state of Hj ( 2 £^) with an isotropic ensem¬ 
ble of para- and ortho- hydrogen in equilibrium 
involves two components with N = 1, M N = 0 
(statistical weight of 1 /3), and N = 1, M N = +1 
(statistical weight of 2/3), each of them having to 
be propagated separately and further combined 
using their respective weights [14]. 

Fourier transform methodology is one of the 
most popular approaches to solve Eq. (1), the an¬ 
gular variables being treated either by using basis- 
set expansions or grid techniques. This last ap¬ 
proach has recently been significantly improved 
by Dateo and Metiu [13] with the implementation 
of a unitary Cayley scheme for T e combined with a 
Feit-Fleck split operator technique. The action of 
potential and kinetic operators on the wave func¬ 
tion is evaluated through a combination of coordi¬ 
nate and momentum representations using an ex¬ 
ponential Fourier transform for R and a cosine 
transform for 0 . Although very accurate results are 
obtained for M N = 0, it turns out that for M N = +1 


'I'mS r > <Pf 0 = 


RVsin 0 




(9) 


is a way out of this difficulty; the numerical unsta¬ 
bilities related to the cosine transform performed 
on if/ MN can easily be removed by applying the 
procedure to $ Mjv . The propagation technique, 
previously described [13, 14], is based on the re¬ 
peated use of short-time propagators: 

e ' 1 

= expl- j[T r + T e + W MN (t)]dt\ 

X<J> Mn (R,0;O, (10) 

where kinetic operators are merely 

Tr ~ ~ 2m ~dB} 

^ fi 1 1 d 1 

Te ~ ~ 2m JO Je 1 


(ID 

( 12 ) 


and the effective potential is 


W MN (R,6-,t) = W(R,0;t ) 


h 2 

2 mR 2 


+ 


ft 1 

ImR 2 


(M 2 - 1/4) 
sin 2 6 


. (13) 


Since the three operators T R , T e , and do not 
commute, the exponential in [Eq. (10)] is split into 
five terms, using Moyal's formula [18] at the 
third-order approximation 0(8t 3 ). 

The calculation of kinetic energy spectra in¬ 
volves, in principle, the propagation of the full 
wavepacket on very large grids, during the total 
pulse duration, such that the postpulse analysis 
leads to free-fragment properties. The overall pro¬ 
cedure is very time-consuming, especially for a 
molecule as Hj (odd charged homonuclear ion) 
due to its asymptotically divergent coupling with 
the field. To overcome this difficulty, the wave 
function is split into two regions: an internal and 
an asymptotic one. The latter is analyzed analyti- 
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cally by a three-dimensional generalization of the 
Volkov-type solutions of [15], while the above- 
mentioned numerical propagation is carried out on 
the former. Angular-resolved fragments probabil¬ 
ity distributions are then given by 


lP(k, 6)dk 


= lim 

t 


oo L 


(A:, 0 ;t )I + ipuik, 6 ; t 


dk, 

(14) 


where (f -> oo) means that not only the laser pulse 
is switched off, but also that there is no outgoing 
continuum components of the inner part wave- 
packet. i p g A u represent the Fourier transform of the 
asymptotic region wavepacket components on the 
g and u channels, respectively. 


Results 



FIGURE 1. Field-dressed potential-energy curves of H 2 
(A = 532 nm), for an intensity of / - 10 13 W/cm 2 , in the 
diabatic (solid lines) and adiabatic (broken lines) frames. 
Curve-crossing regions are outlined by rectangular boxes 
XI, X2, and X3. The energy of the v = 4 vibrational level 
is indicated by the thin horizontal line. 


The angular-resolved fragments spectra are first 
analyzed with respect to their kinetic energy distri¬ 
butions. In the second step, the basic mechanisms 
leading to the alignment dynamics are presented 
and the part that they are playing is illustrated in 
some examples. Two strong-field intensities are 
considered, namely, 10 13 and 5 X 10 13 W/cm 2 , at 
a wavelength of A = 532 nm, with pulse rise and 
fall off times of r = 30 fs and the maximum at 
t 0 = 75 fs, in order to follow as closely as possible 
the experimental situation [6]. The field-dressed 
potential energy curves involved in the two main 
Floquet blocks are displayed in their diabatic and 
adiabatic representations in Figure 1. The adiabatic 
representation showing avoided curve-crossings 
results from the diagonalization of the radiative 
interaction for the intensity 10 13 W/cm 2 and a 
fixed molecule-field orientation (9 = 0 or 7 r, which 
maximizes the coupling). The curve-crossing re¬ 
gions upon which the interpretation rests are indi¬ 
cated by rectangular boxes: XI between the ground 
and the one-photon channels, X2 between the 
ground and the two-photon channels, and X3 be¬ 
tween the two- and three-photon channels. Three 
initial vibrational states r of Hj are taken into 
account, their energies corresponding to some spe¬ 
cific situations with respect to the height of the 
adiabatic barrier. At the XI avoided crossing v — 2 
lies below and v = 5 above this barrier for both 
field strengths; as for v = 4, it lies above the bar¬ 
rier for the highest intensity and below for the 
lowest. 


KINETIC ENERGY DISTRIBUTIONS 

Three-dimensional viewgraphs of the spectra 
are given for I — 10 13 W/cm 2 in Figure 2. A 
behavior which is typical for ATD can be detected 
in terms of peaks, in the proton kinetic energy 
distributions, separated by the photon energy [2, 
3]. The branching ratios (relative heights of the 
so-called one-, two- or three-photon peaks) reflect 
the energy positioning of the initial states. For 
v = 2, the potential barrier is too high at XI for the 
dissociation to proceed by the absorption of a 
single photon. A third-order matter-field interac¬ 
tion at X2 gives rise to dissociation by absorbing 
two or three photons. For the intensity under con¬ 
sideration, an important nonadiabatic transition at 
X3 results in a competition between the two- and 
three-photon peaks. The situation is completely 
different for v = 5; the field-induced barrier low¬ 
ering at XI favors the single-photon process, with 
the one-photon peak in the spectra. The case of 
v = 4 is intermediate: There is still some competi¬ 
tion between the barrier lowering (XI) and the 
nonadiabatic transition (X2) mechanisms, leading 
to a spectrum displaying three peaks with a major 
one-photon peak contribution. 

ANGULAR DISTRIBUTIONS 

Alignment processes are very sensitive to the 
dynamics which takes place at the crossing regions 
XI, X2, and X3. They are basically controlled 
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FIGURE 2. Angular-resolved kinetic energy spectra for the initial vibrational level v = 2 [panel (a)], v = 4 [panel (b)] 5 
and v - 5 [panel (c)] of HJ irradiated by a A = 532 nm laser delivering an intensity of / = 10 13 W/cm 2 . Panel (d) 
gathers angular distributions in polar representation, for each v, of fragments contributing to the first photon peak (thick 
solid line) to the second photon peak (thin solid line) or the third photon peak (broken line). 


by two mechanisms: the barrier lowering and 
the nonadiabatic transition. Let us examine them 
separately: 

Barrier Lowering 

In strong-field situations, the adiabatic potential 
barrier is lowered in an efficient way proportional 
to the radiative coupling, i.e., y//x(jR)cos 0. For a 
given intensity, this lowering is much more impor¬ 
tant for cos 6 ~ 1, i.e., for 0 ~ 0 or it. For cos 0 ~ 0 
(0 ~ tt/2), the high potential barrier is hardly pen¬ 
etrable. The torque, which is exerted by the laser 
on the molecule, allows the wavepacket to skirt 
around the potential barrier and find the mini¬ 
mum-energy pathway toward the dissociative val¬ 
ley (i.e., for directions close to 0 = 0 or 7r). In later 


steps, the large distance dynamics is no longer 
affected by an additional torque (due to the R~ 2 
factor in T 0 [Eq. (12)]), such that the angular distri¬ 
bution is close to its value at the position XI of the 
barrier, where alignment with the field (0 = 0 
or 77) is favored. 

Nonadiabatic Transition 

In avoided crossing regions X2 and X3, the 
dynamics proceeds via a nonadiabatic jump from 
the upper attractive to the lower repulsive adia¬ 
batic surfaces. This is monitored by a nonadiabatic 
kinetic interstate coupling located at the avoided 
crossing point and presenting an amplitude maxi¬ 
mum for 0 ~ 77 / 2 . We recall that strong radiative 
interactions in the diabatic frame, either for 0 = 0, 
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7 r or for high fields, correspond, accordingly, 
to small kinetic, nonadiabatic couplings respon¬ 
sible for the dissociation in the adiabatic frame. 
Finally, less aligned fragments are expected when 
nonadiabatic transitions are responsible for the 
dissociation. 

In realistic situations, it is in the interplay of the 
two mechanisms that the interpretation is to be 
looked for. A direct consequence of the previous 
analysis remains, however, that the one-photon 
peak, resulting from the barrier-lowering mecha¬ 
nism (XI), leads to aligned fragments, whereas the 
two-photon peak resulting from a single nonadia¬ 
batic transition (X2) is characterized by less aligned 
fragments. The protons with a velocity distribution 
within the three-photon peak are even less aligned, 
as two nonadiabatic transitions (X2 and X3) are 
involved in their dynamics. This is clearly illus¬ 
trated in Figure 2 for all three initial states v = 2, 
4, or 5. Another direct consequence is the intensity 
dependence of the alignment. An increase in inten¬ 
sity produces more efficient barrier lowering, re¬ 
sulting in more aligned fragments when the lead¬ 
ing mechanism is precisely the tunneling at XI. 
This can be seen in the angular distributions of the 
one-photon peaks of v = 4 and 5 displayed in 
Figure 3. Alignments are efficiently increased when 
going from 10 13 to 5 X 10 13 W/cm 2 • v = 5, with 
an energy well above the adiabatic potential barri¬ 
ers for this range of intensities, is less sensitive to 


the tunneling mechanism. The increase in its align¬ 
ment is much less than v = 4. Conversely, when 
nonadiabatic transitions monitor the dynamics, 
distributions are less aligned at high fields. This is 
the case for v = 4 and 5 in Figure 3 and corrobo¬ 
rates the experimental observation of [6]. 

More detailed analysis is required for a thor¬ 
ough interpretation of some observations on the 
alignment patterns in terms of the role played by 
the initial vibrational state v or the intensity-de¬ 
pendent relative contribution of the initial angular 
distribution (i.e., M N — 0 or ± 1). Two arguments 
can be invoked to support this analysis: 

(i) Different initial vibrational states give rise 
to different kinetic energies during the dis¬ 
sociation process. A wavepacket issued from 
a high v presents a high kinetic energy 
distribution and feels the alignment effects 
of the X2 and X3 avoided crossing regions 
in a reduced way. It is also less sensitive to 
the long-range dynamics of the two-photon 
channel surface which tends to favor the 
6 ~ 7t/ 2 direction for the fragmentation. 
This is why the fragments of a given kinetic 
energy (i.e., within a given peak) are more 
aligned for higher initial vibrational states. 
Figure 2 shows, precisely, that fragments 
contributing to the two-photon peak are 
more efficiently aligned for v = 5 than for 


13 2 

10 W/cm 


5.10° W/cm 



FIGURE 3. Angular distributions, in polar representation, of photons contributing to the first (thick solid line), second 
(thin solid line), or third photon peaks (broken line) and resulting from the photodissociation of HJ (v = 4 and 5) for two 
field strengths / = 10 13 and 5 x 10 13 W/cm 2 (A = 532 nm). Note that the relative contributions of the three peaks to the 
angular distributions are different, the scales being indicated by the numbers (in arbitrary units) on the coordinate axis. 
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v = 4. The same observation is valid for the 
three-photon peak whose alignment in¬ 
creases with v, 

(ii) To obtain results corresponding to an initial 
isotropic ensemble of molecules, an incoher¬ 
ent sum of two probabilities based on two 
initial angular distributions Y 1O (0, <p) (i.e., 
M n = 0) and Y n (0, <p) (i.e., M N = ±1) is to 
be performed [Y N/M (0, <p) designates the 
spherical harmonics and Y x _ 1 leads to the 
same result as Y u because the total Hamil¬ 
tonian does not depend upon the sign of 
M n , as is clear from Eq. (13)]. Having in 
mind that Y 10 behaves as cos 0 and Y n as 
sin 0, the potential barrier, with a maxi¬ 
mum at 0 = 7t/ 2, is hardly penetrable for 
Y u . It is only for very strong fields that the 
barrier is lowered enough for the M N = ± 1 
component to be in competition with the 
M n = 0 component. The consequence is that 
for high intensities (or for vibrational levels 
above the barrier) fragments resulting from 
an initial Y n distribution may significantly 
contribute to the spectrum, producing less 
aligned distribution patterns. This argu¬ 
ment may also contribute to support the 
fact observed in Figure 3 that at high inten¬ 
sity, I = 5 X 10 13 W/cm 2 , the one-photon 
peak is less aligned for v = 5 than for v = 4 
(the potential barrier is more transparent for 
the Y n component of the higher-energy state 
v = 5 as compared to v = 4). 

We finally note that the polar plots of some 
angular distributions show, at the high intensity of 
Figure 3, off-axis structures for both v = 4 and 5. 
Such structures have already been experimentally 
evidenced in similar contexts, i.e., in the angular 
distribution of the photoelectron ATI spectrum of 
xenon [19] and the angular distributions of 0 + and 
S + from dissociation of C0 2 or CS 2 [20]. In the 
latter case, an interpretation based on molecular 
pendular states arising from the large field-in¬ 
duced polarizabilities of these systems is given. In 
the former case, the mechanism which is invoked 
is rather close to the one monitoring the dissocia¬ 
tion dynamics of the ATD regime. More precisely, 
the rings of the ionization dynamics are inter¬ 
preted in terms of rescattering from the ion core of 
the tunneling component of the continuum wave 
function [19]. Here, it is the interferences (en¬ 


hanced by the high-field intensity) between the 
short-distance nonadiabatic transition (responsible 
of the multiphoton excitation at X2) and the large 
distance tunneling (responsible of the dynamics at 
XI) which are at the origin of the mixing of Y 10 
and Y u distributions and of bond softening and 
vibrational trapping mechanisms manifesting 
themselves as structures in the strongly anisotropic 
angular distributions of dissociation products. 

As a word of conclusion, the widely discussed 
alignment of the fragments resulting from a multi¬ 
photon dissociation process is basically related 
to an adiabatic potential energy barrier lowering 
(which is also responsible for the bond softening). 
When using intense fields, there are, however, 
other mechanisms which also lead to dissociation, 
namely, the nonadiabatic transitions (as the ones 
involved in vibrational trapping). These transitions 
favor the 6 = tt/2 direction in the dissociation 
dynamics. The most original finding is the possi¬ 
bility, at least for some wavelengths [14] or for 
some kinetic energies, to observe photofragments 
in a plane orthogonal to the polarization axis, a 
possibility enhanced by the high intensity. This is 
the first step aimed at the detailed interpretation 
of the intensity dependence of angular-resolved 
spectra and, in particular, to the understanding of 
some experimental observations of less aligned 
fragments with increasing field strength [6]. Work 
is in progress in our group for a more detailed 
study of the experimental data of [6]. 
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Introduction 

I n recent years, interest in the surface chemistry 
of metal oxides has rapidly grown. The en¬ 
hanced attention is related to different types of 
catalytic reactions on oxide surfaces and the prepa¬ 
ration of thin films of these materials with impor¬ 
tant technological applications [1-11]. Metal ox¬ 
ides, such as ZnO, Sn0 2 , and Ti0 2 , are materials 
with varistor properties that are used as semicon¬ 
ductor devices which present high nonlinearity 
between the current density and electric field, thus 
being suitable for transient voltage suppression 
[1-3]. Most of the chemistry of metal oxides is due 
to the presence of highly reactive defective sites 
such as cation or anion vacancies and/or a doping 
process by substitution of metal cations. As most 
of the surfaces studied experimentally contain de¬ 
fects resulting from preparation conditions or from 
ionic or electronic bombardment, one is led, after 
the theoretical study of ideal surfaces, to consider 
the case of surfaces containing defects. Once the 
vacancies are formed, their electronic nature can 
be changed by adding or removing electrons. 
Therefore, the electronic behavior is the key con¬ 
cept for understanding the operation. 

In particular, different research groups [2-5] 
have studied doped Ti0 2 , ceramics having useful 
varistor properties, with nonlinearity coefficients 
(a) values in the range of a = 3-12, a being 
defined by the relation I = KV a , where I is the 
current; V, the voltage; and K, a proportionality 
constant [8]. The presence of this potential barrier 
is due to the creation of defects formed during 
sintering of Ti0 2 systems. The lack of lattice peri¬ 
odicity due to intrinsic defects leads to a high 
density of structural defects that can originate a 
potential barrier associated to a double space 
charge distribution and this phenomenon estab¬ 
lishes variable resistance as a function of the ap¬ 
plied electric field to the solid. 

Understanding the chemical and physical prop¬ 
erties of local active sites of metal oxides has a 
twofold interest: First, from a fundamental point 
of view, it is interesting to see which new features 
are induced by the presence of defects in the 
surface and how these surface defects are related 
to the bulk electronic and atomic structure. Sec¬ 
ond, the practical interest results from the numer¬ 


ous technological applications where they are used. 
Among these, varistor properties are probably the 
most important for us. 

This is a collaborative project between experi¬ 
mental and theoretical research groups. Our in¬ 
creasing experimental efforts [7,8,12-14] can be 
accompanied by quantum chemical studies of local 
defects; among them, the cluster approach [5-17] 
is probably the most widely adopted and many 
successful applications of this model have been 
made on Ti0 2 [18,19] and related systems [20-23]. 
The aim of this work was to use ab initio tech¬ 
niques to investigate the electronic structures of 
doped Ti0 2 systems in order to understand, at the 
molecular level, the varistor properties of Ti0 2 
and to rationalize the role played by the oxygen 
vacancies and the substitution of Nb and Cr atoms, 
as dopants, replacing Ti atoms in the lattice. 

The article is organized as follows: The model 
and method of calculation are summarized in the 
next section. The results and discussion are pre¬ 
sented in the third section. A short section of 
conclusions closes this article. 


Models and Method of Calculation 

Ti0 2 crystallizes in the rutile structure, which 
has tetragonal symmetry. There are two for¬ 
mula units per primitive unit cell, with the three- 
fold-coordinated oxygen atoms forming a dis¬ 
torted octahedron around the titanium atoms [24]. 
This material has a relatively strong ionic character 
and its bulk valence band has a dominant O-p 
orbital character and the orbital composition of the 
lower conduction bands result from atomic Ti-3d 
states. Experimental analysis [25-28] shows that 
the (110) surface is the most stable face of Ti0 2 
and oxygen vacancies are by far the most fre¬ 
quently occurring defect. Therefore, we restrict 
ourselves here to the study of this structure the 
different types of defects and illustrate some basic 
aspects of the varistor phenomena. This face is a 
nonpolar surface that can be studied even in its 
bulk-derived experimental geometry. The rutile 
structure can be regarded as consisting of (110) 
planes of atoms containing both metal, Ti, and 
oxygen, O, atoms, separated by planes containing 
oxygen alone, so that the sequence of planes is O 
—Ti 2 0 2 —O—O—Ti 2 0 2 —O, etc. The entire crys¬ 
tal can then be built up of the symmetrical three- 
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plane O—Ti 2 0 2 —O units. We selected the large 
(Ti0 2 ) 15 cluster model, shown in Figure 1, derived 
from the crystallographic data. 

Models for the stoichiometric (110) surface of 
the titanium oxide surface in the oxidized and 
reduced forms, as well as doped with niobium and 
chromium atoms, were constructed in this work 
by using the usual repeating slab geometry. The 
following models were selected: (Ti0 2 ) 15 (I) to 
represent the fully oxidized surface; (Ti0 2 ) 14 Ti0 
(II) and (Ti0 2 ) 13 (Ti0) 2 (III) were made by remov¬ 
ing one oxygen in position 16 and two oxygens in 
positions 16 and 17 from model I, respectively. 
(Ti0 2 ) 13 (Cr0) 2 (IV) to represent the surface doped 
with the Cr atom in positions 14 and 14' and 
(Ti0 2 ) 13 (Nb0) 2 to represent the surface doped with 
the Nb atom in sites 3 and 3' (V) and sites 1 and V 

(VI) ; (Ti0 2 ) n (Nb0) 2 (Cr0) 2 corresponds to the sur¬ 
face simultaneously doped with the Nb atom in 
the 1 and V sites and Cr in the 14 and 14' sites 

(VII) . In model VIII, the Nb atom is placed in the 
3 and 3' sites, while the Cr atom is located in the 
14 and 14' sites, and in model IX, Nb is placed in 
the 14 and 14' sites, while the Cr atom is located in 
the 1 and V sites. 

The calculations were carried out with the 
Gaussian94 package [29]. The ab initio Hartree- 


Fock (HF) level of the theory was employed with 
the contracted basis set for titanium (4322/42/3), 
oxygen (33/3), niobium (43333/433/43), 
and chromium (4322/42/3) atoms developed by 
Huzinaga et al. [30,31]. The orbital exponent and 
contraction coefficients of the atomic wave func¬ 
tions were determined by optimizing the total en¬ 
ergies as explained in detail in a previous article of 
Sakai et al. [32]. All calculations were carried out 
in the ground singlet electronic state. HOMO- 
LUMO corresponds to the band gap value. 

In addition, we try to determine how the 
boundary conditions can affect the different values 
of the band gap and net atomic charges of the 
different cluster models. The computational ap¬ 
proach for the theoretical treatment is based on 
continuum models [33]: The cluster is embedded 
in a cavity while the environment, treated as a 
continuous medium having the same dielectric 
constant as the bulk system, is incorporated into 
the solute Hamiltonian as a perturbation. In this 
reaction-field approach, the bulk system is polar¬ 
ized by the environment. The electronic distribu¬ 
tion of the cluster system polarizes the continuum 
which generates an electric field inside the cavity, 
which, in turn, affects the solute's geometry and 
electronic structure. The implementation of this 



FIGURE 1. (Ti0 2 ) 15 cluster model for the etoichiometric (110) surface. Black and gray circles represent oxygen and 
metal ions, respectively. 
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interaction scheme is achieved through the self- 
consistent reaction field method developed ini¬ 
tially by Tapia [34,35]. In this work, we employed 
the continuum model of Rivail et al. [36-38] based 
on the use of cavities and a multipolar expansion 
of the solute electrostatic potential. Calculations 
were made using ellipsoidal cavities surrounded 
by a continuum of the dielectric constant (s = 86.0, 
bulk Ti0 2 rutile) and multipolar expansion up to 
order 6. An analysis and comparison between the 
results of the different cluster models obtained in 
vacuo and by taking into account a continuum 
representation of the environment show that the 
values of the band gap and net atomic charges are 
similar. For this reason, only the results given in 
vacuo are presented in this work. 


Results and Discussion 

Charges on atoms are calculated using the Mul- 
liken population analysis. In addition, to improve 
the population representation, we selected the nat¬ 
ural population analysis based on Weinhold's nat¬ 
ural bond orbital (NBO) [39-41]. The Mulliken 
charges are always lower than the NBO, but simi¬ 
lar trends are obtained with respect to the varia¬ 
tion between pure (I) and doped (IV, V, VI, VII, 
VIII, and IX) clusters and clusters with vacancies 
(II, III). It seems difficult to discuss the absolute 
values of the charge on the individual Ti and O 
atoms; however, some useful information could be 
drawn from the results by comparing the charge 
on the atoms in the corresponding model. 

The values of the net atomic charge and band 
gap for the nine models are presented in Table I. 


The results point out that the surface is reduced 
when oxygen vacancies are formed in the II and 
III models, with a corresponding decrease in the 
charge density of titanium, i.e., the Ti 14 and Ti 15 
atoms decrease their positive charge whereas 
changes in the charge for more distant atoms are 
very small. This result can be represented by the 
following equation: 

TiO, 

Ti0 2 Ti'yj + V 0 " + O* + |0 2 , (1) 

in which the species Ti 4i are formed. 

Therefore, the creation of intrinsic defects in the 
structure of titanium oxide promotes the reduction 
of the titanium atoms. The ab initio results repro¬ 
duce, in a reasonable way, this redox reaction 
which occurs in n-type semiconductors such as 
Ti0 2 . As experimentally verified in our laboratory 
[7,8,12-14], there is a decreasing of the band gap 
value in the formation of n-type semiconductors, 
which is due to the V 6 - and V 0 -formed species. The 
results show that this effect is proportional to the 
increase of the concentration of the vacancies. The 
two-step reduction of the titanium atom can be 
rationalized on the basis of the simulation of one 
and two oxygen vacancies' formation. The per- 
centual variation of the charge density of oxidized 
titanium to reduced titanium, Ti' 14 and Ti 15 , is 
around 40 and 6%, respectively, going from model 
I to model II. When the second vacancy is formed, 
Ti 14 experiences a percentual reduction of 45% in 
the charge density. These results suggest the for¬ 
mation of the species Ti 4i . Marucco et al. [43] 
proposed that the defects induced by the reduction 
process in titanium dioxide are oxygen vacancies 
and interstitial titanium, their relative importance 


TABLE I_ 

Charge on titanium, oxygen, chromium, and niobium atoms and band gap (eV). 


Atoms 

i 

li 

III 

IV 

V 

VI 

VII 

VIII 

IX 

Ti(i5) 

1.70 

1.60 

1.46 

1.49 

1.37 

1.31 

1.07 

1.42 

1.51 

X(14) 

1.56 

1.60 

0.87 

0.97 

1.19 

1.22 

1.15 

1.20 

1.07 

X(14') 

1.56 

0.94 

0.87 

0.97 

1.19 

1.22 

1.15 

1.20 

1.07 

x„ 

1.71 

1.62 

1.55 

1.70 

1.63 

1.60 

1.61 

1.62 

1.32 

X,.’, 

1.71 

1.62 

1.55 

1.70 

1.63 

1.60 

1.61 

1.62 

1.32 

X (3) 

1.73 

1.64 

1.71 

1.77 

1.68 

1.73 

1.67 

1.57 

1.75 

V) 

Band 

1.73 

1.74 

1.71 

1.77 

1.68 

1.73 

1.67 

1.57 

1.75 

gap 

3.53 

2.28 

1.98 

3.64 

2.57 

3.64 

3.20 

2.62 

3.12 


X is Ti, Nb, or Cr in the x and x' positions on the crystal cluster of Ti0 2 . 
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being governed mainly by temperature. Oxygen 
vacancies dominate below 1000°C. To interpret the 
electrical behavior, one can assume that an inter¬ 
mediate ion between Ti 4+ and Ti 2+ , the Ti 34 ion, 
can be formed during vacancy formation, accord¬ 
ing to the equation 

2Ti- y, + O* -» 2Ti' Ti + V Q + ±0 2 . (2) 

This assumption is also in agreement with the 
experimental work of Watanabe et al. [9] which 
studied the reduction of the Sn0 2 surface associ¬ 
ated with the change of the valence state from 
Sn +4 to Sn +2 , which is similar to that observed in 
the Ti0 2 . These authors pointed out a decreasing 
of the band gap value associated with an incre¬ 
ment of oxygen vacancies. On the basis of their 
results, they proposed the coexistence of an inter¬ 
mediate Sn 3 0 4 structure in the reduced crystal 
(Sn0 2 _ Y ), which characterizes the Sn 3+ ion. In the 
present work, the substitution of the Ti atom by a 
metal, either Nb or Cr, should involve a change in 
the stoichiometric plane [O]—[Ti0 2 ]—[O], with 
formal charges [2 — ]—[4 + ]—[2 — ], by the 
planes [O]—[Cr 2 0 2 ]—[O] or [O]—[Nb 2 0 2 ]—[O], 
with formal charges [2 — ]—[2 + ]—[2 — 3 and 
[2 — ]—[6 + ]—[2 — ], respectively. The results of 
Table I support this suggestion, showing the coex¬ 
istence of two species with different charge densi¬ 
ties in the site after reduction: the Ti +3 and the 
Ti +2 species. 

After substitution of two Cr atoms for two Ti 
atoms, the IV model, the Ti 4 atoms increase their 
positive charge, around 10%, while the 0 12 atoms 
increase their negative charge. After substitution 
of two Nb atoms for two Ti atoms, the V model, 
Ti 5 , Ti n , and Ti 15 atoms decrease their positive 
charge around 6%, while Ti 1 and Ti u atoms in¬ 
crease their positive charge around 5 and 37%, 
respectively, and 0 4 atoms decrease their negative 
charge while 0 12 atoms increase their negative 
charge. The double substitution of niobium and 
chromium atoms in Ti sites, the VI model, yields a 
decreasing of the positive charge for Ti 5 , Ti n , and 
Ti 15 atoms, 4, 6, and 10%, respectively, while Ti 14 
atoms increase their positive charge about 40% 
and 0 4 and O 10 atoms increase their negative 
charge. For the VII model, Ti 3 , Ti n , and Ti 15 
atoms decrease their positive charge around 2, 4, 
and 27%, respectively, and 0 9 , O 10 , and 0 12 atoms 
increase their negative charge while O z atoms de¬ 
crease their negative charge. For the VIII model. 


Ti a atoms increase their positive charge about 5%, 
while 0 12 atoms increase their negative charge. 
For the IX model, Ti n atoms decrease their posi¬ 
tive charge about 9%, while Ti 14 atoms increase 
their positive charge about 23%. 

Table I presents the calculated values of the 
band gap, which shows the influence of vacancy 
formation and the presence of dopants on the 
surface of Ti0 2 . There is a partial oxidation when 
chromium substitutes titanium in the lattice and 
the species Cr^ and V a are formed to preserve the 
electroneutrality, thus decreasing the conductivity 
of Ti0 2 . When the reduced Ti0 2 is doped with 
chromium or niobium, the band gap value in¬ 
creases and Ti0 2 becomes an n-type semiconduc¬ 
tor when a vacancy is formed. Our results for the 
doped Ti0 2 agree with the formation of an n-type 
semiconductor; however, this effect is highly de¬ 
pendent on the location of the dopant in the lattice. 
The band gap presents the lowest value when the 
doping process takes place in the Ti 3 or Ti 3 , (V 
and VIII models). Similar values of the band gap 
are found when two Cr atoms or two Nb atoms 
are located in position 14, 14', or 1,1', respectively 
(IV and VI models). There is a partial oxidation 
when Cr or Nb substitutes titanium in the lattice 
and the species X' Ti (X = Cr, Nb) and Vq are formed 
to preserve the electroneutrality, thus decreasing 
the conductivity of Ti0 2 . This result agrees with 
the experimental results of Gupta and Carlson [4] 
and Leite et al. [7] which characterize the degrada¬ 
tion of the varistor. For example, the addition of 
Nb 2 O s in small amounts to the Ti0 2 system leads 
to an increase of the electronic conductivity in the 
Ti0 2 lattice [13,14]. It is promoted by the forma¬ 
tion of defects in the surface and in the interior of 
the grain, tending to electroneutralize mutually. 
The varistor behavior of Ti0 2 can be explained by 
the introduction of defects in the crystal lattice that 
are responsible for the formation of Schottky-type 
potential barriers at the grain boundaries. By anal¬ 
ogy with the atomic defect model proposed by 
Gupta and Carlson [4] for the ZnO varistor, the 
potential barrier is formed by intrinsic defects of 
Ti0 2 (oxygen vacancy and interstitial ionized tita¬ 
nium atoms) and by extrinsic defects created by 
solid substitution of dopants and negative charges 
at the interface. To explain the adsorption of oxy¬ 
gen on the Ti0 2 surface, Bueno et al. [8] proposed 
reactions 1 and 2 to be responsible for the forma¬ 
tion of a potential barrier and, consequently, for 
the nonlinearity of Ti0 2 -based varistors. 
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Conclusions 

We carried out an ab initio study on the Ti0 2 
system in order to rationalize our previous experi¬ 
mental results. The varistor properties of this ma¬ 
terial were analyzed taking into account the for¬ 
mation of oxygen vacancies and the presence of 
Nb and Cr as dopants in the lattice. Large cluster 
models were selected and a contracted basis set 
developed by Huzinaga et al. was used to repre¬ 
sent the atomic centers. The specific details of the 
process may change at higher levels of theory (e.g v 
inclusion of correlation energy), but despite the 
approximate procedure of the calculations (e.g., 
band gap and Mulliken charges values obtained at 
HF level and embedding representation of cluster 
models) employed here, some important features 
were clarified. Taken together with the results of 
our previous experimental studies, the following 
conclusions can be drawn from the results re¬ 
ported in this study: (i) The ab initio calculations 
show that the formation of oxygen vacancies in 
Ti0 2 leads to a decrease in the band gap value, (ii) 
Two different valence states of titanium may coex¬ 
ist, probably Ti +2 and Ti + 3 . (iii) The band gap 
values are strongly affected by the doping process, 
(iv) The global analysis of the results agrees with 
the experimental data and the physical conception 
of the performance of the Ti0 2 varistor, (v) Volt¬ 
age barrier formation and degradation, associated 
with the band gap, can be explained by a redox 
reaction between atomic defects. This fact can be 
of relevance for the understanding of varistor 
properties of Ti0 2 and related materials. 
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ABSTRACT: The geometry dependence of the exchange integrals and the total 
energies of sulfur-bridged complexes of the types L 5 NiSNiL 5 , L 4 NiS 2 NiL 4 , and 
L 3 NiS 3 NiL 3 is studied by quantum chemical ab initio methods. The linear monobridged 
complex is antiferromagnetic, the bi- and triply-bridged complexes are ferromagnetic for 
NiSNi angles between 85° and 100° and antiferromagnetic for smaller and larger angles. 
The superexchange mechanism is analyzed, and a comparison with oxygen-bridged 
complexes and experimental data is performed. © 1997 John Wiley & Sons, Inc. Int J Quant 
Chem 65: 633-641, 1997 


Introduction 

T hough sulfur-bridged binuclear Ni(II) units 
are the active sites in some enzymes, e.g., 
urease [1-3], not much is known about the mag¬ 
netic coupling of the spins at the two Ni 2+ cations 
in S-bridged binuclear Ni(II) complexes of the gen¬ 
eral form LNiS, t NiL, n = 1,2,3, with one, two, or 
three sulfur bridges (L stands for any terminal 
ligands). In urease, it seems that the Ni(II) ions 
are all high spin and exhibit a weak antiferromag¬ 
netic coupling (/ = — 6 cm -1 ) [1-3]. Trinuclear 
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complexes of the form LNiS 3 NiS 3 CoL and 
LNiS 3 NiS 3 NiL with octahedrally coordinated 
Ni(II) ions have been recently synthesized by Beis- 
sel et al. [4]. The analysis of magnetic susceptibil¬ 
ity measurements yielded values of about —30 
cm” 1 for the exchange coupling between two adja¬ 
cent Ni(II) ions bridged by three S atoms (which, 
however, are not isolated S atoms or S 2 ” anions, 
but belong to the ligand system). Such values 
indicate that the antiferromagnetic coupling is 
stronger than in similar oxygen-bridged com¬ 
plexes, but it is not certain whether differences in 
the geometries, partial d-occupations of the sulfur 
atoms in the bridging unit or other reasons are the 
source for the stronger antiferromagnetism in S- 
bridged systems. One trinuclear sulfur-bridged 
nickel thiolate cluster with an almost planar Ni 3 S 3 
ring has been reported recently in the literature as 

CCC 0020-7608 / 97 / 050633-09 



FINK, WANG, AND STAEMMLER 


well [5]. This complex, however, contains Ni 2+ 
cations in square-planar coordination and is dia¬ 
magnetic. To the best of our knowledge, no further 
bi- or trinuclear S-bridged Ni(II) complexes are 
known for which both X-ray structures and mag¬ 
netic susceptibility data have been determined. 

In a previous work [6] we have presented ab 
initio calculations for the exchange coupling in 
oxygen-bridged NiONi and Ni0 2 Ni systems, to¬ 
gether with an analysis of the geometry depen¬ 
dence of the exchange integral /. The present work 
extends this study to S-bridged Ni(II) complexes of 
the form LNiS n NiL, n = 1,2,3. Since the Ni-S 
distances vary considerably in different com¬ 
pounds, we analyzed the geometry dependence of 
the total energies and of the exchange integrals in 
the mono- and bibridged complexes and compared 
the results to those of oxygen-bridged complexes. 

The study is organized as follows: The second 
section gives a brief description of the method of 
calculation. The third section contains the results 
for the linear monobridged complexes, the fourth 
and fifth sections those for the bi-bridged and 
triply-bridged complexes, respectively. The last 
section gives a short summary. 


Method of Calculation 

The present calculations have been performed 
in essentially the same way as in our previous 
study [6] on oxygen-bridged Ni(II) complexes. 
They consist of mainly three steps: 

1. A self-consistent field (SCF) or, more pre¬ 
cisely, a restricted open-shell Hartree-Fock 
(ROHF) calculation for the high-multiplicity 
state: For all complexes in this study this is a 
quintet state since the two Ni(II) ions, each of 
them with a d s configuration and a spatially 
nondegenerate 1 2 3 A 2? (S = 1) ground state in 
octahedral coordination, can be coupled to a 
quintet (S — 2), a triplet (S = 1), and a sin¬ 
glet (S = 0) state. 

2. A full valence configuration interaction (VCI) 
calculation [7] using the ROFIF orbitals of the 
first step, or alternatively a complete active 
space SCF (CASSCF) calculation [8] for all 
three states (quintet, triplet, singlet) simulta¬ 

neously or for each of the states separately in 
order to reoptimize the orbitals for the triplet 


and singlet states. There is virtually no differ¬ 
ence between these three treatments since the 
three electronic states are very close in en¬ 
ergy. The active space for the VCI and 
CASSCF calculations consisted of the four 
c^-type 3d atomic orbitals (AOs) at the two 
Ni atoms (for more details see Ref. [6]. 

3. Dynamic correlation effects are included on 
top of the VCI or CASSCF calculation by the 
multiconfiguration coupled electron pair ap¬ 
proach (MC-CEPA) method [9]. We have al¬ 
ways correlated the ten 3d AOs at the metal 
atoms and the 3s,3p valence orbitals at the 
bridging S atoms. In general, the averaged 
coupled pair functional (ACPF) variant of 
our MC-CEPA program [9] has been used, 
since this variant yields the most reliable 
values for the exchange splittings [6]. 

It has been found before [6, 10-12] that the 
inclusion of dynamic correlation effects is neces¬ 
sary if one wants to calculate reliable values for 
exchange integrals. The main reason is that the 
antiferromagnetic contribution, J AF , to the total 
exchange integral / is determined by the extent to 
which " ionic" (charge transfer) configurations are 
mixed to the dominant "neutral" (covalent) ones 
in the VCI. As long as the ionic configurations are 
built up from orbitals which have been optimized 
for covalent states and no relaxation effects are 
accounted for, their energies are too high and 
consequently their contribution to / AF is too small. 
In a separate study [10] we present "effective" Cl 
calculations in which the VCI matrix is modified in 
such a way as to simulate these relaxation effects 
in an effective and cheap manner. In the present 
work we have simply scaled the antiferromagnetic 
contribution with a weighting factor, which has 
determined by a comparison between VCI and 
ACPF results. More details are given in the section 
on the bridged L 4 NiS 2 NiL 4 complex. 

In order to save computer time, we have re¬ 
placed the bulky terminal ligands, which are nec¬ 
essary in the experimental work to stabilize the 
rather unstable bi- and trinuclear complexes, by 
the He-like model ligands introduced in the earlier 
studies of this series [6, 11, 12]. In the present 
calculations the same parameters have been used 
as optimized before for the NiONi and Ni0 2 Ni 
complexes [6], i.e., an effective charge of 1.80 for 
the He-like model ligands L and a Ni-L distance 
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of 1.725 A. In all complexes the number of termi¬ 
nal ligands was chosen in such a way that each 
metal ion had a full though distorted octahedral 
coordination. 

The two Gaussian basis sets that have been 
used in the present calculations are summarized in 
Table I. The basis set for the model ligand is 
identical to the one optimized before for NiONi 
[6]. For Ni and S, the basis I has double zeta (DZ) 
quality, basis II has triple zeta plus polarization 
(TZP) quality. 

Since the difference in the antiferromagnetic 
coupling through O and S bridges may be at¬ 
tributed to a partial d-occupation in the S 2- bridg¬ 
ing ions, we have systematically varied the d ex¬ 
ponent at the S-atom between 0.0 (no d functions) 
and 1.5 and monitored its influence on both the 
total energies and the value of /, i.e., the splitting 
between the lowest singlet, triplet, and quintet 
states. These tests have been performed for a linear 
NiSNi complex at a fixed Ni-S distance of 2.20 A 
(i.e., slightly shorter than the equilibrium distance, 
compare next section) and for a bent NiS 2 Ni com¬ 
plex in the vicinity of its SCF equilibrium structure 
with R(Ni-S) - 2.35 A and ZNiSNi = 90° (see 
fourth section). Basis set I was used for these tests. 
The results of these calculations can be summa¬ 
rized shortly as follows: The optimum value for 
the d exponent—as judged from the gain in the 
total energies—is about 0.2 at the CASSCF and 0.3 
at the ACPF level. The gain in total energy is 
rather small at the CASSCF level (~ 0.020 a.u. for 
the linear NiSNi complex and ~ 0.050 a.u. for 
NiS 2 Ni), but large at the ACPF level (~ 0.120 a.u. 
for NiSNi; NiS 2 Ni was not treated at this level). 
The / values, on the other hand, are hardly af¬ 
fected by the change of the d exponent: At the 


CASSCF level there is no noticeable influence of 
the d functions on /. This holds both for the NiSNi 
and the NiS 2 Ni complexes, but for the bibridged 
complex with an NiSNi angle of 90° the antiferro¬ 
magnetic part of the magnetic coupling is so small 
that statements concerning the d contribution are 
not conclusive. At the ACPF level, the contribution 
of the d AOs to / is in the order of 10% for the 
linear NiSNi unit. 

The results of these basis tests indicate that the 
d orbitals at the bridging sulfur atoms do not 
directly influence the active orbitals, but are only 
needed for describing dynamic correlation and re¬ 
laxation effects. All calculations reported in the 
next sections are performed with basis II and a d 
exponent of 0.3 (compare Table I). 


Linear L 5 MSML 5 Complex 

Table II contains our results for the total energy 
of the 1 A lg ground state and for the exchange 
integral / in the linear monobridged L 5 NiSNiL 5 
complex, as functions of the Ni-S distance. Only 
the results of the ACPF variant of our MC-CEPA 
program [9] are included. As mentioned above, all 
metal 3d and the sulfur 3s3p orbitals have been 
correlated. 

From the data of Table II one can conclude that 

o 

the equilibrium distance amounts to 2.32 A at the 
VCI (or SCF) level and is slightly shortened to 2.28 
A at the ACPF level. A direct comparison with 
experiment is not possible since no X-ray structure 
of a complex with a linear NiSNi bridge is known 
and since our calculations have been performed 
with the model ligands, which make the complex 
charge neutral. Bulky terminal ligands, effective 


TABLE I 


Gaussian basis sets for S-bridged binuclear Ni(ll) complexes. 

Atom 

Type 

Contraction 

Ref. 



Basis I 


Ligand 

TZ 

7s -> 3s(5,2*1) (30.2,9.8,3.5,1.4,0.55,0.17, 0.05) 

6 

Ni 

DZ 

12s6p4d -* 7,5*1; 4, 2*1; 2, 2*1 

13 

S 

DZ 

9s5p -> 4,5*1; 2,3*1 + 1s(0.1), 1p(0.1) 

14 



Basis II 


Ligand 

TZ 

As basis 1 


Ni 

TZP 

14s9p5d -> 10,4*1; 6,3*1; 3,2*1 + s(0.35), f (2.0) 

15 

S 

TZP 

11s7p -»• 5,6*1; 4, 3*1 + 2s(1.0,0.075), 2p(1.0, 0.075), Id(0.3) 

14 
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TABLE II_ 

VCI and ACPF results for the total energy and the exchange integral J in the linear L 5 NiSNiL 5 complex . 9 * 5 


ft (Ni-S) 

o 

A 

ECA, g ) 

a.u. 

VCI 

^0 -» 1 
cm™ 1 

VCI 

cm” 1 

VCI 

EVA 1g ) 

a.u. 

ACPF 

J Q ^ -i 

cm” 1 

ACPF 

J ~\ —* 2 

cm “ 1 

ACPF 


-3433.083 115 

-44.1 

-45.2 

-3433.657 409 

-112.7 

-116.2 

2.25 

.087 559 

-39.1 

-39.9 

.659 846 

-101.7 

-104.5 


.089 419 

-34.5 

-35.1 

.659 905 

-91.9 

-94.0 

2.35 

.089 163 

-30.5 

-31.1 

.658 081 

-83.3 

-84.9 


0.87 171 

-27.2 

-27.5 

.654 691 

-76.2 

-77.2 

a L = 
b J 

J Q -> 

He-like model ligand [6], 

, =1/2[E(S = 1) - E(S = 0)1; J^ 2 

= 1 / 4[£(S = 2) 

- £(S= 1)1. 





charges on the complexes, counter ions, etc. might 
have some influence on the exact value of the 
equilibrium distance, in particular since the poten¬ 
tial curve is rather flat and its minimum is much 
less pronounced than, e.g., in L 5 NiONiL 5 [6]. On 
the other hand, Ni-S distances of about 2.30 A 
seem reasonable for octahedrally coordinated Ni 
atoms [1, 16, 17]. 

The exchange integral / is negative for all dis¬ 
tances and decays nearly exponentially with in¬ 
creasing distance R, both at the VCI and the ACPF 
levels. As for the linear NiONi unit the J values at 
the ACPF level, i.e., after inclusion of dynamical 
correlation effects, are nearly a factor of 3 larger 
than at the VCI level. Again, we attribute this 
failure of the "active orbital" approach [18] to the 
fact that in the VCI or CASSCF treatments the 
energies of the charge transfer states are calculated 
with the unrelaxed orbitals optimized for the cova¬ 
lent ground state [10]. Finally, the splitting be¬ 
tween the singlet ground state and the first excited 
triplet and quintet states shows a regular Lande 
pattern with a ratio 2:4, both in the VCI and ACPF 
treatments. 

If we compare the absolute value of the ex¬ 
change integral in the linear NiSNi unit (Table II) 
with our previous results for the NiONi bridge [6] 
at the respective equilibrium distances, we find 
that the antiferromagnetic coupling through the S 
bridge is slightly, but not substantially, larger than 
through the O bridge: For L 5 NiONiL 5 at R(Ni-O) 
= 2.08 A we found / = —24 cm -1 (VCI) and / = 
— 80 cm -1 (best ACPF value) [6] while the corre¬ 
sponding values for the L 5 NiSNiL 5 complex at 
R(Ni-S) = 2.30 A are -35 and -94 cm" 1 . This is 
again an indication that a partial occupation of the 
3d AOs in the bridging sulfur atom is no impor¬ 


tant effect for the antiferromagnetic coupling 
through S bridges. It is more important that J 
depends quite sensitively on the Ni-S distance. 


Bi-bridged L 4 NiS 2 NiL 4 Complex 

Figure 1 contains contour plots for (a) the total 
energy of the lowest quintet state and (b) the 
exchange integral of the bi-bridged L 4 NiS 2 NiL 4 
complex as functions of the Ni-S distance and the 
NiSNi angle, calculated at the VCI level with basis 
II without the / functions at the Ni atoms. 

Figure 1(a) shows that the potential energy sur¬ 
face possesses a rather shallow minimum. At the 
VCI level the equilibrium geometry corresponds to 
a Ni-S distance of 2.41 A and a NiSNi angle of 
86.3°. The elongation of the Ni-S bond length as 
compared to the value for the NiSNi unit with a 
single linear sulfur bridge (2.32 A, previous sec¬ 
tion) and the fact that the NiSNi bond angle is 
markedly below 90° indicate that the bulky S 2 ~ 
anions try to avoid each other. The direct compari¬ 
son with experimental data is again difficult, since 
most Ni(II) complexes with S-donor ligands have a 
square planar coordination [5, 17] and, again, our 
model ligands are not directly related to real lig¬ 
ands. However, bond angles between 85° and 94° 
have been reported for bi-bridged NiS 2 Ni units 
[17]. On the other hand, the binuclear Ni(II) com¬ 
plex with two SPh bridges and octahedrally coor¬ 
dinated Ni(II) ions, as synthesized by Baidya et al. 
[16], exhibits much longer Ni-S distances (2.41 
and 2.52 A) and a larger NiSNi angle of 105.3°. 

Figure 1(b) shows that the exchange integral 
changes sign just in the region of the minimum of 
the potential surface. The traditional explanation 


636 


VOL 65, NO. 5 







SULFUR-BRIDGED BINUCLEAR Ni(ll) COMPLEXES 



NiSNi angle / 

(a) 



FIGURE 1. (a) Contour plot for the total energy of the lowest quintet state of the L 4 NiS 2 NiL 4 complex. ROHF, basis II 
without f functions at Ni. The lowest contour (dotted line) has an energy of -3826.225 a.u., the spacing of the contours 
is 0.010 a.u.= 26.24 kJ/mol. (b) Contour plot of the exchange integral in the L 4 NiS 2 NiL 4 complex. The spacing of the 
contours is 5 cm" 1 ; full contours represent negative, dashed ones positive values of J, the nodeline is dotted. 


[6, 18, 19] is that the magnetic orbitals are orthogo¬ 
nal at an NiSNi angle of about 90° or slightly 
larger. The antiferromagnetic superexchange con¬ 
tribution in this region of the potential energy 
surface is therefore smaller than the direct cou¬ 


pling, hence the coupling is ferromagnetic in the 
region of about 85° to 100° [6] and antiferromag¬ 
netic for smaller and larger angles. This explana¬ 
tion is corroborated by the present calculations. 
Figure 1(b) shows that / is very sensitively depen- 
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dent on the geometry of the bridging unit, there¬ 
fore an interpretation of experimental data for J is 
meaningless unless an X-ray structure is available. 

As is well known [18], the exchange integral / 
consists of two contributions, the direct ferromag¬ 
netic exchange coupling, J F , between the magnetic 
orbitals and the antiferromagnetic superexchange 
coupling, / AF , which is caused by an admixture of 
ionic configurations to the predominantly covalent 
wave functions of the low-lying spin states [11]: 

/ = Jf + /af • 

The ferromagnetic contribution J F can be deter¬ 
mined separately in a first step from a modified 
VCI calculation in which the active orbitals are 
localized, e.g., by a Boys localization procedure 
[20], and only the "neutral" (covalent) determi¬ 
nants, i.e., those determinants in which exactly two 
electrons occupy the localized c^-type orbitals at 
each Ni center, are included. The "ionic" (charge 
transfer) determinants which are responsible for 
the superexchange coupling are thus excluded 
from the VCI and only the direct exchange cou¬ 
pling f F between the localized magnetic orbitals is 
obtained. Of course, the value of } F depends to a 
certain extent on the localization scheme. 

In a second step, the full VCI calculation con¬ 
taining both neutral and ionic determinants is per¬ 
formed. It yields the total exchange splitting J, i.e., 
the sum of J F and /af* However, as long as ROHF 
or CASSCF orbitals are used which have been 
optimized for the predominantly covalent low-ly¬ 


ing spin states, J F is described correctly but / AF is 
largely underestimated. The reason is that the en¬ 
ergies of the ionic states calculated with nonre- 
laxed covalent orbitals are much too high. Cl cal¬ 
culations with single and double excitations into 
the full virtual space are necessary to properly 
account for the relaxation effects in the charge 
transfer states [10]. Our previous experience with 
ACPF and MC-CEPA calculations for various bi- 
nuclear systems [6, 10-12] has shown that the 
antiferromagnetic contribution /af is enhanced by 
a factor of 2-3 after inclusion of dynamic correla¬ 
tion and relaxation effects. Stated differently, the 
relaxation effects lower the excitation energies of 
the charge transfer states by as much as a factor of 
2 or even more. 

In order to avoid the very time consuming ACPF 
calculations at every point of the two-dimensional 
potential energy surface for the L 4 NiS 2 NiL 4 com¬ 
plex, we have simply scaled the VCI result for the 
antiferromagnetic part /af by a scaling factor £, 
fixing } F at its VCI value. £ was determined 
semiempirically by comparing the VCI results and 
ACPF results at some preselected points. Table III 
shows that the ACPF results can be reasonably 
well simulated by a scaled VCI exchange interac¬ 
tion 

/sc = /f + £/af 

with a scaling factor £ = 2.0. 

Figure 2 shows a full two-dimensional contour 
plot of the scaled exchange integral J sc using 


TABLE III______ 

Decomposition of the exchange integral J In the L 4 NiS 2 NiL 4 complex in a direct ( J F ) and a superexchange 
(J AF ) contribution 8 


Distance* 3 

NiSNi 

angle 

J(VCI) C 

J p 

^AF 

Jsc* ~ + 2*Jaf 

J (ACPF) 

2.0 

CO 

o 

o 

-47.52 

24.80 

-72.32 

-119.83 


2.1 

90° 

-17.34 

16.19 

-33.52 

-50.86 


2.2 

90° 

-4.17 

11.08 

-15.25 

-19.42 

-19.7 

2.3 

90° 

0.99 

7.74 

-6.75 

-5.76 


2.2 

100° 

9.44 

14.43 

-4.99 

4.44 

8.9 

2.2 

80° 

-23.04 

9.66 

-32.70 

-55.75 

-50.3 

a Basis II, 

without f functions, 

J in cm -1 . 






b Ni-S distance in A. 
c J calculated at VCI level. 

d The antiferromagnetic contribution to J was scaled by a factor of 2.0 to obtain good agreement with the ACPF results. 
e ACPF results. 
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70 75 80 85 90 95 100 

NiSNi angle / 0 

FIGURE 2. Same as Figure 1 (b), the antiferromagnetic part of J was scaled by a factor of 2.0. 


consistently £ = 2.0. The comparison between the 
unmodified VCI results in Figure 1(b), and the 
scaled VCI results in Figure 2 shows that the 
ferromagnetic part of / remains unchanged while 
the antiferromagnetic part of / is strongly en¬ 
hanced after the inclusion of dynamic correlation 
effects, either by real ACPF calculations or by the 
scaling of / AF . Therefore, the region of positive 
values and the position of the node line are very 
similar in Figures 1(b) and 2, while the increase 
toward larger antiferromagnetic values of / for 
smaller Ni-S distances and bond angles is much 
steeper at the scaled VCI level. 

The dependence of the exchange integral / on 
the Ni-S distance and on the NiSNi angle, found 
in the present study for L 4 NiS 2 NiL 4 , is very simi¬ 
lar to the behavior in the corresponding O-bridged 
L 4 Ni0 2 NiL 4 complex [6]. 


L 3 NiS 3 NiL 3 

We did not calculate full two-dimensional sur¬ 
faces of the total energy and the exchange integral 
for the triply S-bridged complex L 3 NiS 3 NiL 3 . The 
reason is that due to the use of the nearly charge 
neutral model ligands our L 3 NiS 3 NiL 3 complex 


has a total charge of —3.2 and no stable equilib¬ 
rium geometry could be found. If the Ni-S distance 
is fixed to the experimental value of 2.36 A ob¬ 
served by Beissel et al. [4] in the trinuclear 
NiS 3 NiS 3 Ni unit, the optimum NiSNi angle turns 
out to be smaller (~ 71°) than the experimental 
equilibrium value (79° [4]). The electrostatic repul¬ 
sion between the three negative S 2 ” anions is too 
large in our model and has to be reduced by using 
either singly charged SR” anions or by adding 
reasonable counterions. 

The behavior of / as a function of the Ni-S 
bond length and the NiSNi angle in the triply 
S-bridged L 3 NiS 3 NiL 3 complex is very similar to 
what was found before in the bibridged complex. 
This is nicely documented in Figure 3. From the 
similarity of the curves for the L 4 NiS 2 NiL 4 com¬ 
plex with an NiSNi angle of 80° and for 
L 3 NiS 3 NiL 3 at 79°, we conclude that the scaled 
VCI results for / as presented in Figure 2 for the 
bibridged complex can as well be used to predict 
the magneto-structural relations in triply S-bridged 
Ni(II) complexes. 

At the experimental geometry (R(Ni-S) = 2.36 
A with an NiSNi angle of 79° [4]) we obtained 
/ = —9.5 cm -1 with J F = 7.0 cm” 1 and J AF = 
-16.5 cm” 1 . If we again scale /af by a factor of 2, 
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FIGURE 3. Exchange integral J as a function of the Ni-S distance in mono-, bi-, and triply S-bridged binuclear Ni(ll) 
complexes. VCI level, basis II. 


we obtained / = J F + 2/ AF = -26 cm" 1 , which is 
in agreement with the experimental value / = 
-27.0 cm -1 , reported by Beissel et al. for the 
trinuclear NiS 3 NiS 3 Ni complex [4]. 


Sum mary and Conclusions 

In the present study we calculated the exchange 
integral / in mono-, bi-, and triply S-bridged binu¬ 
clear Ni(II) complexes, we examined the geometry 
dependence of / and the total energy of the com¬ 
plexes, and compared the results to oxygen- 
bridged complexes. Inclusion of the 3d orbitals at 
the bridging sulfur ions lowers the total energy of 
the complexes, but has nearly no influence on /. 
The superexchange mechanism is the same as in 
oxygen-bridged complexes. Even the angular be¬ 
havior of J is quite similar. The complexes are 
ferromagnetic for NiSNi angles between 85° and 
100° and antiferromagnetic for smaller and larger 
angles. 

In contrast to the oxygen-bridged Ni(II) com¬ 
plexes in which the Ni-O distances are quite simi¬ 
lar in different compounds, the experimentally 
known values for Ni-S distances in different com¬ 
plexes with octahedrally coordinated Ni vary be¬ 
tween 2.3 and 2.5 A. This is caused by very shal¬ 
low potential surfaces. Because of the strong 


geometry dependence of / and the change of its 
sign in the region of the minimum of the poten¬ 
tial surfaces, the values for / will be different in 
different complexes. 

In order to avoid the time-consuming ACPF 
calculations, we used a scaled VCI in which J AF 
was weighted with a factor of 2. With this method 
we found very good agreement between our calcu¬ 
lated value for / in a triply S-bridged complex and 
the experimental data of Beissel et al. [4]. 
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Ab Initio Study 


WILLIAM R. ROCHA, WAGNER R. DE ALMEIDA 

Laboratorio de Quimica Computational e Modelagem Molecular , Depto. de Qutmica, ICEX-UFMG, 
Belo Horizonte , MG 31270-901, Brazil 

Received 11 March 1997; revised 16 April 1997; accepted 17 April 1997 


ABSTRACT: The reaction pathway for the insertion reaction of SnCl 2 into the Pt—Cl 
bond on the ris-Pt(Cl) 2 (PH 3 )2 compound was investigated at the ab initio MO level of 
theory. The optimized structure obtained for the transition state indicates that this 
reaction proceeds through a three-center transition state, and the formed intermediate 
as-Pt(Cl)(PH 3 ) 2 (SnCl 3 ) easily isomerizes to the frans-Pt(Cl)(PH 3 ) 2 (SnCl 3 ) compound. 
The nature of the bonds was investigated with the charge decomposition analysis (CDA) 
method and this method indicates that the SnCl 3 group is a stronger trans director than 
is the PH 3 group. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 643-650, 1997 


Introduction 

T he hydroformilation of olefins catalyzed by 
platinum complexes in combination with 
group IV A metal halides is of great interest. These 
systems have been shown to be highly active and 
selective in hydroformilation of olefins and their 
catalytic activity is comparable to that industrially 
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used and well-known rhodium catalyst [1], As 
an example, catalyst precursors of the type cis- 
Pt(Cl) 2 (PR 3 ) 2 , when combined with SnCl 2 , have 
been shown to be useful catalysts for the hydro¬ 
genation and hydroformilation of primary olefins 
[lb-le]. Several questions related to this catalyst 
are still unclear or unanswered. For example, the 
fact that catalytic activity is observed only in the 
presence of SnCl 2 [2] suggest that the Pt—Sn bond 
plays a fundamental role in the catalytic cycle. 
Therefore, an understanding of the nature of the 
Pt—Sn bond is very important. 

The 31 P and 195 Pt nuclear magnetic resonance 
(NMR) characterization of the complexes which 
arise in solution when SnCl 2 reacts with cis- Pt 
(C1) 2 (PR 3 ) 2 was studied by Pregosin and Sze [3], 
The reaction scheme proposed by these authors (1) 
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for this reaction indicates that the insertion of 
SnCl 2 into the platinum/chlorine bond in cis- 
Pt(Cl) 2 (PR 3 ) 2 is followed by rapid isomerization: 



R,P Pt Cl + SnCI ■ > 

3 I 2 CH 2 C1 2 

Cl 

PR 3 PR 3 

R 3 P — Pt — SnCl 3 —>C1 — Pt — SnCl 3 (1) 

Cl PR 3 

cis trans 

SCHEME 1 

Despite the fact that 31 PH{ 1 H}- and 195 Pt-NMR 
spectroscopy of complexes containing platinum, 
tin, and phosphorus provide a sensitive probe for 
complex structures, questions related to the elec¬ 
tronic nature of the Pt—Sn and Pt—P bonds are 
not answered directly by this method. 

In this work, we investigated the reaction path 
for the insertion reaction of SnCl 2 into the Pt—Cl 
bond (Scheme 1 in gas phase and using R = H) at 
the ab initio level of theory. All the stationary 
points present on the potential energy surface (PES) 
for this reaction were located and the nature of the 
bonds between the ligands and the platinum atom 
were investigated. The trans effects of the SnCl 3 
and PH 3 ligands are discussed and this may give 
some insight into how the elementary reactions 
(olefin insertion, carbon monoxide insertion, etc.), 
which take place in the catalytic cycle for hydro- 
formilation of olefins, may proceed. 


Method of Calculation 

Full-symmetry unconstrained geometry opti¬ 
mizations were performed at the RHF level of 
theory using the ECP and valence basis sets of 
Stevens, Basch, Krauss, and Jasien (SBKJ) [4] for 
the Pt and Sn atoms, 6-31G* [5] basis sets for the 
P and Cl atoms, and a 3-21G [5] basis set for the 
hydrogen atoms. The tin atom can expand its va¬ 
lence shell and use d empty orbitals to form a back 
coordination of the type d~d with the platinum 
atom [6]. Therefore, to analyze the role of the d 
empty orbitals of the tin atom on the nature of the 
Pt—Sn bond, a set of five d polarization functions 
(a = 0.180) [7] were added to the Sn atom. The 
stationary points located on the PES were identi¬ 


fied as minima (none imaginary frequencies) or 
transition states (characterized by having at least 
one imaginary frequency) through harmonic fre¬ 
quency calculation. To obtain better energetic re¬ 
sults, we carried out second-order Moller-Plesset 
(MP2) perturbation theory single-point calcula¬ 
tions on the RHF-optimized geometries with the 
same basis set. All calculations were carried out 
using the GAMESS96 [8] and GAUSSIAN94 [9] 
packages, at the Laboratorio de Quimica Computa- 
cional e Modelagem Molecular (LQC-MM), Depar- 
tamento de Quimica, Universidade Federal de Mi¬ 
nas Gerais (UFMG). 


Results and Discussions 

The optimized structural parameters obtained 
for the reactants [SnCl 2 1 and ris-Pt(Cl) 2 (PH 3 ) 2 2], 
transition state (TS), and the products cis- 
Pt(C 1)(PH 3 ) 2 (SnC 1 3 ) 4 and trans- 

Pt(Cl)(PH 3 ) 2 (SnCl 3 ) 5] are shown in Figure 1. The 
optimized angles of 2, 4, and 5 around the plat¬ 
inum atom are a little distorted from the expected 
optimal value of 90° for a d 8 square planar com¬ 
plex. The Pt—P bond distances predicted in our 
calculations are in good agreement with the exper¬ 
imentally observed Pt—P bond distances of 2.284 
A in Pt(CH3) 2 (PCH 3 Ph 2 ) 2 [10] and 2.303 A in 
trans- Pt (H) (SnCl 3 ) (PPh 3 ) 2 [11]. The Pt—Sn bond 
length of 2.604 A in complex 4 and 2.576 A in 
complex 5 are in good agreement with the experi¬ 
mental Pt—Sn values of 2.634 A in trans- Pt(SnCl 3 ) 
(COPh) (PEt 3 ) 2 [12], 2.601 A in trans- 

Pt(H)(SnCl 3 )(PPh 3 ) 2 [11], and 2.600 A in trans- 
Pt(H)(SnCl 3 )(PCy 3 ) 2 [13]. The Sn—Cl bond dis¬ 
tances of 2.331 and 2.350 A in complexes 4 and 5, 
respectively (see Fig. 1), agrees with the experi¬ 
mental finding of 2.283-2.367 A in the complex 
trans-Pt(H)(SnCl 3 )(PPH 3 ) 2 [13]. The structural pa¬ 
rameters calculated for the transition state (see Fig. 
1) indicate that the formation of compound 4 oc¬ 
curs in a concerted way, where the Pt—Sn bond is 
forming and the Pt—Cl bond is breaking, leading 
to a three-center transition state. 

It is well known that bond distance is a very 
sensitive parameter in describing the trans influ¬ 
ence, i.e., the ability of a ligand to labilyze the 
bond trans to it. If we compare the bond length of 
compounds 4 and 5 (see Fig. 1), it can be seen that 
the SnCl 3 group has a stronger trans influence 
than has the PH 3 group, because the Pt—Cl bond 
in compound 5 (2.411 A), where the chlorine is 
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r(Sn-Cl) = 2.377 
a = 97.851 


r(Pt-Cl) = 2.354 

r(Pt-P) = 2.228 
r(P-H) « 1.406 
a = 92.413 p = 83.664 
Y = 101.206 S = 82.668 

r(Pt-C16) = 2.364 
r(Pt-P) = 2.281(trans to SnCl 3 > 
r(Pt-P) « 2.340 (cis to SnCl 3 ) 
r(Pt-C13) - 2.871 
r(Pt-Sn) = 3.000 
r(Sn-C14) « 2.421 
0(Pt-Sn-C13) = 61.099 
0(Sn-C13-Pt) - 66.186 
0{C13-Pt-Sn) = 52.715 


FIGURE 1. RFH-optimized structural parameters for the SnCI 2 (1), frans-Pt(CI) 2 (PH 3 ) 2 (2), transition state (TS), 
c/s-Pt(CI)(PH 3 ) 2 (SnCI 3 ) (4) and frans-Pt(CI)(PH 3 ) 2 (SnCI) 3 ) (5) molecules. Bond lengths in angstroms and bond angles 
in degrees. 


trans to SnCl 3 group, is greater than is the Pt—Cl 
bond in compound 4 (2.373 A) where the chlorine 
is trans to the PH 3 group. It can also be seen from 
Figure 1 that the Pt—P bond length trans to the 
SnCl 3 group in compound 4 (2.412 A) is greater 
than is o the Pt—P bond length in compound 2 
(2.228 A). This fact corroborates our assertion that 
SnCl 3 is a stronger trans director than is the PH 3 
group. 

The trans influence can qualitatively explain 
why the compound cis- Pt(Cl)(PH 3 ) 2 (SnCl 3 ) / 4, iso- 
merizes to the trans- Pt(Cl)(PH 3 ) 2 (SnCl 3 ), 5, com¬ 
pound. If the PH 3 ligand stays in a trans position 


in relation to the SnCl 3 , these two groups will 
compete unevenly for the electron density from 
the Pt atom, so the Ph 3 group chooses to stay in a 
more favorable position trans to another PH 3 
group in compound 5. These qualitative argu¬ 
ments will be discussed in a quantitative way 
later. 

The total energies obtained for compounds 1-5 
are shown in Table I, and the energy diagram for 
the reaction is shown in Figure 2. As can be seen 
from Figure 2, the energetic trend is maintained 
for the reactants (1 + 2) and products (4 and 5) 
when an MP2 single-point calculation is per- 
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r(Pt-Cl) - 2.373 



r<Pt-Cl) = 2.411 



FIGURE 1. (continued) 


formed on the RHF-optimized geometries. The for¬ 
mation of intermediate czs-Pt(Cl)(PH 3 ) 2 (SnCl 3 ), 4, 
is favored by an amount of —37.5 kcal/mol and 
the occurrence of the isomerization cis- 
Pt(Cl)(PH 3 ) 2 (SnCl 3 ) -> trans- Pt(Cl)(PH 3 ) 2 (SnCl 3 ) 
(4 -> 5) is favored by -7.0 kcal/mol. The MP2 
single-point calculation on the RHF-optimized ge¬ 


ometry for the transition state (TS) structure gave 
an unexpected result —4.4 kcal/mol for the differ¬ 
ence between the energies of the reactants and the 
transition state (see Table 1 and Fig. 2). What this 
result shows us is that the transition state is more 
stable than are the reactants, i.e., the SnCl 2 inserts 
into the Pt—Cl bond without a barrier. This result 


TABLE I_ 

Total energy values obtained for the stationary points on the potential energy surface (Fig. 1). 


Energy (au) 


Compound 

RHF a 

MP2 b 

SnCI 2 (1) 

- 922.3586 

-922.6748 

Pt(CI) 2 (PH 3 ) 2 (2) 

-1722.8362 

-1723.4704 

TS 

-2645.1839 

-2646.1522 

c/s-Pt(CI)(PH 3 ) 2 (SnCI 3 ) (4) 

-2645.2359 

-2646.2050 

frarcs-Pt(CI)(PH 3 ) 2 (SnCI 3 ) (5) 

-2645.2497 

-2646.2161 


a The basis set used is described in the Method of Calculation section. 
b Single-point calculation at the RHF-optimized geometry. 
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can be related to the fact that the formation of the 
TS occurs in a concerted way, where the Pt—Sn 
bond formation and Pt—Cl bond breaking occur 
at the same time. Therefore, the energy involved in 
the Pt—Sn bond formation is sufficiently high to 
stabilize the TS leading to an insertion without 
barrier. 


Nature of the Bond 

The nature of the bond in transition-metal com¬ 
plexes has gained much attention from the theoret¬ 
ical point of view since the appearance of the 
Dewar-Chatt-Duncanson model [14]. This model 
considers two factors responsible for stabilizing 
the bond in transition-metal complexes: the cr~ 
donating interactions from the ligands to the metal 
and the tt back donation from the metal to the 
ligands. 

Several methods have appeared in the literature 
focusing on the evaluation of the energetic contri¬ 
bution to the bond energy as well as investigating 
the mixing of orbitals in the region of interaction 
between the metal and ligand. The construction of 
molecular orbitals in terms of fragment molecular 
orbitals was earlier employed by Fukui and co¬ 
workers [15] in a method called coupled fragment 
molecular orbital interaction for interacting sys¬ 
tems. Bagus et al. [16] developed the constrained 
space orbital variation (CSOV), where the wave 
function of the complex is constructed from the 
molecular orbitals of the ligand and the metal 
atom. Ziegler and co-workers developed the ex¬ 
tended transition state (ETS) method [17], based 
on the density functional formalism, which is simi¬ 


lar to the CSOV method. More recently, Frenking 
and co-workers developed the charge decomposi¬ 
tion analysis (CDA) [18] using fragment molecular 
orbitals. The CDA method consists of using linear 
combinations of fragment orbitals (LCFO) of prop¬ 
erly chosen fragments A and B for the interpreta¬ 
tion of the interactions in a molecule AB. The 
interaction is divided into three main contribu¬ 
tions: (a) the mixing between the occupied orbitals 
of A and empty orbitals of B, which indicates the 
magnitude of electron donation from A to B (A -> 
B), (b) the mixing between the occupied orbitals of 
B and the empty orbitals of A, which gives the 
extent of back donation, i.e., the electron donation 
from B to A (A <- B), and (c) the mixing between 
the occupied orbitals of A and the occupied or¬ 
bitals of B which indicates the extent of charge 
polarization in the region of bonding. Based on 
these points, the CDA method can be used as a 
quantitative view of the Dewar-Chatt-Duncanson 
model. 

To analyze the nature of the bond in complexes 
ds-Pt(Cl)(PH 3 ) 2 (SnCl 3 ), 4, and trans- 

Pt(Cl)(PH 3 ) 2 (SnCl 3 ), 5, we used the CDA method, 
as implemented in the CDA 2.1 program [19]. The 
total amount of donation, back donation, and bond 
energy obtained by the CDA is shown in Table II. 
As can be seen, in all situations analyzed, the 
magnitude of donation from the PH 3 fragment to 
the Pt(Cl)(PH 3 )(SnCl 3 ) fragment is greater than is 
the magnitude of back donation, which indicates 
that the PH 3 is not a good 7r-acceptor. The other 
fact that contributes to the low extent of back 
donation in relation to the electron donation is that 
back donation is favored when there is an 
electron-rich metal, which is not the present case. 


TABLE II_ 

Total amount of donation, back donation, and bond energy for different chosen fragments A and B, 
obtained by the CDA method. 



Fragment 

2 donation 

2 backdonation 

Bond energy 
(kcal / mol) 

A 

B 




P K 

Pt(CI) (PH 3 )(SnCI 3 ) 

0.597 e 

0.172 e 

52.0 

PH 3 b 

Pt(CI)(PH 3 )(SnCI 3 ) 

0.563 e 

0.071 e 

26.0 

PH 3 c 

Pt(CI)(PH 3 )(SnCI 3 ) 

0.614 e 

0.125 e 

46.2 


a PH 3 cis to SnCI 3 group in c/s-Pt(CI)(PH 3 ) 2 (SnCI 3 ), 4, compound (see Fig. 1). 
b PH 3 trans to SnCI 3 group in c/s-Pt(CI)(PH 3 ) 2 (SnCI 3 ), 4, compound. 

C PH 3 trans to another PH 3 group in compound trans- Pt(CI)(PH 3 ) 2 (SnCI 3 ), 5, compound (see Fig. 1). 
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The platinum atom is in a formal oxidation state of 
+ 2, assuming a d s configuration. Therefore, de¬ 
spite the fact that PH 3 is a poor 77 -acceptor ligand, 
the extent of back donation should be increased if 
we have a more electron-richer metal atom. 

When we compare the extent of back donation 
of the two PH 3 groups in the cis- 
Pt(Cl)(PH 3 ) 2 (SnCl 3 ) complex, 4 (see Fig. 1 and 
Table II), the extent of back donation from the PH 3 
group trans to the SnCl 3 is diminished, indicating 
that the SnCl 3 group is withdrawing electron den¬ 
sity from the platinum atom, and this reduces the 
electron density available for donating to the PH 3 
ligand. Based on this argument, the extent of back 
donation of the PH 3 trans to another PH 3 group in 
compound trans- Pt(Cl)(PH 3 ) 2 (SnCl 3 ), 5 (see Fig. 
1), should be greater than the one found to the 
PH 3 group trans to SnCl 3 in complex 4 and less 
than the extent of back donation of the PH 3 group 
cis to the SnCl 3 group in compound 4 (see Fig. 1 
and Table II). The results of CD A agree with these 
arguments as can be seen in Table II. 

The Pt—P bond energy calculated for the PH 3 
group cis to the SnCl 3 group (52.0 kcal/mol) is ca. 
26 kcal/mol greater than is the Pt—P bond energy 
calculated for the PH 3 group trans to the SnCl 3 
group (26.0 kcal/mol), as can be seen in Table II. 
This result provides a semiquantitative view of the 
trans effect of the SnCl 3 group, which, of course, 
could change if a correlated calculation is used to 
analyze the bond energy. 


The occurrence of cis -> trans isomerization in 
platinum (II) and Palladium (II) phosphine com¬ 
plexes is well known [20]. Three mechanisms can 
explain this reaction: the consecutive displacement 
mechanism [21], the Berry pseudorotation mecha¬ 
nism [20,22], and the dissociative pathway [23]. In 
addition, changes of metal, ligand, solvent, or tem¬ 
perature can substantially affect the equilibrium. 
Therefore, the ds-Pt(Cl)(PH3) 2 (SnCl 3 ) 2 -» trans- 
Pt(Cl)(PH 3 ) 2 (SnCl 3 ) 2 (4 -> 5) isomerization reac¬ 
tion needs a more detailed study focusing on the 
solvent and ligand influence. A theoretical study 
on this isomerization reaction is already under¬ 
way. Taking into account the strong trans influ¬ 
ence of the SnCl 3 (Table II), as discussed before, 
can lead to a possible mechanism for the isomer¬ 
ization occurring in the gas phase. This may be a 
dissociative pathway through T-shaped three-coor¬ 
dinate intermediates, as is exemplified in Scheme 2 
below. Some kinetic evidence for the intermediacy 
of a three-coordinate T-shaped species was ob¬ 
tained for cis and fnms-PdMe 2 (PR 3 ) 2 complexes 
[23]. 


Concluding Remarks 

We studied the insertion reaction of SnCl 2 into 
the Pt—Cl bond and the resulting complex cis- 
Pt(Cl) 2 (PH 3 ) 2 using ab initio MO methods. All the 
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stationary points located on the PES were fully 
optimized. The transition-state structure obtained 
indicates that the reaction proceeds through a 
three-center transition state. The optimized struc¬ 
tures are in good agreement with experimental 
findings for similar systems. The analysis of the 
nature of the bonds showed that the SnCl 3 ligand 
is a stronger trans director than is the PH 3 group, 
and it is capable of weakening the Pt—P bond 
trans to it by ca. 26 kcal/mol. The amount of back 
donation of the PH 3 group is less than the dona¬ 
tion, indicating that the PH 3 is not a good 7r-accep- 
tor ligand. The results of the charge decomposition 
analysis gave us support to infer with the cis- 
Pt(Cl)(PH 3 ) 2 (SnCl 3 ) -> frans-Pt(Cl)(PH 3 ) 2 (SnCl 3 ) 
isomerization reaction proceeds through a disso¬ 
ciative pathway with formation of T-shaped 
three-coordinate intermediates. We think that the 
information provided about this cocatalyst (strong 
trans influence of SnCl 3 and the formation of very 
stable compounds formed when tin interacts with 
platinum) is very important to study the elemen¬ 
tary reactions for the hydroformilation of olefins 
using this heterobimetallic complex. Our group is 
currently investigating this catalytic cycle. 

ACKNOWLEDGMENTS 

The authors would like to thank the CNPq 
(Conselho Nacional de Desenvolvimento Cientifico 
e Tecnologico) for support. We are also grateful to 
Prof. Carlos A. L. Filgueiras and Dr. Wagner M. 
Teles for stimulating discussions. This work was 
partially supported by the FAPEMIG (Fundagao 
de Amparo a Pesquisa do Estado de Minas Gerais) 
and the PADCT (Programa de Apoio ao Desen¬ 
volvimento Cintifinco e Tecnologico-Proc. No. 
62.0241/95.0). 


References 

1. (a) C. Y. Hsu and ML Orchin, J. Am. Chem Soc. 97, 3553 
(1975). (b) G. Parrinelo and J. K. Stille, J. Am. Chem. Soc. 
109, 7122 (1987). (c) J. K. Stille and G. Parrinelo, J. Mol. 
Catal. 21, 203, (1983). (d) L. Kollar, G. Parrinelo, and P. 
Pino, J. Organomet. Chem. 330, 305 (1987). (e) L. Kollar, J. 
Bakos, I. Toth, and B. Heil, J. Organomet. Chem. 350, 277, 
(1988). 

2. (a) R. Bardi, A. M. Piazzesi, G. Cavinato, P. Cavoli, and L. 
Toniolo, J. Organomet. Chem. 224, 407 (1982). (b) R. Bard, 
A. M. Piazzesi, A. Del Pra, G. Cavinato, and L. Toniolo, 
J. Organomet. Chem. 234, 107 (1982). (c) G. Cavinato and 
L. Toniolo, J. Organomet. Chem. 241, 275 (1983). 


3. P. Pregosin and N. Sze. Helv. Chim. Acta 61, 1848 (1978). 

4. (a) W. J. Stevens, H. Basch, and M. Krauss, J. Chem. Phys. 
81, 6026 (1984). (b) W. J. Stevens, H. Basch, M. Krauss, and 
P. Jasien, Can. J. Chem. 70, 612 (1992). 

5. W. J. Hehre, R. Ditchfield, and J. A. Pople, J. Chem. Phys. 
56, 2257 (1972). 

6. L. D. Pettit, Q. Rev. 1 (1971). 

7. A. Hollwarth, M. Bohme, S. Dapprich, A. W. Ehlers, A. 
Gobbi, V. Jonas, K. F. Koller, R. Stegmann, A. Veldkampe, 
and G. Frenking, Chem. Phys. Lett. 208, 237 (1993). 

8. M. W. Schmidt, K. K. Baldridge, J. A. Boatz, S. T. Elbert, 
M. S. Gordon, J. H. Jensen, S. Koseki, N. Matsunaga, K. A. 
Nguyen, S. J. Su, T. L. Windus, M. Dupuis, and J. A. 
Montgomery, GAMESS, J. Comp. Chem. 14, 1347 (1993). 

9. M. J. Frisch, G. W. Trucks, H. B. Schlegel, P. M. W. Gill, 
B. G. Johnson, M. A. Robb, J. R. Cheeseman, T. A. Keith, G. 
A. Petersson, J. A. Montgomery, K. Raghavachari, M. A. 
Al-Laham, V. G. Zakrzewski, L. V. Ortiz, J. B. Foresman, J. 
Cioslowski, B. B. Stefanov, A. Nanayaakkara, M. Challa- 
combe, C. Y. Peng, P. Y. Ayala, W. Chen, M. W. Wong, J. L. 
Andres, E. S. Replogle, R. Gomperts, R. L. Martin, D. J. Fox, 
J. S. Binkley, D. J. Defrees, J. Baker, J. P. Stewart, M. 
Head-Gordon, C. Gonzalez and J. A. Pople, GAUSSIAN 94 
(Revision A.l) (Gaussian, Inc., Pittsburgh, PA, 1995). 

10. J. M. Wisner, T. J. Bartchak, J. A. Ibers, J. J. Low, and W. A. 
Goddard III, J. Am. Chem. Soc. 108, 347 (1986). 

11. M. Gomez, G. Muller, D. Sainz, and J. Sales, Organometallics 
10, 4036 (1991). 

12. A. Albinati, U. N. Gunten, P. S. Pregosin, and H. J. Ruegg, 
J. Organomet. Chem. 295, 239 (1985). 

13. A. Del Pra, E. Forsellini, G. Bomberi, R. A. Michelin, and A. 
Ros, J. Chem. Soc., Dalton Trans. 1862 (1979). 

14. (a) J. S. Dewar, Bull. Soc. Chim. Fr. 18, C79 (1951). (b) 
J. Chatt and L. A. Duncanson, J. Chem. Soc. 2939 (1953). 

15. (a) H. Fujimoto, S. Kato, S. Yamabe, and K. Fukui, J. Chem. 
Phys. 60, 572 (1974). (b) S. Kato, H. Fujimoto, S. Yamabe, 
and K. Fukui, J. Am. Chem. Soc. 94, 2024 (1974). (c) H. 
Fujimoto, N. Koga, and K. Fukui, J. Am. Chem. Soc. 103, 
7452 (1981). 

16. (a) P. S. Bagus, K. Hermann, and C. W. Bauschlicher, 
J. Chem. Phys. 80, 4378 (1984). (b) P. S. Bagus, K. Hermann, 
and C. W. Bauschlicher, J. Chem. Phys. 81, 1966 (1984). (c) 
P. S. Bagus and F. Illas, J. Chem. Phys. 96, 8962 (1992). 

17. (a) T. Ziegler and A. Rauk, Theor. Chim. Acta 46, 1 (1977). 
(b) E. J. Baerends and A. Rozendaal, NATO ASI C176, 159 
(1986). (c) T. Ziegler, NATO ASI C378, 367 (1992). 

18. (a) S. Dapprich and G. Frenking, J. Phys. Chem. 99, 9352 
(1995). (b) A. W. Ehlers, S. Dapprich, S. F. Vyboischikov, 
and G. Frenking, Organometallics 15, 105 (1996). 

19. S. Dapprich and G. Frenking, CDA 2.1 (Marburg, 1994). 
internet: ftp/chemie.uni-marburg.de, login anonymous, 
directory pub/cda. 

20. G. K. Anderson and R. J. Gross, Chem. Soc. Rev. 9, 185 
(1980). 

21. F. Basolo and R. G. Pearson, Mechanism of Inorganic Reac¬ 
tions , 2nd ed. (Wiley, New York, 1967). 

22. R. S. Berry, J. Chem. Phys. 32, 933 (1960). 

23. (a) F. Ozawa, T. Ito, Y. Nakamura, and A. Yamamoto, Bull. 
Chem. Soc. Jpn. 54, 1868 (1981). (b) R. S. Paonessa and W. C. 
Trogler, J. Am. Chem. Soc. 104, 3529 (1982). 


650 


VOL 65, NO. 5 




Study of Ground and Excited States of 
Doped Polyacetylene 


G. P. DAS, D. S. DUDIS 

WL / MLBP, Wright Patterson Air Force Base, Ohio 45433 

Received 5 March 1997; revised 17 July 1997; accepted 29 July 1997 


ABSTRACT: Both theoretical and experimental studies in the past have indicated that 
the charge transport in doped polyacetylene is due primarily to traveling charged solitary 
waves along the polymer chain backbone accompanied by hopping from one chain to 
another. The conductivity in this model is still determined by a band gap. The nature of 
the ground and excited states of the doped system, however, is not fully understood. 
Previous ab initio calculations, based on transoctatetraene simulating transpolyacetylene 
interacting with a single iodine atom and also a fairly low level of basis set, have pointed 
to the possibility that, while the calculations at the Hartree-Fock level puts the charge- 
transfer state below the "covalent" state, the results are reversed when correlation is 
included. Apart from the basis-set inadequacy and the small size of the polyene 
considered in this study, it is also true that, in solution, iodine exists only as polyiodide 
ions 1^, I 5 , etc. In this work, we address some of the above deficiencies of the model. 

© 1997 John Wiley & Sons, Inc. Int J. Quant Chem 65: 651-654, 1997 


Introduction 

T he discovery [1, 2] that polyacetylene can be 
doped to near metallic conductivities, using 
donors, such as alkali metals, and acceptors, such 
as 1 2 or AsF 5 , as dopants has intensified interest in 
understanding the mechanism of charge transport 
in these materials. While qualitatively the conduc¬ 
tivity is now firmly believed to result from charged 
solitary waves, as first proposed by Su, Schrieffer, 
and Heeger (SSH) [3], the description of the ground 

Correspondence to: G. P. Das. 

International Journal of Quantum Chemistry, Vol. 65, 651-654 (1997) 
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and excited states participating in the charge- 
transport process remains a theoretical challenge. 

In a previous work, Das et al. [4] showed that if 
one simulates transpolyacetylene (f-PA) by 
transoctatetraene, the charge environment of a sin¬ 
gle iodine atom interacting with the polyene goes 
through a complete reversal as one goes from 
Hartree-Fock to a correlated calculation. While the 
Hartree-Fock ground state is a charge-transfer 
state with iodine acquiring close to a full electron 
at the expense of the polyene, the picture changes 
radically when correlation is included; the ground 
state now is essentially a neutral iodine interacting 
with the polyene, while the excited state is a 
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charge-transfer state I + C S H^ 0 . Based on this 
finding, it was proposed that the charge is trans¬ 
ported by the electrons populating these excited 
states via excitons representing a "charged soliton" 
traveling along the length of the polymeric chains. 
A quantitative study of the time evolution of such 
processes was carried out, leading to a high diffu¬ 
sion mobility and, consequently, large conduc¬ 
tivity. 

The above study, however, is flawed by several 
deficiencies: First, the polyene considered is too 
small with potentially large end effects. Second, 
iodine in solution is found to exist only in clusters 
such as 1 3 and I 5 . Also, the basis set used in the 
above calculation may not be adequate to repre¬ 
sent the charge-transfer states via-a-vis the neutral 
states even though the correlation level considered 
may have been right. Moreover, we neglected the 
effect of the neighboring polymer units, which is 
expected to be important since the separation be¬ 
tween neighboring units (6-7A) is not large enough 
for the dopant system to interact solely with one 
unit. In what follows, we analyze all the above 
deficiencies except that of the basis set, which, 
because of the great complexity of the problem, we 
defer to a later work, retaining for the present the 
simplest basis used in our previous work, namely, 
the Hay-Wadt ECP basis for iodine and STO-3G 
for the rest. 


Choice of Our Prototype System 

Using 1 3 instead of I increases the complexity of 
the analysis in several ways: First, we have a much 
larger number of valence electrons to deal with. 
Also, the determination of the ground and excited 
states has to take into account the fact that the 
geometry of 13 is very different from I 3 in addi¬ 
tion to the large geometry change the polyene 
system undergoes. Since I 3 is really 12 + 1 and 
Hartree-Fock gives very poor results for the bond¬ 
ing in 1 2 (usually predicting no binding at all), it is 
necessary that we employ all excitations involving 
all the valence electrons. At the Hay-Wadt level of 
the ECP basis, the various quantities of interest 
pertaining to the I 2 + I problem is shown in 
Table I. 

We shall consider C 12 H 14 instead of C 8 H 14 in 
this work. This will ensure much smaller end 
effects when the dopant is a single iodine atom 
and is likely to be adequate for I 3 . 


TABLE I_ 

ECP energies of atomic, diatomic, and triatomic 
iodine and their ions at the level of Hay-Wadt 
ECP basis set. 


System 

Energy (SCF) 
(Hartree) 

Energy (MCSCF) 
(Hartree) 

1 

-11.15723 


1 — 

-11.23469 


•2 

-22.31185 

-22.34149 

'3 

— 33.43640 

-33.48324 

i 3 - 

-33.59687 

-33.61844 

I 2 + 1 

-33.46908 

-33.49872 


Finally, adding to the above complexity, we 
have also to include in our discussion the interac¬ 
tion of the iodine molecule with both the adjacent 
polyene systems between which it finds itself. 
Thus, it will be necessary to calculate or estimate 
the interaction of the iodide ion if and when it 
forms not only with the polyene that it shares but 
also the neutral one on its other side. 


General Computational Framework 

The present calculations were performed using 
the GAMESS software with some important modi¬ 
fications. In our earlier work on the interaction of a 
solitary iodine atom with transoctatetraene, the 
active space selected for the CASSCF calculations 
consisted of the eight 77 valence and virtual or¬ 
bitals on the polyene and three on the iodine atom. 
The geometry optimization using these CASSCF 
wave functions was a major effort but it was still 
well within the usual time and "economic" restric¬ 
tions. In the present case, however, the active 
space needs to be enlarged to 21 orbitals (nine 
pertaining to the iodine atoms and 12 to the 
polyene 7 r-space). This is too large an effort to be 
treated in a straightforward CASSCF framework, 
the number of configuration running to millions. 
We adopted a different scheme to be called the 
modified MP2 scheme (MMP2) to extract the cor¬ 
relation information. 

For each state, we carry out the following steps: 
(a) We first construct the wave function for the 
state in question as a limited CASSCF involving 
only the Hartree-Fock occupied valence orbitals as 
the active space. In our case, this simply implies 
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inclusion of a single excitations into the only open 
shell available. The main purpose of using the 
CASSCF description is to retain the nature of the 
input orbitals through the SCF process for easy 
interpretability both for the "neutral" and 
"charge-transfer" states, (b) The bonding and anti¬ 
bonding 7r-orbitals are then localized. This is an 
important step and the details are found in [4]. 
This reduces the magnitude of the cross terms that 
are neglected in the present MP2 process, (c) A 
first-order Cl is then performed using all single 
excitations into the given virtual space from the 
configurations generated in the first step. This step, 
by virtue of Brillouin's theorem, will not yield any 
significant contribution for the state that has been 
optimized via a CASSCF process described above. 
Flowever, this will affect the state not optimized, 
(d) Now, pairwise correlation contributions are 
evaluated for each double excitation for both the 
ground and excited states. In all the calculations 
being reported here, we selected for our virtual 
space only the set of 7r-antibonding orbitals be¬ 
longing to the polyene. 


Results and Discussion 

In Table II, we compare the results obtained 
from rigorous CASSCF calculations with those 
based on our approximate MMP2 model. Since the 
quantities of interest are the excitation energies of 
the states and not their absolute energies, it is 
evident that the approximate scheme is adequate 
for our purpose. 


In Table III, the interaction potential of I 3 ap¬ 
proaching the plane of C 12 H 14 vertically is shown 
in the ground and excited states. No geometry 
reoptimization is carried out other than the fact 
that the geometry used for the charge-transfer state, 
which is the ground state at the uncorrelated level, 
is that of the cation C 12 H^ 4 . Similarly, the opti¬ 
mized geometry of neutral C 12 H 14 is used for the 
covalent state, which is the midgap excited state at 
the Hartree-Fock level. The geometry of 13 was 
frozen at the minimum geometry of the ion IJ. 

In Table IV, we show the RHF interaction ener¬ 
gies of 13 in its linear optimized geometry as it 
approaches the plane of a neutral polyene molecule 
C^ H u . In Table V, we show a balance sheet of 
energies based on our MMP2 calculations for I 3 . 
Again, the geometries used for the neutral and 
charge-transfer states are those of C 12 FI 14 and 
C 12 H 14 , respectively. The charge-transfer-state en¬ 
ergy is corrected upward because of a net repul¬ 
sion from the neighboring polyene unit assumed 

TABLE III_ 

MMP2 energies vs. distance (R) of l 3 from the plane 
of the polyene C 12 H 14 ; the length of 1 3 is taken 
parallel to the polyene chain axis. 


R 

(angstroms) 

Energy 

Ground state 
(Hartree) 

Excited state 
(Hartree) 

2.94 

“490.43777 

-490.38256 

3.44 

-490.45474 

-490.39861 

3.94 

-490.45388 

-490.40027 

4.44 

-490.44918 

-490.39996 


TABLE II 


Comparison of rigorous CASSCf with MMP2: columns 2 and 4 represent the CASSCF results for the ground- 
state and first-excited-state energies, while columns 3 and 5 are the corresponding MMP2 results. 

R 

(angstroms) 


Energy 



Ground state 

Excited state 

CASSCF 

(Hartree) 

MMP2 

(Hartree) 

CASSCF 

(Hartree) 

MMP2 

(Hartree) 

2.5 

-316.23030 

-316.18606 

-316.17745 

-316.12908 

3.0 

-316.23979 

-316.20431 

-316.19560 

-316.15834 

3.5 

-316.23999 

-316.20677 

-316.19101 

-316.16276 

4.0 

-316.23893 

-316.20625 

-316.18479 

-316.15795 

4.5 

-316.23831 

-316.20577 

-316.17734 

-316.15097 
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TABLE IV_ 

Energies as l 3 “ approaches the neutral C 12 H 14 ; R is 
the perpendicular distance of of l 3 ~ from the plane 
of the polyene (the length of l 3 is taken parallel to the 
polyene chain axis). 


R 

(angstrom) 


RHF energy 
(Hartree) 


2.94 

-490.36435 

3.44 

-490.37127 

3.94 

-490.37145 

4.44 

-490.37144 


— 490.40382 


TABLE V_ 

Balance sheet of energies of the charge-transfer 
and covalent states for the l 3 + C 12 H 14 system 
calculated at the optimized geometries of C 12 H 14 
and C 12 H 14 , respectively. 

Energies 


Charge-transfer 

state Neutral state 

(Hartree) (Hartree) 


MMP2 energy 
Correlation error 

— 490.45474 

-490.40027 

in l 3 and l 3 and 
Repulsive 
correction from 

-0.02157 

-0.02964 

neighbor 

Geometry 

0.03255 

-0.00227 

correction 


-0.03268 

Total energy 

-490.44376 

-490.46486 


separated from the first by 6.88 A. The neutral-state 
energy is corrected downward since the lowest 
energy geometry of I 3 is readily iodine and I 2 well 
separated. 

It is clear that within the approximation of the 
basis set used in this work as well as subject to the 
omission of "atomic" correlation corrections in the 
calculations our present model predicts that with 
I 3 as the dopant the energy of the charge-transfer 
state is slightly higher than that of the neutral 
state. When the atomic correlation terms are in¬ 
cluded, which essentially to improve the electron 
affinity of iodine by nearly 1 eV, these two states 
promise to be near-degenerate. 
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ABSTRACT: We propose a two-parameter model Hamiltonian to analyze the low- 
energy dynamics of endohedral C 60 fullerene complexes such as Li + @C 60 , Na + @C 60 , 
CO@C 60 , LiF@C 60 , and LiH@C 60 . The simplicity of the model is a direct consequence of 
very strong constraints imposed by both high symmetry and close confinement conditions, 
on the spherical anisotropy of the guest-cage interaction. In the parameter space region 
expanded by the selected systems, the guest low-energy dynamics undergo transitions 
from almost free rotations to more or less hindered rotations. These dynamics produce 
very particular features in the low-temperature rotational spectra, which can be taken, 
therefore, as fingerprints of a particular guest-cage interaction. © 1997 John Wiley & Sons, 
Inc. Int J Quant Chem 65: 655-663, 1997 
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Introduction 

T he recent discovery of the fullerenes has open 
a new field in science and technology, joining 
physicists and chemists in an effort to understand 
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the many interesting possibilities that these new 
molecules offer, both in the gas and the fullerite 
solid phases. Among these possibilities we find 
those related with the use of these carbon aggre¬ 
gates as microscopic cavities to encapsulate atoms 
and even small molecules. Some experiments have 
succeeded in producing traces of metallofullerenes 
(Li, Na, K, Rb, Cs, Ca, Ba, La,...) [1, 2], rare-gas 
fullerenes [3, 4], and diatomic fullerenes (CO@C 58 ) 
[5]. 

The solid phases of these endohedral complexes 
(endohedrally doped fullerites) become, in general, 
significantly more stable than the fullerites with 

CCC 0020-7608 / 97 / 050655-09 
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exohedral doping. On the other hand, the very 
interesting electric and magnetic properties that 
have been found in exohedral fullerites are also 
expected to occur in their endohedral counterparts. 
Moreover, very interesting applications have been 
suggested for these particular materials. As an 
example, we mention Cioslowski and Nanay ak- 
kara's proposal [6] for endohedral fullerites of po¬ 
lar molecules (CO, LiF, LiCl, NaF, NaCl), which 
may rotate more or less freely depending on the 
dipole-dipole interaction and on the temperature. 
These endohedral fullerites would "constitute a 
new class of ferroelectric materials and the first 
practical realization of ideal electric dipolar lat- 
tices," which could show low-temperature phase 
transitions between dipole ordered and disordered 
states. The transition temperatures estimated by 
these authors are in the range 25-60 K. 

All these potential applications have stimulated 
many theoretical studies to elucidate the electronic 
structure of these compounds and the nature of 
the guest-cage interaction. The traditional theoret¬ 
ical methods of molecular physics and quantum 
chemistry have been proved to be very useful in 
establishing new results. Besides, great experimen¬ 
tal efforts are currently devoted to obtain endohe¬ 
dral species in macroscopic quantities, and a very 
promising technique has been reported recently 
[7]. In the meanwhile, the results from these theo¬ 
retical and experimental work have motivated fur¬ 
ther theoretical studies on dynamical aspects such 
as the rotational-vibrational motion of the confined 
species. These dynamics are going to affect some 
of the fullerite properties. For instance, a more 
accurate treatment of the phase transitions pre¬ 
dicted by Cioslowski and Nanayakkara would re¬ 
quire knowledge of this motion. Vibrational-rota¬ 
tional spectra are a primary source of information 
on these dynamics, but experimental data of this 
kind are not yet available. 

Ab initio calculations for different trapped 
species have produced some sparse data on the 
guest-cage interaction; they provide mainly infor¬ 
mation on the equilibrium positions and vibra¬ 
tional frequencies for the confined atom or 
molecule. When the interaction is of physical na¬ 
ture, as may be the case when the guest species is 
a closed-shell atom or molecule, this may be fairly 
estimated using semiempirical electrostatic argu¬ 
ments. In this way analytical expressions have 
been proposed [8-11], which reproduce rather ac¬ 
curately [8, 9] the experimental and ab initio [12, 
13] data. All this information on the nature of the 


guest-cage interaction has made possible several 
attempts to simulate theoretically the vibrational- 
rotational spectra of some endohedral fullerenes. 

The first analyses [14-18] dealt with confined 
atoms and used a spherical representation of the 
atom-cage interaction in which the anisotropic 
terms were neglected. Within this approximation 
the rotational-vibrational spectrum for on-center 
endohedral fullerenes is reduced to that of an 
isotropic anharmonic three-dimensional (3D) oscil¬ 
lator [16, 17], and the corresponding spectrum for 
off-center fullerenes is that of an effective diatomic 
molecule [14, 15, 18]. The spherical representation 
of the atom-cage interaction is in general well 
justified for on-center atom complexes at low vi¬ 
brational energies, but not for diatomic guests and 
off-center atoms. Classical trajectory studies for 
Ne@C 60 [19] and classical simulations of the dipole 
spectra of Li + @C 60 [20] anticipated the relevance 
of the interaction anisotropy. This has been con¬ 
firmed systematically by our group in a series 
of publications [21-25] in which we simulate di¬ 
pole and Raman rotational-vibrational spectra for 
closed-shell confined atoms and diatomic mole¬ 
cules. We have found that the most significant 
manifestations of the interaction anisotropy appear 
in the low temperature (T ~ 50 K and lower) spec¬ 
tra. Low anisotropic interactions are found for the 
CO@C 60 and Na + @C 60 complexes, whose spectra 
suggest the dominance of almost free rotations. 
The more anisotropic interactions of Li + @C 60 , 
LiF@C 60 , and LiH@C 60 give rise to spectra, whose 
assignment requires the existence of librations and 
hindered rotations. 

In this study we will show that all these spectra 
can be understood in terms of only two scaled 
parameters, which determine the relative magni¬ 
tude of the anisotropic part of the guest-cage 
potential. We will consider isolated complexes, but 
many of our conclusions can be easily extended 
with little change to the corresponding fullerites. 
For completeness we will also review some of our 
previous results. In the following section we pre¬ 
sent an effective Hamiltonian describing the rota¬ 
tional motion of endohedral complexes of atoms 
and diatomic molecules. The third section is dedi¬ 
cated to a systematic analysis of the eigenvalue 
spectrum of this Hamiltonian as a function of the 
two scaled parameters. In the fourth section we 
show how to use this information in order to 
assign the features of the low-temperature rota¬ 
tional spectra. A further discussion and the main 
conclusions are presented in the last section. 
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Effective Scaled 
Rotational Hamiltonian 

The different time scales involved in the dy¬ 
namics of endohedral complexes of atoms and 
diatomic molecules can be used to obtain an effec¬ 
tive Hamiltonian for the low-frequency motion in 
which we are interested. After separating the elec¬ 
tronic degrees of freedom within the Born-Op- 
penheimer approximation, the next shorter time 
scales correspond to the cage, guest, and relative 
guest-cage vibrational motions [21-23]. At suffi¬ 
ciently low energies (up to E ~ 150 cm -1 ) the 
complex will be in the ground state of these mo¬ 
tions, and a good approximation describing the 
remaining dynamics can be provided by the effec¬ 
tive rotational Hamiltonian [22, 23] 

1 l 2 

^rot = ITT K 2 + y + Kff( a / Pr T> 0/ <£)/ (1) 

where K = J — 1 is the cage angular momentum 
operator, which is the difference between the total 
angular momentum J and the angular momentum 
1 associated with the motion of the confined atom 
or molecule. I c = §6 OM c R^ is the cage moment 
of inertia, with M c being the carbon mass and R c 
the cage radius (3.55 A); I ? is the guest moment of 
inertia, and V e({ is an effective potential depending 
on the cage (Euler angles a, (3, y) and guest {9, <f>) 
orientations. This Hamiltonian represents two ro¬ 
tors coupled by kinetic and potential terms. J and 

V ei( are, in principle Independent operators due to 
centrifugal distortion. However, this dependence 
has been found to be unimportant in all cases 
studied so far [21-23], and only very fine details 
depend on it; thus we will neglect it in the follow¬ 
ing analyses. 

In Ref. [23] we established that this Hamiltonian 
is valid not only for confined atoms with large 
enough eccentricity in their equilibrium positions, 
but also for diatomic molecules with axial equilib¬ 
rium configurations; in this case, the relative di¬ 
atom-cage vibrational motion must also be in its 
ground state. The axial equilibrium configuration 
is found in all known examples of C 60 endohedral 
complexes of diatomic molecules. 

The relatively large value of the cage moment of 
inertia I c makes a full quantal treatment of the 
cage rotation a hard task; for instance, the number 
of J values involved in the rotational spectra, even 


at the low temperature considered here would be 
very large (/ up to ~ 300). However, this same 
large I c value helps in providing further approxi¬ 
mations. The simplest one assumes an infinite-mass 
cage, and leads to the Hamiltonian 

= y l 2 + V e{f (0, <f>), (2) 

where B e = h 2 /2l g is the effective rotational con¬ 
stant. This depends on the equilibrium configura¬ 
tion of the guest species, which is practically deter¬ 
mined by the spherically isotropic part of the 
guest-cage interaction [21, 22]. The B e values cal¬ 
culated for our endohedral complexes are given in 
Table I. 

It is known that there is a critical temperature, 
T ~ 250 K, in fullerites for the transition between 
nonrotating to almost free rotating fullerene mole¬ 
cules. For the low temperatures considered in this 
work, then, the motion of the fullerene molecules 
should be practically frozen, and the model in Eq. 
(2) would be then the adequate one. In this case 
Vg ff should also account for additional interactions 
due to the crystal lattice. As will be discussed 
later, these additional terms are expected to be 
small when the density of empty cages in the 
fullerite is large enough. 

If not fully quantally, the effects of the cage 
rotation can be included semiclassically with high 
reliability. This has been done for the determina¬ 
tion of rotational spectra in Refs. [21-23]. It is 
assumed there that the C 60 cage rotates freely with 
a classical and constant angular momentum. One 
then finds that the main features of these rota¬ 
tional spectra are already predicted by the Hamil¬ 
tonian in Eq. (2). So we will concentrate our atten¬ 
tion in the analysis of this simple model. The 
particular effects of the cage rotation will be dis¬ 
cussed in the fourth section. 


TABLE I 


Rotational constant and scaled potential parameters 
for different endohedral complexes. 


Guest 

B e (cm h 

°6 

U 10 

Li + 

1.45 

78.77 

80.22 

Na + 

1.35 

7.89 

0.78 

CO 

1.78 

5.46 

0.22 

LiH 

1.82 

53.65 

30.76 

LiF 

1.27 

80.94 

31.58 
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The reliability of our theoretical analysis now 
depends on the accuracy of cage-guest effective 
potential surface V eti , which belongs to the totally 
symmetric representation (A g ) of the I b icosahe- 
dral group. As discussed in Ref. [22], confinement 
and symmetry constrain enormously the form of 
the potential. A natural expansion of the potential 
surface is done in spherical harmonics (L, M); 
then the C 60 I h symmetry restricts severely the L 
values participating in such an expansion (L = 
0 , 6 , 10 , 12 ,...); furthermore, if electrostatic forces 
(dispersion-repulsion, cage polarization, etc.) dom¬ 
inate the interaction (as in the closed-shell ions and 
molecules considered in our work) the strong con¬ 
finement will make such interaction rather smooth. 
As a result of all these constraints, it turns out that 
practically only L = 0 , 6 , and 10 manifolds are 
required in the expansion of the semiempirical 
expressions for V e{{ (relative errors are ~ 5% at 
most). The L = 0 term is a constant; the L = 6 ,10 
manifolds are reducible representation of the I h 
point group. When reduced, each one gives just 
one A g representation. Therefore, we may accu¬ 
rately represent the effective interaction with the 
expression 

Ve ff = C 0 V o + C 6 V ( 5 + C 10 V 10 , (3) 

where C L are coefficients, and V L represents the 
irreducible A % representation coming from the L 
manifold; these functions are chosen to be real and 
normalized as fV L 2 d£l = 1. Some of their values 
are shown in Figure 1. 

After scaling and redefining the energy origin, 
Eqs. (2) and (3) lead to the expression 

H* m = l* 2 + C*V 6 + Cf 0 V 10 , (4) 

where H* m = (H im - C 0 V 0 )/B e , 1* = l/h, C* = 
C n /B e . In conclusion, only the two parameters C* 
and Cf 0 determine the low-energy rotational dy¬ 
namics in our model. Their values, for the systems 
selected in our work, are given in Table I. 

A similar model has been proposed for the 
rotational dynamics of atom-spherical top van der 
Waals complexes [26], to represent situations op¬ 
posite to ours, namely those in which the spherical 
top is much lighter than the atom. The meaning, 
range, and effect of these model parameters is thus 
quite different from ours. 

Since the dimension of the parameter space of 
Hamiltonian (4) is just two, a systematic analysis 
of the effect of the anisotropy on the eigenvalue 



e° 

FIGURE 1. Values of the V 6 and V 10 terms for the 
guest-cage interaction. They are given as a function of 
the polar angle 6 for azimutal angle <j> = 0. The z axis 
has been chosen along the C 5 fullerene symmetry axis 
and one of the neighbor C 2 symmetry axis lies on xz 
plane. The first maximum in V 10 corresponds to the 
middle of a short C—C bond and the second one to a 
long C—C bond. The four absolute minima in both 
panels correspond to the hexagon centers. 

spectrum of this Hamiltonian can be readily car¬ 
ried out. This will be presented in the following 
section. 


Analysis of the Rotational Eigenvalue 
Spectrum in the Parameter Space 

Figure 2 presents the parameter space expanded 
by the endohedral complexes selected in our work. 
A different symbol indicates the position of each 
system. We will now study the evolution of the 
low-energy eigenvalues following the four straight 
lines drawn in this figure. The corresponding cor¬ 
relation diagrams are given in Figures 3-6. The 
spherical harmonic (|/, m)) basis set has been used 
to diagonalize the scaled Hamiltonian in Eq. (4). 
Here, l values up to 30 have been included to get 
converged spectra. In Figure 3 we notice that the 
eigenvalue spectrum for Na + @C 60 corresponds to 
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C 6 * 


FIGURE 2. Parameter space of the model Hamiltonian 
in Eq. (4). Different symbols indicate the position of the 
endohedral complexes selected in our work. Along the 
lines drawn on this figure we have calculated the energy 
correlation diagrams in Figures 3 to 6. 

that of a slightly perturbed free rotor. In particular, 
each l manifold splits according to the different 
irreducible representations of the I h symmetry 
group, with practically no mixing between differ¬ 
ent l manifolds (basically the same spectrum is 
obtained for CO@C 60 [24, 27]). 



FIGURE 3. Energy correlation diagram as function of 
the Cg parameter calculated along the line in Figure 
2 . 



FIGURE 4. Energy correlation diagram as function of 
the Cg parameter calculated along the R 2 line in Figure 
2 . 


In going from Na + @C 60 to Li + @C 60 , mixing 
between different l manifolds increases and a sin¬ 
gle "quasi-band-gap" structure emerges at the bot¬ 
tom of the energy scale. The same structure ap¬ 
pears in the eigenvalue spectrum of LiH@C 60 and 
LiF@C 60 (see Figs. 4 and 5), and of any system 
lying in parameter space expanded by these three 



FIGURE 5. Energy correlation diagram as function of 
the C| parameter calculated along the R 3 line in Figure 
2 . 
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FIGURE 6. Energy correlation diagram as function of 
the C| parameter calculated along the R 4 line in Figure 
2 , 

complexes. Whenever the quasi-band is formed, it 
always contains 20 states grouped in the six I h 
symmetry manifolds © T lu © T 2u © G ? © G u 
© . This number coincides with the number of 

hexagon wells in the guest-cage potential surface. 
Thus the band represents trapping of the guest 
species in the hexagon wells (librational motion); 
tunneling connects different wells and accounts for 
the observed bandwidth. A look at the correspond¬ 
ing eigenfunctions confirms these conclusions: 
These states present the largest amplitudes around 
the hexagon wells of the potential [24, 25]. Above 
the gap, no other bands can be recognized in the 
eigenvalue spectra of any system lying in parame¬ 
ter space around and between the three anisotropic 
complexes; this is consistent with the fact that the 
hexagon well is not deep enough to support more 
than one quantum state; thus instead of additional 
bands, a kind of "quasi-continuum" of difficult 
assignment appears. The energy at which the 
quasi-continuum starts represents the onset of 
less-localized motion (hindered rotations). This re¬ 
sult is confirmed by the delocalized character of 
the wave functions for states lying in this energy 
region. An exception to this rule appears in a set of 
12 states, separated in the four symmetry species 
A g © T lu © T 2l( © H g , whose wave functions pre¬ 
sent localization around the 12 pentagon centers. 
This localization is, however, not as perfect as the 
one found around the hexagon wells; some mixing 


takes place with delocalized states. Around the 
pentagon centers the guest-cage potential takes 
maximum values; a local minimum may exist but 
not deep enough to support a state. By decreasing 
the magnitude of the C* parameter we can in¬ 
crease the depth of this well, which may now be 
able to support one state. This is what can be 
observed in Figure 6: a pentagon band emerges 
out of the manifold of delocalized states at low C* 
values. When this band hides in the delocalized 
manifold of states, mixing takes place, but most of 
the pentagon character is preserved, except at the 
infrequent avoided crossings, where that character 
may be shared by another state. The whole situa¬ 
tion is that of a kind of "quasi-resonance band." 
This is a quasi-band analog of the separatrix and 
saddle point quasi-resonances found in simple sys¬ 
tems [28]. It is known that this localization phe¬ 
nomenon is quite stable and robust against pertur¬ 
bations, as can be confirmed in our case by the 
correlation diagrams. 

At high enough energies mostly free rotations 
should dominate the dynamics; for instance, this 
happens in Li + @C 60 for angular momentum 
quantum numbers higher than l ~ 20. 


Examples of Rotational Spectra 

The features of the eigenvalue spectrum and 
eigenfunctions analyzed in the previous section 
are going to determine the structure of the low- 
temperature rotational spectra, which may now be 
considered as fingerprint of a particular guest-cage 
interaction. Low-temperature dipole and Raman 
rotational spectra for these endohedral complexes 
have been calculated in Refs. [21-25]. As an exam¬ 
ple we present in Figures 7 and 8 the dipole 
spectra (T = 50 K) for the Na + @C 60 and Li + @C 60 
complexes as a function of the reduced frequency 
co* = hco/B e . These spectra are the result of a 
convolution with a Lorentzian whose full width at 
half maximum is A co* = 0.2. While panels (a) cor¬ 
respond to the infinite-mass cage model of Eq. (4), 
panels (b) present the effect to the cage rotation 
when this is treated semiclassically as indicated 
earlier. Note that the rotation of the cage enhances 
the free rotation of the low-anisotropy Na + @C 60 , 
and broadens the librational bands of the more 
anisotropic Li + @C 60 . This is a general result estab¬ 
lished in Refs. [22, 23]; there we showed that this 
opposite effect is the result of the competition 


660 


VOL 65, NO. 5 




ROTATIONS IN ENDOHEDRAL C 60 FULLERENE COMPLEXES 



0 5 10 15 20 25 30 


FIGURE 7. Dipole rotational spectra for Na + @C 60 
calculated at T = 50 K. Intensities, in arbitrary units, are 
given as a function of scaled frequencies. Panel (a) was 
obtained with the infinite-mass-cage Hamiltonian in Eq. 
(2). Panel (b) was calculated using a semiclassical 
approximation for the cage rotation (Ref. [23]). In all 
cases a Lorentzian of full width 0.2 has been used to 
convolute the spectrum. 

between a centrifugal term that couples the guest 
rotation to the cage rotation and the anisotropic 
terms of the guest-cage interaction. When cen¬ 
trifugal coupling dominates over anisotropic po¬ 
tential coupling, the latter is effectively reduced; 



0 20 40 60 80 


FIGURE 8. Same as Figure 7 for Li + C 60 . 


the result is a sharper rotational spectrum. On the 
contrary, if the anisotropic potential coupling dom¬ 
inates as in the librational bands, the cage rotation 
adds an extra broadening, which physically trans¬ 
lates into a high tendency for the guest species to 
follow the cage rotation; as was shown in Refs. [22, 
23], this rotational broadening can be accurately 
simulated by a smoothing of the infinite-mass-cage 
spectrum with a convoluting function whose width 
is the estimated cage-rotational broadening ( — 1.5 
cm” 1 at T = 50 K). 

The spectrum for Na + @C 60 confirms the almost 
free-rotor nature of its eigenvalue spectrum. It 
shows the 2 B e (just 2 in reduced units) oscillation 
typical of a diatomic molecule. Practically equal 
spectrum is obtained for CO@C 60 . On the other 
extreme, the spectrum of Li + @C 60 shows wide 
librational bands. The less intense half of the spec¬ 
trum is basically due to transitions within the 
hexagon quasiband structure of the eigenvalue 
spectrum; the right and more intense half is caused 
by transition out of this band into the states lying 
above the gap. Most of the high-intensity lines can 
be assigned to transitions ending in one of the 12 
states of the quasi-resonance pentagon band. This 
is due to the fact that the threshold for hindered 
rotation is determined by the high potential re¬ 
gions around the pentagon centers. The complexes 
LiH@C 60 and LiF@C 60 show similar features. 


Discussion and Conclusions 

We have obtained a two-parameter scaled model 
Hamiltonian which may represent rather accu¬ 
rately the low-energy dynamics of endohedral C 60 
complexes of off-center atoms and small diatomic 
molecules. The simplicity of the model is a direct 
consequence of the high symmetry of the cage, the 
strong confinement conditions, and the mostly 
physical nature of the guest-cage interaction in 
the selected systems Na + @C 60 , Li + @C 60 , CO@ 
C 60 , LiH@C 60 , LiF@C 60 . This same simplicity has 
allowed us to perform a systematic analysis of the 
structure of the eigenvalue spectrum for the model 
in the parameter space relevant to those endohe¬ 
dral complexes. Energy correlation diagrams as a 
function of the parameters show transitions be¬ 
tween basically two types of structures. On one 
side we have a parameter region around Na + @C 60 
and CO@C 60 complexes in which the model 
eigenvalue spectrum and the corresponding low- 
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temperature rotational spectra are like those of 
slightly perturbed free rotors. In the higher param¬ 
eter region where Li + @C 60/ LiH@C 60/ and 
LiF@C 60 lie, the effects of the guest-cage interac¬ 
tion anisotropy becomes more relevant; a potential 
well appears around the C 60 hexagon centers deep 
enough to support a quantum state, which causes 
the appearance of a single quasi-band-gap struc¬ 
ture at the bottom of the energy scale, whose states 
show large amplitudes at the hexagon wells. This 
band should therefore be associated with libra- 
tional motion within these wells. Above the energy 
gap we find the threshold of hindered rotation and 
more delocalized states. Embedded in this region 
we have found a quasi-resonance band with states 
showing large amplitudes around the pentagon 
centers, which are, in general, regions of unstable 
equilibrium. By changing the relative magnitude 
of the model parameters one can stabilize this 
regions and convert the quasi-resonance-band into 
a quasi-band separated by energy gaps from the 
rest of the states. We have seen that the low 
temperature rotational spectra of the system in this 
region can be mostly assigned in terms of transi¬ 
tions within the hexagon band and out of this 
band into the pentagon quasi-resonance-band. 

All this analysis has been carried out for a 
model that assumes that the cage rotation is frozen. 
However, we have seen that this rotation does not 
change the features in the rotational spectra which 
are related to the anisotropy of the guest-cage 
interaction, on the contrary, it sharpens the transi¬ 
tion between the low and high anisotropy features. 
Actually one could realize rotating and nonrotat¬ 
ing cage cases using endohedrally doped fullerites. 
First, take into account the fact that fullerites pre¬ 
sent a transition (T ~ 250 K) between a phase in 
which the rotation of the individual fullerene 
molecules is hindered and a phase in which this 
rotation is isotropically free, and that below T ~ 90 
K all these motions are frozen [29]. If the density of 
doping is low enough, one can disregard the 
dipole-dipole interaction appearing between 
nonempty different cages; then the spherical 
anisotropy of the crystal field acting on a particu¬ 
lar guest is going to be practically determined by 
its cage, since the distance between cages is rela¬ 
tively large (— 10 A). So, if we lower the tempera¬ 
ture sufficiently (T < 90 K), we can realize our 
low-energy two-parameter model. Above the criti¬ 
cal temperature for free cage rotation (T ~ 250 K) 


we should observe the effects of this rotation on 
the rotational-vibrational spectra, but the tempera¬ 
tures required would make necessary the use of a 
model which treats higher frequency motions not 
included in our simple model. The best low-tem¬ 
perature experimental realization of the rotating 
case could be done, of course, using gas-phase 
techniques. 

Low-temperature spectra become, therefore, fin¬ 
gerprints of a particular guest-cage interaction, 
and thus a primary experimental source of infor¬ 
mation on this interaction. This interaction thus 
determines the low-energy rotational dynamics of 
the confined species, on which many of the proper¬ 
ties of endohedral fullerites may depend, e.g., the 
critical temperatures (25-60 K) for the dipole- 
ordered phases predicted by Cioslowski and Na¬ 
nay akkara [6]. 

The fact that the guest-cage interaction in com¬ 
plexes studied in our work has a dominantly phys¬ 
ical nature allowed us to use semiempirical ex¬ 
pressions for it. Strong confinement and symmetry 
constraints reduce enormously the effects of possi¬ 
ble inaccuracies and thus increase the reliability of 
the calculations. We believe that these results could 
be valid as well for interaction of more chemical 
nature, i.e., these same strong constraints may re¬ 
duce, also in this case, the number of relevant 
parameters to just the same two ones which come 
from the L = 6, 10 terms of the expansion of the 
potential in spherical harmonics. In this way on 
could extend our analysis to off-center complexes 
such O@C 60 . In this case, an epoxy-type (C—O 
—C) bond has been suggested under the short C 
—C fullerene bond [20]. A situation like this can 
be produced with our model by choosing a nega¬ 
tive C* 0 coefficient. Therefore, chemical interaction 
complexes would explore other regions of our 
two-parameter space. However, the higher libra- 
tional frequencies expected for chemical bonding 
may make necessary the joint treatment of these 
modes with other vibrational modes which were 
excluded in our model. 
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ABSTRACT: We extended the dynamic response theory in the Moller-Plesset 
perturbation theory (MPPT) based on the quasi-energy derivative (QED) method for 
closed-shell systems to that for open-shell systems. In this study we perform the 
calculations of frequency-dependent polarizabilities a ( — or, w) for nondegenerate open- 
shell doublet systems (Li, Na, and K atoms and BeH, MgH, CaH, CN, and NH 2 
molecules) in the second-order Moller-Plesset perturbation theory (MP2) starting with 
time-dependent restricted open-shell Hartree-Fock (TDROHF) approximation. © 1997 
John Wiley & Sons, Inc. Int J Quant Chem 65: 665-677, 1997 


Introduction 

W ith the rapid development of laser spec¬ 
troscopy in experiment, the dynamic re¬ 
sponse properties are great topics in quantum 
chemistry now. For closed-shell systems, there are 
results of calculations of frequency-dependent po¬ 
larizabilities and hyperpolarizabilities at the not 

Correspondence to: T. Kobayashi. 


only time-dependent Hartree-Fock (TDHF) level 
[1-3] but also correlated level [4-13] of theory. 
Sasagane, Aiga, and Itoh have proposed the 
quasi-energy derivative (QED) method, which is a 
more general method to define the dynamic re¬ 
sponse property [14]. In the QED method, the 
frequency-dependent polarizability is defined as 


<x A g( co; <d) 


d 2 W(t) 

de A (-(o) de B ((*)) f 


( 1 ) 
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where W(t) is quasi-energy and e B (a>) is the indi¬ 
rection electric field strength of the e~ lojt Fourier 
component. Aiga, Sasagane, and Itoh [15] have 
formulated the frequency-dependent polarizabili¬ 
ties and hyperpolarizabilities for closed-shell sys¬ 
tems at the second-order Moller-Plesset perturba¬ 
tion theory (MP2) level based on the QED method 
and Aiga et al. [16] have calculated them with 
consideration of the existence of poles proper to 
many-body perturbation theory (MBPT) pointed 
out by Hattig and He8 [17]. 

On the other hand with respect to dynamic 
response properties for open-shell systems, there 
have been only a few reports, in contrast with the 
case of closed-shell systems. Hettema and Wormer 
[18] have calculated the frequency-dependent po¬ 
larizabilities and determined van der Waals coeffi¬ 
cients from frequency-dependent polarizabilities 
with imaginary frequency by the time-dependent 
restricted open-shell Hartree-Fock (TDROHF) 
method. Kama [19] has formulated a time-depen¬ 
dent spin-unrestricted Flartree-Fock (TDUHF) the¬ 
ory and has performed the calculations of dynamic 
nonlinear response properties (up to third order 
[ (o u co 2 )]) by the TDUF1F method. Pipin 

and Bishop [20] have calculated the dynamic po¬ 
larizability of Li atoms by the sum-over-states 
(SOS) method based on the combined config¬ 
uration interaction (Cl) Hylleraas method. 
Very recently, Jaszunski and Rizzo [21] have 
presented the dynamic polarizabilities a( —a r, co) 
and second hyperpolarizabilities for the dc-Kerr 
effect yi-co'sco, 0,0) of Li atoms by the time- 
dependent multiconfiguration self-consistent field 
(TDMCSCF, which has usually been referred to as 
MCTDHF) method. However, the frequency-de¬ 
pendent polarizabilities for open-shell systems at 
the correlated level of theory have not been re¬ 
ported yet with the exception of Li atoms. The 
calculations of polarizabilities for open-shell sys¬ 
tems, except for Li atoms at the correlated level of 
theory, have been performed only in a static case 
by numerical energy derivative (finite-field) 
method [22-25]. Unfortunately, the dynamic re¬ 
sponse properties can be calculated only by analyt¬ 
ical approach. However, it is also very important 
to estimate both the frequency dependency and 
electron correlation of polarizabilities for open-shell 
systems at the same time. We extended the dy¬ 
namic response theory in the Moller-Plesset per¬ 
turbation theory (MPPT) based on the QED method 
for closed-shell systems to that of open-shell sys¬ 
tems [26]. 


In this work we perform the calculations of 
frequency-dependent polarizabilities for nonde¬ 
generate open-shell doublet systems (Li, Na, and K 
atoms and BeH, MgH, CaH, CN, and NH 2 
molecules) at the TDROHF level and MP2 starting < 
with TDROHF wave function (TDROHF 4- MP2) 
level based on the QED method and estimate the 
electron correlation contribution by MP2. There are 
some kinds of definitions of orbital energy in 
ROHF-based perturbation theory with respect to 
electron correlation. We employed the definition in 
open-shell perturbation theory 1 (OPT1), which 
has been proposed by Murray and Davidson [27, 

28], as the diagonal blocks of Fock matrix (orbital 
energy matrix). These are the first calculations of 
frequency-dependent polarizabilities for open-shell 
systems except for Li atoms at the correlated level 
of theory. 


Theory 

In this section the dynamic response theory for 
open-shell systems at the MP2 level of theory 
based on the QED method [26] is briefly showed. 

For convenience, we consider the high-spin 
open-shell systems. We use indices {/,/}, {p,q}, 
{a, b}, and {r, s) to denote dosed (doubly occupied), 
open (singly occupied), virtual, and arbitrary or¬ 
bitals, respectively, and abbreviate "closed," 
"open," and "virtual" to "C," "O," and "V," 
respectively. The TDROHF wave function is ap¬ 
proximated as a single determinant: 

1 ^* TDROHF(T)) = \4 > i4 > i4 > j4 > j *’’ *” I 

v-_— v _ 

closed open 

= expU(f)]|ROHF>, (2) 

where 


*(0 = Yj( K rs(t^Ers ~ K*(f)E/ s ) O) 


rs 


and 

E rs = £ a\ a a srT , rs e {(VC), (OC), (VO)}. (4) 

a 

4> r is the TDROHF orbital, E rs is the generators of 
unitary group, a ] nr , a S(T are creation and annihila¬ 
tion operators with the spin index a. Here we 
introduce the following notation: 

Ea - E rs' K x (t) = K rs (t) (5) 
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and 



defined over the open orbitals; h is the one-elec¬ 
tron part of the Hamiltonian. f OPT1 can be ex- 
( 6 ) pressed by matrix representation: 


Then kU ) can be rewritten as 

k(0 = Y k a (OE a . (7) 

i A 

In the following notation, we regard |HF) as 
|ROHF). Applying the time-dependent variational 
principle (TDVP) [29, 30] to I^tdrohf^) i R condi¬ 
tion of the orthonormality of TDROHF orbitals, we 
obtain TDROHF equation in the first-quantized 
form in case that the definition in open-shell per¬ 
turbation theory 1 (OPT 1 ) proposed by Murray 
and Davidson [27, 28] is used as the diagonal 
blocks of Fock matrix (orbital energy matrix). 

(fOp t1 -,AW>= £ \<f> s )9 sr , 

' <yt > s e (C, 0,V) 

r e (C, O, V ), ( 8 ) 

where 


g 7 & T 

f opti = ^ ( (F c )cc (F co )co (F c )cv ' 

(F co )oc (F c )oo (F°)ov 

T \ (F c )vc (F°)vo (F c )vv j 

8 rs can be written in the second quantized expres¬ 
sion as 


(cc) e ti = u, 




(OO) 


= -E< hf| 





— K 


d \ 11 

H — i— \e K IHF), (14) 

dt ) JJ + 




F OPT1 = E[(l - + P,F‘a -p)+ p,F c Pl ] 

I 

+ E P,F C0 P, + (1 - p)F c (l - p). (9) 

hi 

1*1 

In Eq. (9), p c , p Q , and p are defined as 
Pc= E I<£;><</>,I, Po = E 

feC peO 

P = D l<£ r >(<A r l = I^Pj (10) 

reC,0 7 

and /(/) summation runs over all occupied (closed 
and open) space and F c , F° and F co are given by 

K° 

F c = h + 2 } c -K C F}°~ (11) 

F° = F C -, (12) 

2 

K° 

F co — F c -i ——, (13) 

where } c and K c are the Coulomb and exchange 
operators defined over the closed orbitals and J° 
and K° are the Coulomb and exchange operators 


= 1e<HF| a\ a , a p „, 

H-iAe K |HF>, (15) 

dt J JJ + 

(W) e ab = (d> tt F c - i-h A 

= 1e<hfi 

d \ 11 

H-i— ]e K |HF), (16) 

) JJ + 

(00 6 it = U F c - i-h A 

= 1<HF| iy\e K ,E ai |HF> 

= 0, (17) 

(017) 0 pa = L F° - iy A 

= <HF| e~4H-iy\e K ,E ap |HF) 

= 0 , (18) 
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(co) e ip = {<f> i 


/ 

d 

Ui 

F co - i — 

dt 


= (HF| 
= 0 . 


<t>, 


e~ K H - i— e K ,E ni 


dt 


■pi 


|HF> 

(19) 


energy W(0 with respect to V 


|S>> = El®/> = li> 

;=o 


TDROHF 


>+ E |<h,.>, (25) 

; = i 


w(t) = E w,. = w 

i = 0 


TDROHF 


+ E W f/ (26) 


i — 2 


Equations (17), (18), and (19) are so-called Brillouin 
conditions. As the orbital mixing parameters in the 
same orbital blocks have no effects on variational 
principles, we can neglect them. Though we can 
get canonical TDROHF orbitals by diagonalizing 
the TDROHF orbital energy matrix, we use non- 
canonical TDROHF orbitals because of avoiding 
the possibility of singularity in the orbital mixing 
parameters in the same orbital blocks [15]. The 
zeroth-order Hamiltonian in OPT1 starting with 
TDROHF wave function becomes 

H 0 °PT1 = EF OPT1 (i) -i— L . (20) 

i dt 

The time-dependent Schrodinger equation in 
which an overall time-dependent phase factor is 
isolated is given by 

Ih - i-^)m = ( 2 D 

where W(t) is quasi-energy. The total Hamiltonian 
H is assumed to be written as 

H = H 0 + H U (22) 

where H 0 is nonrelativistic Born-Oppenheimer 
electron Hamiltonian of the isolated system and 
time-dependent perturbed Hamiltonian H l is 

given as 

n 

Hj = - E E e„( (Oj) M a exp( — f w, f)• (23) 

i = - n a = x, y, z 

M a is an a-component of electric dipole moment 
operator and e n (w y ) is the electric field strength of 
e~ l0)it Fourier component. We assume a periodicity 
in H 1 as is known in many-mode Floquet theory 
[31]. 

In the MPPT starting with TDROHF wave func¬ 
tion, Hamiltonian H is partitioned into 

H = F + V, (24) 

where F = E / F OPT 1 (0. In Eq. (21), we introduce 
the perturbation expansions of |T>) and quasi- 


which result in the nth-order equations 

W 0 )|4>„> + VIV,) 

n -1 

= El ( 27 > 

/=o 

with the intermediate normalization condition 
(O 0 |<E>) = 1. Writing as \& = 

exp(/c)| K)t K and multiplying Eq. (27) with 
n — 1 from the left by (L|exp(-K), we obtain 
the equation for determining the amplitude t K 
of the first-order MP wave function: 

XXHk - it L + (L\e~ K Ve K \HF) = 0, (28) 

K 


where 


G lk = (L\G\K) 



O rs E rs -W 0 


K), (29) 


| K ) and (L\ are all doubly excited configuration 
state functions (CSFs) from |ROHF> and <ROHF|, 
respectively. The existence of ii L in Eq. (28), which 
cause the poles proper to the MBPT, have been 
pointed out by Hattig and Hefi [17]. Multiplying 
Eq. (27) with n = 2 from the left by (<£> 0 | = 
(HF|exp(- k), we obtain the MP2 quasi-energy 
correction 


w 2 = E<HFk- K Ve K |K>V (30) 

K 


We introduce and ~t L , which are Lagrange mul¬ 
tipliers for equations for the TDROHF orbital 
rotation parameters k a and for the amplitudes 
t K , respectively. Then the MP2 quasi-energy La- 
grangian L is 

£■ = W TDROHF + E<HFk-«Ve*|K>t K 

K 


+ EM EGlkM - ih + ( L\e~ K Ve K \HF) 

L ^ K 


+ E &<hfi 

+ A 


E„e-*\H-i— \e« 

at 


|HF>. 


(31) 
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The permanent electric dipole moment fi A and 
frequency-dependent polarizability a AB ( — co; w) at 
the ROHF + MP2 level of theory based on the 
quasi-energy derivative method are defined as 


dUt ) 


a 


AB 


AA 


(~(o; m) = - 


deAO) 


d z L(t) 


de A (-w) deJa>) 


(32) 

(33) 


GlxM) 

= \ E E<HF| \a\ a ,u ra , A]] + |HF><L|EJK> 

A rs a 

- E E<HF|[flL,[ fli> , A]] + |HF><L|fO 

(EC (7 

E E<HF|[a; ff/ [ V ,A]] + lHF><L|JO. 

^ peO cr 

(38) 


and explicitly given as 

*A = (hfimjhf) + E^ 0) g lk {m a }4°> 

L, K 

+ E 4 (0) <HF|[ £ a , Mj|HF>, (34) 

i A 

a AB (- w; a>) 

= <HF|[M a ,/c b (o.)]|HF> 

- E<HF|[P< 0 >; k a (-«), K B («)]|fO4 0) 

K 

~ E“f[ 0) <U v (0) ; /c A (-co), k*U)]|HF> 

L 


- E([ 0 ) g lk {/ ab ( 

- 0 ), to)} if 0 

L,K 


+ YAk ( -&, )(G£j: 

- &>)f|(«) 

k 


+ E^|( w )(Goc + 


K 


-E£a ( 0 ) (hf|[e a/ 

i A 

} AB ( — u>, u> 


(35) 

where is an A component of electric dipole 
moment operator, and we have used the symmet¬ 
ric commutator 

[ A;B,C] = \([[A,B],C] + [[ A,C], B]) (36) 

and have defined the following notations: 

J AB (-a), a>) = - [ M a , k b ( (d)] - [M b , k a (~o))] 

+ [ H 0 ; k a (-(o), k b (o ))] 

+ a)[ k A (-~ (d), k b ( a))] (37) 


In Eqs. (34) and (35), and £ A (0) are the 

zeroth-order of t K , ~t L , and ^ A ; and k a (co) / t A (<w), 
and ~t A (co) are first-order of k, t K , and t L in the 
perturbation expansion with respect to electric 
field, respectively. These amplitudes and multipli¬ 
ers are determined by the time-averaged varia¬ 
tional condition for Lagrangian 8{L} T = 0. V (0) is 
the zeroth-order fluctuation potential. 


Computational Details 

The matrix elements in the second quantization 
language appearing in Eqs. (31) and (32) are calcu¬ 
lated automatically by using the program [32] in 
the algebraic programming language REDUCE3. 
The program for calculation of frequency-depen- 
dent polarizability for open-shell system at the 
TDROHF + MP2 level of theory is connected to 
Sasagane's TDROHF program. 

BASIS SETS 

The primitive Gaussian basis sets for Li, Be, and 
H on MgH and BeH are derived from Sasagane's 
smooth-tempered basis sets (14 s), (10 s), and (6 s), 
respectively. A series of exponents in smooth-tem¬ 
pered basis set is given by ^ (N) (a, /3, y, 8) = 
ap k ~ l y 8N \ (k = 1,2,..., N). We used the opti¬ 
mized parameter sets (a, /3, y, 8) = (0.023455420, 
2.3106846,17.03123, 0.6247924) for Li, (0.053619160, 
2.7557931, 6.0954213, 0.50286011) for Be, and 
(0.079899519, 2.6231810, 8.3605059, 0.56192361) for 
H on MgH and BeH. The diffuse/polarization 
functions (5sl4p/12d), (3s9 p/7d), and (3s/7p) are 
added to the primitive basis sets for Li, Be, and H 
on MgH and BeH, respectively, and contracted to 
[14sl2pl0d] for Li, [9s7p5d] for Be, and [6s5p] for 
H on MgH and BeH. The basis sets for Li, Be, and 
H on MgH and BeH are given in Table I. The 
primitive basis sets for Na, Mg, and K are derived 
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TABLE I 


Smooth-tempered basis sets for Li, Be, and H. 



Li [14s12p1 Oaf] 


Be [9s7p5cf] 


H [6s5p] 


Type 

Exponent 

Coefficient 

Type 

Exponent 

Coefficient 

Type 

Exponent 

Coefficient 

s 

21385.428 

0.00015000019 

s 

2996.0097 

0.00087628616 

s 

82.969412 

0.0025894488 


3194.5111 

0.0011938157 


442.62907 

0.0068262223 


12.476175 

0.019972614 


711.25708 

0.0063812155 


102.22305 

0.033436111 


2.8198849 

0.098435389 


203.21120 

0.025844243 


29.554163 

0.12406540 


0.80137208 

0.33188081 


67.847365 

0.088942149 


9.5663893 

0.34213598 


0.25901324 

0.64992411 


24.968615 

0.26721810 


3.2775362 

0.60474561 

s 

82.969412 

0.0018385256 


9.7649438 

0.69035061 

s 

2996.0097 

0.00069412747 


12.476175 

0.014297031 

s 

21385.428 

0.00014407673 


442.62907 

0.0054235614 


2.8198849 

0.072945077 


3194.5111 

0.0011474109 


102.22305 

0.026767969 


0.80137208 

0.27386902 


711.25708 

0.0061324481 


29.554163 

0.10217944 


0.25901324 

0.72303267 


203.21120 

0.024909655 


9.5663893 

0.30120542 

p 

2.8198849 

0.0054112577 


67.847365 

0.086074721 


3.2775362 

0.66381458 


0.80137208 

0.037682410 


24.968615 

0.26250884 

s 

1.1554537 

1.000000 


0.25901324 

0.20870216 


9.7649438 

0.69744586 

p 

9.5663893 

0.069383602 


0.089993840 

0.82404431 

s 

3.9668719 

1.0000000 


3.2775362 

0.21148033 

s 

0.089993840 

1.000000 

P 

24.968615 

0.065800904 


1.1554537 

0.80594128 

p 

2.8198849 

0.0049857914 


9.7649438 

0.19389662 

sp 

0.41323385 

1.000000 


0.80137208 

0.032937307 


3.9668719 

0.80737986 

sp 

0.14885938 

1.000000 


0.25901324 

0.20139597 

sp 

1.6502041 

1.0000000 

d 

1.1554537 

0.024542010 


0.089993840 

0.83219299 

sp 

0.69673783 

1.0000000 


0.41323385 

0.17376192 

sp 

0.032564898 

1.000000 

d 

3.9668719 

0.057990397 


0.14885938 

0.87145372 

sp 

0.012055978 

1.000000 


1.6502041 

0.12960203 

spd 

0.053818796 

1.000000 

sp 

0.0045209224 

1.000000 


0.69673783 

0.87849098 

spd 

0.019493284 

1.000000 




spd 

0.29691094 

1.0000000 

spd 

0.0070669971 

1.000000 




spd 

0.12726174 

1.0000000 

spd 

0.0025632172 

1.000000 




spd 

0.054744532 

1.0000000 







spd 

0.023602899 

1.0000000 







spd 

0.010190683 

1.0000000 







spd 

0.0044037687 

1.0000000 







spd 

0.0019040797 

1.0000000 







spd 

0.00082356019 

1.0000000 







spd 

0.00035628623 

1.0000000 








from Partridge's uncontracted basis sets [33]. The 
Partridge's primitive basis sets (18sl0p) for Na 
and Mg are contracted to [13s7p] by contracting 
(6 s) with the six largest exponents in s functions 
to [Is] by using LCAO coefficients of Is orbital in 
ROHF orbitals as the contraction coefficients and 
contracting (4 p) with the four largest exponents in 
p functions to [Ip] by using LCAO coefficients of 
2 p orbital in ROHF orbitals as the contraction 
coefficients. The contracted basis sets for Na 
and Mg [13s7p] are added by diffuse functions 
(2 s5p) with exponents obtained from the smallest 
ones divided by 2.4. Then the polarization funct¬ 
ions (10 d) are added for Na with exponents 
(13.51565, 5.966856, 2.700045, 1.218512, 0.542187, 
0.227413, 0.0948, 0.0395, 0.0165, 0.00685). For Mg, 


the polarization functions ( 5d ) are added with 
exponents (7.753275, 1.614324, 0.302946, 0.0526, 
0.00913). The Partridge's primitive basis set 
(20sl2p) for K are contracted to [13s7p] by con¬ 
tracting (10 s) with the 10 largest exponents in s 
functions to [3s] by using LCAO coefficients of Is, 
2 s, and 3 s orbitals in ROHF orbitals as the contrac¬ 
tion coefficients and contracting (7p) with the seven 
largest exponents in p functions to [2 p] by using 
LCAO coefficients of 2 p and 3 p orbitals in ROHF 
orbitals as the contraction coefficients. The con¬ 
tracted basis set for K [13s7p] is added by diffuse 
functions (2s4p) with exponents obtained from the 
smallest ones by divided by 2.5. The polarization 
functions ( 5d ) are added for K with exponents 
(4.086109, 0.845487, 0.15171, 0.0243, 0.00388). Con- 
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sequently, the basis sets for Na, Mg, and K be¬ 
comes [15sl2pl0d], [15sl2p5d], and [15sllp5d], 
respectively. For H on CaH and NH 2 , we em¬ 
ployed Sadlej's polarized basis set [34] [3s2p] 
modified by Sasagane [16, 36]. For H on NH 2 , the 
diffuse /polarization functions {Is/Ip) which have 
sp exponent 0.010215 are added. For C and N, we 
used Sadlej's polarized basis sets [6s4p2d] modi¬ 
fied by Sasagane with diffuse/polarization func¬ 
tions (1 sip/Id) in which s exponents are 0.014653 
for C and 0.020168 for N and pd exponents are 
0.012198 for C and 0.017337 for N. The basis sets 
for C, N, and H on CaFI and NF1 2 are shown in 
Table II. For Ca, Sadlej's original polarized basis 
set [9s7p2d] [35] are used. 

GEOMETRIES 

The calculations of dipole moments and dy¬ 
namic polarizabilities of BeH, MgH, CaFI, CN, and 
NF1 2 are performed at the experimental equilib¬ 
rium geometries [37, 38]. For BeH, MgH, CaH, and 


CN, R BeH = 1.3431 A, R Mg H - 1.7306 A, R CaH = 

2.002 A, and R CN = 1.1718 A. For NH 2 , the z axis 
is C 2v axis, R NH = 1.024 A, and 0 NHN = 103.36°. 


Results 

In Table III, the experimental [39, 40] and other 
theoretical [20, 21, 22-25] values of static polariz¬ 
abilities of Li, Na, and K atoms are given with our 
work. To our knowledge, no experimental values 
of dynamic polarizabilities of Li, Na, and K atoms 
have been reported. In Tables IV, V, and VI, the 
results of calculations of static and dynamic polar¬ 
izabilities for Li, Na, and K atoms at the TDROHF 
and TDROHF -I- MP2 levels of theory are pre¬ 
sented, respectively. The electron correlation con¬ 
tribution to the polarizabilities estimated by the 
second-order Moller-Plesset perturbation theory 
(MP2 contribution) is also shown. In Table IV, 
frequency-dependent polarizabilities for Li atom in 


TABLE II 


Modified Sadlej’s polarized basis sets for C, N, and H. 



C [7s5p3d] 


N [7s5p3d] 


H [3s2p] a 


Type 

Exponent 

Coefficient 

Type 

Exponent 

Coefficient 

Type 

Exponent 

Coefficient 

s 

5240.6353 

0.02235882 

s 

8104.0716 

0.02229515 

s 

33.865014 

0.008761077 


782.20480 

0.1724634 


1216.0215 

0.1715770 


5.094788 

0.06542281 


178.35083 

0.8672313 


277.23428 

0.8679323 


1.158786 

0.2928504 

s 

50.815942 

0.3030337 

s 

78.904023 

0.2899865 


0.325840 

0.7272122 


16.823562 

0.7400231 


25.874419 

0.7525437 

s 

0.102741 

1.000000 

$ 

6.175776 

0.6940676 

s 

9.346767 

0.6700486 

s 

0.032395 

1.000000 


2.418049 

0.3400776 


3.579794 

0.3669701 

p 

1.158786 

0.1864409 

s 

0.511900 

1.000000 

s 

0.739610 

1.000000 


0.325840 

0.8730840 

s 

0.156590 

1.000000 

s 

0.222617 

1.000000 

p 

0.102741 

0.9764826 

s 

0.047901 

1.000000 

s 

0.067006 

1.000000 


0.032395 

0.03443044 

s 

0.014653 

1.000000 

s 

0.020168 

1.000000 




p 

18.841800 

0.03923992 

p 

26.868987 

0.03948419 





4.159240 

0.2437874 


5.991227 

0.2486013 





1.206710 

0.8158682 


1.750842 

0.8111297 




p 

0.385540 

1.000000 

p 

0.560511 

1.000000 




p 

0.121940 

1.000000 

p 

0.175948 

1.000000 




p 

0.038568 

1.000000 

p 

0.055231 

1.000000 




p 

0.012198 

1.000000 

p 

0.017337 

1.000000 




d 

1.206710 

0.2679750 

d 

1.750842 

0.2770034 





0.385540 

0.8199855 


0.560511 

0.8128187 




d 

0.121940 

0.6277424 

d 

0.175948 

0.5974958 





0.038568 

0.4964327 


0.055231 

0.5295971 




d 

0.012198 

1.000000 

d 

0.017337 

1.000000 




a For H on NH 2 , the diffuse/polarization functions (Is/Ip) which have sp exponent 0.010215 are added. 
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TABLE III___ : _ 

Experimental and theoretical values of static polarizabilities of Li, Na, and K atoms (in a.u.). 



Li 

Na 

K 

ROHF (this work) 

170.13 

190.55 

416.27 

ROHF + MP2 (this work) 

165.01 

165.88 

285.23 

PNO-CEPA 

164.5 [22] 

165.02 [23] 

287.6 [23] 

Cl Hylleraas [20] 

164.1 



M.CSCF [21] 

164.91 



UMP4 [24] 

164.55 



UHF-CCSD(T) [25] 

164.187 



Experimental values 

164+3 [39] 

159.2 ± 3.4 [39] 

292.8 + 6.1 [39] 


164.6 + 11.5 [40] 

305.0 + 21.6 [40] 


TABLE IV_ _ 

Frequency-dependent polarizabilities a(-o>;co) of Li atoms (in a.u.). 


Frequency co 



0 

0.005 

0.010 

0.015 

0.020 

0.025 

0.030 

0.035 

TDROHF 
a(— co] co) 

TDROHF + MP2 

170.13 

171.07 

173.95 

178.97 

186.52 

197.22 

212.11 

232.92 

a( — a>; co) 

Cl Hylleraas [20] 

165.01 

165.90 

168.63 

173.38 

180.52 

190.61 

204.61 

224.09 

a( — co; co) 

TDMCSCF [21] 

164.1 

165.0 

167.7 

172.4 

179.5 

189.6 

203.4 

222.8 

a( — co; co) 

164.91 

165.80 

168.53 

173.29 





MP2 contribution 3 
a( — co; co) 

-5.121 

-5.169 

-5.319 

-5.585 

-5.996 

-6.603 

-7.494 

-8.831 


-3.10% 

-3.12% 

-3.15% 

-3.22% 

-3.32% 

-3.46% 

-3.66% 

-3.94% 


a MP2 contribution is obtained after subtracting the TDROHF value from the TDROHF + MP2 value. The percentage is obtained by 
dividing the MP2 contribution with the TDROHF 4- MP2 value. 


TABLE V_ 

Frequency-dependent polarizabilities a(-w; o>) of Na atoms (in a.u.). 


Frequency co 



0 

0.005 

0.010 

0.015 

0.020 

0.025 

0.030 

0.035 

TDROHF 

a( — co; co) 

190.55 

191.48 

194.32 

199.25 

206.58 

216.86 

230.90 

250.05 

TDROHF + MP2 

a( — co; co) 

165.88 

166.57 

168.66 

172.27 

177.58 

184.88 

194.64 

207.51 

MP2 contribution 3 

a( — co; co) 

-24.67 

-24.91 

-25.65 

-26.98 

-29.01 

-31.97 

-36.26 

-42.54 


-14.87% 

-14.95% 

-15.21% 

-15.66% 

-16.33% 

-17.29% 

-18.63% 

-20.50% 


a MP2 contribution is obtained after subtracting the TDROHF value from the TDROHF + MP2 value. The percentage is obtained by 
dividing the MP2 contribution with the TDROHF + MP2 value. 
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TABLE VI_ 

Frequency-dependent polarizabilities a(- co; to) of K atoms (in a.u.). 


Frequency co 



0 

0.005 

0.010 

0.015 

0.020 

0.025 

TDROHF 

a( — co] co) 

416.27 

420.50 

433.71 

457.71 

496.22 

556.55 

TDROHF + MP2 

a( — co", co) 

285.23 

286.71 

291.18 

298.59 

308.58 

319.49 

MP2 contribution 3 
a( — a>; co) 

-131.05 

-133.79 

-142.53 

-159.12 

-187.64 

-237.06 


-45.94% 

-46.66% 

-48.95% 

-53.29% 

-60.81% 

-74.20% 


a MP2 contribution is obtained after subtracting the TDROHF value from the TDROHF + MP2 value. The percentage is obtained by 
dividing the MP2 contribution with the TDROHF + MP2 value. 


Cl Hylleraas calculation by Pipin and Bishop [20] 
and those at the TDMCSCF level by Jaszunski and 
Rizzo [21] are shown with our work. Jaszunski and 
Rizzo used [13sllp9d5f] basis set and employed 
5s4p2d as the orbitals in the active space to obtain 
the results of dynamic polarizabilities for Li atom 
shown in Table IV. Compared with the experimen¬ 
tal and other theoretical values, the static polariz¬ 
abilities of Li, Na, and K atoms at the ROHF + 
MP2 level give good corrections to those at the 
ROHF level. Moreover the frequency-dependent 
polarizabilities of Li atom at the TDROHF + MP2 
level of theory are slightly higher than those in Cl 
Hylleraas calculations by Pipin and Bishop and 
TDMCSCF calculations by Jaszunski and Rizzo but 
in good agreement with them. The calculations of 
frequency-dependent polarizabilities for Li atom at 
the TDROHF + MP2 level of theory are expected 
to give reliable results. On the other hand, no other 
theoretical values of dynamic polarizabilities of Na 
and K atoms have been reported. However, as the 
static limits of polarizabilities for Na and K atoms 
at the TDROHF + MP2 level of theory have very 
close values to the experimental values, the dy¬ 
namic polarizabilities of Na and K atoms are also 
likely to give good results. In Tables VII, VIII, IX, 
X and XI, the results of calculations of dipole 
moments and static and dynamic polarizabilities 
for BeH, MgH, CaH, CN, and NH 2 at the ROHF 
and ROHF + MP2 levels of theory are presented 
with MP2 contribution, respectively. To our 
knowledge, no experimental and other theoretical 
values of static and dynamic polarizabilities for 
BeH, MgH, CaH, CN, and NH 2 have also been 
reported. So we cannot compare the calculations of 


frequency-dependent polarizabilities of those 
open-shell molecules at the TDROHF + MP2 level 
of theory with other results. 

The percentage of MP2 contribution to dynamic 
polarizabilities a( - co; a>) of Li, Na, and K atoms 
and average dynamic polarizabilities a( — co ; co) of 
BeH, MgH, CaH, and CN becomes greater as the 
frequency increases. The order of percentage of 
MP2 contribution to dynamic polarizabilities is 
Li < Na < K and BeH < MgH < CaH. The per¬ 
centage of MP2 contribution to a( — co; co) of NH 2 
shows little change over the frequency range co = 
0-0.045 a.u., where the percentage is 8.55-8.57%. 


Conclusion and Discussion 

We presented the first calculations of 
frequency-dependent polarizabilities for open-shell 
systems except for Li atoms at the correlated level 
of theory. Compared with experimental [39, 40] 
and other calculations [20, 21, 22-25], the static 
polarizabilities of Li, Na, and K atoms at the 
ROHF + MP2 level give good corrections to those 
at the ROHF level. The dynamic polarizabilities of 
Li atoms at the TDROHF + MP2 level of theory 
are in good agreement with the previous calcula¬ 
tions [20, 21]. As no experimental and other theo¬ 
retical values of static and dynamic polarizabilities 
for BeH, MgH, CaH, CN, and NH 2 have also been 
reported, we cannot tell whether the calculations 
of frequency-dependent polarizabilities of those 
open-shell molecules at the TDROHF + MP2 level 
of theory are reliable or not at this time. However, 
the treatment of frequency-dependent polarizabili¬ 
ties by TDROHF + MP2 method is expected to be 
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TABLE VII_ 

Dipole moments p and frequency-dependent polarizabilities a(-w; w) of BeH (in a.u.). 


Frequency co 



0 

0.005 

0.010 

0.015 

0.020 

0.025 

0.030 

0.035 

0.040 

0.045 

ROHF 











fl z 

ROHF + MP2 

0.1096 










M'Z 

TDROHF 

0.1069 










a xx ( — oo) 

36.326 

36.378 

36.535 

36.802 

37.188 

37.703 

39.205 

39.205 

40.250 

41.551 

a zz {-co\ co) 

24.806 

24.815 

24.840 

24.883 

24.943 

25.021 

25.116 

25.231 

25.365 

25.518 

a( — co] coY 

TDROHF + MP2 

32.486 

32.523 

32.637 

32.829 

33.106 

33.476 

33.950 

34.547 

35.288 

36.207 

a xx ( — co) 

36.290 

36.345 

36.510 

36.791 

37.197 

37.742 

38.448 

39.344 

40.470 

41.886 

a zz {-oo\ co) 

25.578 

25.587 

25.614 

25.659 

25.723 

25.805 

25.907 

26.028 

26.170 

26.333 

a( — co] coY 

MP2 contribution 13 

32.719 

32.759 

32.878 

33.080 

33.372 

33.763 

34.268 

34.905 

35.703 

36.702 

a xx ( — co] co) 

-0.036 

-0.033 

-0.026 

-0.011 

0.009 

0.039 

0.081 

0.139 

0.220 

0.335 

-0.10% 

-0.09% 

-0.07% 

-0.03% 

0.02% 

0.10% 

0.21% 

0.35% 

0.54% 

0.80% 

a zz (~co] co) 

0.772 

0.772 

0.774 

0.776 

0.780 

0.785 

0.790 

0.797 

0.805 

0.815 

3.02% 

3.02% 

3.02% 

3.03% 

3.03% 

3.04% 

3.05% 

3.06% 

3.08% 

3.09% 

a( — co', co) a 

0.233 

0.235 

0.241 

0.251 

0.266 

0.288 

0.317 

0.359 

0.415 

0.495 


0.91% 

0.92% 

0.94% 

0.98% 

1.03% 

1.11% 

1.23% 

1.38% 

1.59% 

1.88% 


a — _ 1 .1 

a 3 a xx + 3 o l zz . 

b MP2 contribution is obtained after subtracting the TDROHF value from the TDROHF + MP2 value. The percentage is obtained by 
dividing the MP2 contribution with the TDROHF + MP2 value. 


TABLE VIII_ 

Dipole moments p and frequency-dependent polarizabilities a(-w; co) of MgH (in a.u.). 


Frequency co 

0 0.005 0.010 0.015 0.020 0.025 0.030 0.035 0.040 0.045 


ROHF 

/x z 0.5862 

ROHF + MP2 


TDROHF 

0.5865 

a xx (- co\ o) 

69.222 

a zz {~co] co) 

53.780 

a( — co] coY 

TDROHF + MP2 

64.075 

a xx ( — co] co) 

65.762 

a zz (— co] co) 

56.452 

a( — co] co) a 

MP2 contribution 5 

62.658 

a xx ( — co] co) 

-3.460 

-5.26% 

a zz ( — co] co) 

2.672 

4.73% 

a( — co] coY 

-1.416 

-2.26% 


69.353 

69.749 

70.426 

53.817 

53.929 

54.117 

64.174 

64.476 

64.989 

65.880 

66.236 

66.845 

56.491 

56.609 

56.807 

62.750 

63.027 

63.499 

-3.473 

-3.513 

-3.580 

-5.27% 

-5.30% 

-5.36% 

2.674 

2.680 

2.690 

4.73% 

4.73% 

4.74% 

-1.424 

-1.449 

-1.490 

-2.27% 

-2.30% 

-2.35% 


71.406 

72.726 

74.441 

54.382 

54.728 

55.158 

65.731 

66.727 

68.013 

67.727 

68.917 

70.462 

57.087 

57.451 

57.904 

64.181 

65.095 

66.276 

-3.678 

-3.809 

-3.978 

-5.43% 

-5.53% 

-5.65% 

2.704 

2.723 

2.746 

4.74% 

4.74% 

4.74% 

-1.551 

-1.632 

-1.737 

-2.42% 

-2.51% 

-2.62% 


76.627 

79.395 

82.906 

55.676 

56.288 

57.002 

69.643 

71.693 

74.271 

72.435 

74.935 

78.114 

58.449 

59.094 

59.844 

67.773 

69.655 

72.024 

-4.192 

— 4.460 

-4.793 

-5.79% 

-5.95% 

-6.14% 

2.773 

2.806 

2.843 

4.74% 

4.75% 

4.75% 

-1.871 

-2.038 

-2.247 

-2.76% 

-2.93% 

-3.12% 


b MP2 contribution is obtained after subtracting the TDROHF value from the TDROHF + MP2 value. The percentage is obtained by 
dividing the MP2 contribution with the TDROHF + MP2 value. 
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CALCULATION OF FREQUENCY-DEPENDENT POLARIZABILITIES 


TABLE IX_ 

Dipole moments p, and frequency-dependent polarizabilities a(-w; o>) of CaH (in a.u.). 


Frequency <a 



0 

0.005 

0.010 

0.015 

0.020 

0.025 

0.030 

0.035 

ROHF 









ROHF + MP2 

0.8351 








Mz 

TDROHF 

1.0168 








OL xx (— (O] O)) 

176.63 

177.48 

180.10 

184.68 

191.57 

201.37 

215.10 

234.45 

OL zz (~ (O’, Cl)) 

102.00 

102.19 

102.78 

103.78 

105.23 

107.16 

109.65 

112.79 

a( — <x>; co) a 

TDROHF + MP2 

151.75 

152.39 

154.33 

157.71 

162.79 

169.97 

179.95 

193.89 

a xx ( — co] co) 

149.93 

150.48 

152.18 

155.12 

159.46 

165.47 

173.56 

184.33 

a zz {~(o; co) 

103.15 

103.34 

103.90 

104.86 

106.24 

108.09 

110.46 

113.44 

a( — (o\ co) a 

134.34 

134.77 

136.09 

138.37 

141.72 

146.34 

152.53 

160.70 

MP2 contribution 13 

a xx ( — (o\ co) 

-26.70 

-27.00 

-27.92 

-29.56 

-32.11 

-35.90 

-41.54 

-50.11 


-17.81% 

-17.94% 

-18.35% 

-19.06% 

-20.14% 

-21.70% 

-23.93% 

-27.19% 

a zz ( — co; co) 

1.16 

1.15 

1.12 

1.08 

1.02 

0.927 

0.810 

0.656 


1.12% 

1.11% 

1.08% 

1.03% 

0.96% 

0.86% 

0.73% 

0.58% 

a( — a>; w) a 

-17.42 

-17.62 

-18.24 

-19.35 

-21.07 

-23.63 

-27.42 

-33.19 


-12.97% 

-13.07% 

-13.40% 

-13.98% 

-14.87% 

-16.14% 

-17.98% 

-20.65% 


a— _ 2 ,1 

a. — 3 a xx ^ 3 a zz- 

b MP2 contribution is obtained after subtracting the TDROHF value from the TDROHF + MP2 value. The percentage is obtained by 
dividing the MP2 contribution with the TDROHF + MP2 value. 


TABLE X_ 

Dipole moments (x and frequency-dependent polarizabilities a(- to; to) of CN (in a.u.). 


Frequency co 

0 0.005 0.010 0.015 0.020 0.025 0.030 


ROHF 


ROHF + MP2 

-0.9123 

TDROHF 

-0.5995 

a xx (— cu; co) 

17.424 

a zz (-(o\ co) 

20.008 

a( — co; co) a 

TDROHF + MP2 

18.285 

a xx ( —co; co) 

18.736 

a zz ( — co; co) 

24.039 

a(— co; co) a 

MP2 contribution 5 

20.504 

a xx ( — co; co) 

1.312 

7.00% 

a zz (-co; co) 

4.031 

16.77% 

a( — co; co) a 

2.218 

10.82% 


17.466 

17.599 

17.835 

20.010 

20.017 

20.029 

18.314 

18.405 

18.566 

18.840 

19.169 

19.777 

24.045 

24.064 

24.094 

20.575 

20.800 

21.216 

1.373 

1.570 

1.943 

7.29% 

8.19% 

9.82% 

4.035 

4.046 

4.065 

16.78% 

16.81% 

16.87% 

2.261 

2.396 

2.650 

10.99% 

11.52% 

12.49% 


18.203 

18.755 

19.591 

20.045 

20.067 

20.092 

18.817 

19.192 

19.758 

20.782 

22.415 

25.174 

24.138 

24.194 

24.263 

21.900 

23.008 

24.870 

2.579 

3.660 

5.583 

12.41% 

16.32% 

22.18% 

4.092 

4.127 

4.170 

16.95% 

17.06% 

17.19% 

3.083 

3.816 

5.112 

14.08% 

16.58% 

20.56% 


a— 2 , i 

Oi — 3 Ot xx + 3 0L zz . 

b MP2 contribution is obtained after subtracting the TDROHF value from the TDROHF + MP2 value. The percentage is obtained by 
dividing the MP2 contribution with the TDROHF + MP2 value. 
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TABLE XI __ 

Dipole moments p and frequency-dependent polarizabilities a(-to; w) of NH 2 (in a.u.). 


Frequency co 



0 

0.005 

0.010 

0.015 

0.020 

0.025 

0.030 

0.035 

0.040 

0.045 

ROHF 











ROHF + MP2 

-0.7391 










TDROHF 

-0.7052 










a xx ( — to) 

9.830 

9.834 

9.847 

9.870 

9.904 

9.949 

10.009 

10.086 

10.186 

10.317 

a yy (-o)] co) 

11.932 

11.933 

11.935 

11.940 

11.946 

11.955 

11.965 

11.977 

11.991 

12.007 

a zz {-co\ o)) 

11.736 

11.738 

11.741 

11.747 

11.756 

11.767 

11.780 

11.796 

11.814 

11.835 

a( — co] co) a 

TDROHF + MP2 

11.166 

11.168 

11.175 

11.186 

11.202 

11.224 

11.251 

11.286 

11.330 

11.386 

a xx ( — o)\ to) 

10.774 

10.779 

10.792 

10.815 

10.848 

10.894 

10.953 

11.031 

11.131 

11.263 

OLyy{- toj to) 

12.684 

12.685 

12.688 

12.693 

12.701 

12.711 

12.723 

12.737 

12.754 

12.772 

a 2Z (- to; to) 

13.180 

13.181 

13.186 

13.195 

13.207 

13.222 

13.240 

13.262 

13.288 

13.317 

a( — to; to) a 

MP2 contribution 13 

12.212 

12.215 

12.222 

12.234 

12.252 

12.275 

12.306 

12.343 

12.391 

12.451 

a xx (- a>; to) 

0.945 

0.945 

0.945 

0.945 

0.945 

0.944 

0.944 

0.945 

0.945 

0.946 

8.77% 

8.76% 

8.75% 

8.73% 

8.71% 

8.67% 

8.62% 

8.56% 

8.49% 

8.40% 

a yy (-~to; to) 

0.752 

0.752 

0.753 

0.754 

0.755 

0.756 

0.758 

0.760 

0.763 

0.765 

5.93% 

5.93% 

5.93% 

5.94% 

5.94% 

5.95% 

5.96% 

5.97% 

5.98% 

5.99% 

a zz (- to; to) 

1.443 

1.444 

1.445 

1.447 

1.451 

1.455 

1.460 

1.467 

1.474 

1.482 

10.95% 

10.95% 

10.96% 

10.97% 

10.99% 

11.01% 

11.03% 

11.06% 

11.09% 

11.13% 

a( — o>; to) a 

1.047 

1.047 

1.047 

1.049 

1.050 

1.052 

1.054 

1.057 

1.061 

1.065 


8.57% 

8.57% 

8.57% 

8.57% 

8.57% 

8.57% 

8.57% 

8.56% 

8.56% 

8.55% 


a g (tt XX + OLyy + OL ZZ ). 

b MP2 contribution is obtained after subtracting the TDROHF value from the TDROHF + MP2 value. The percentage is obtained by 
dividing the MP2 contribution with the TDROHF + MP2 value. 


valid for nondegenerate open-shell systems in 
which only dynamical pair correlation is signifi¬ 
cant, that is, TDROHF wave function is good as 
a reference state of Moller-Plesset perturbation 
theory. 

In this calculation, OPT1 definition is used as 
the orbital energy matrix. There are some kinds of 
open-shell perturbation theory with respect to 
electron correlation by spin-unrestricted or re¬ 
stricted and the difference of definition of orbital 
energy. It is necessary to estimate the effect to 
response property by such differences in open-shell 
perturbation theory. In this study, we treated 
open-shell doublet systems for the calculation of 
dynamic polarizabilities at the TDROHF + MP2 
level of theory. In the dynamic polarizability for 
open-shell system which has spin multiplicity more 
than triplet, the MBPT poles corresponding to the 
difference between (open + open) and (closed + 
closed) orbital energy (e + e — e i — ej) may ex¬ 
ist, and they may be smaller than the first excited 


energy by spin-restricted open-shell random-phase 
approximation (RPA) or TDROHF since open or¬ 
bital energy may be very close to closed orbital 
energy. In such a case, the first dispersion of dy¬ 
namic polarizability exists before the first excited 
energy by RPA. However, such a MBPT pole has 
no physical meaning. That difficulty may be 
avoided by utilizing the arbitrarity of orbital 
energy. 
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Electron Correlation Effects upon the 
Static (Hyper)polarizabilities 
of Push-Pull Conjugated 
Polyenes and Polyynes 
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ABSTRACT: Ab initio calculations, including the electron correlation effects via the 
Moller-Plesset partitioning, of the static longitudinal polarizability (a L ), first and second 
hyperpolarizabilities (/3 L and y L ) of small push-pull molecules have been carried out 
with the 6-31G and 6-31G* basis sets. Two types of conjugated segment (polyacetylene 
and polyyne) were used as linker between the nitro and amino fragments, chosen as 
acceptor and donor groups. For small chains, the triply bonded structures lead to larger 
y L than the doubly bonded structures, although the a L and /3 L values order in the 
opposite direction. The second-order Moller-Plesset (MP2) corrections are essential to 
obtain correct estimates of the hyperpolarizabilities and are generally much larger than 
the MP3 and MP4 contributions. Geometry optimizations were carried out with both 
basis sets within the Hartree-Fock and MP2 schemes; the MP2 geometries lead to smaller 
bond length alternations and thus larger (hyper)polarizabilities. For a L a careful choice of 
the geometry is more crucial than the use of polarization functions during the electronic 
state calculations; however, for the hyperpolarizabilities, both the geometry and the basis 
set are of importance. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 679-688, 1997 
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Introduction 

T he nonlinear optical (NLO) properties of 
push-pull systems have been widely investi¬ 
gated, both theoretically and experimentally in the 
recent years. These molecules are composed of a 
conjugated segment end-capped by donor and ac¬ 
ceptor groups. Numerous end groups and linkers 
were tested in order to obtain large NLO responses 
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(see the review article of Karris et al. [1]). As 
shown by Sim et al. for the typical case of p-nitro- 
aniline the electron correlation could lead to signi¬ 
ficant variations of the predicted NLO responses 
[2]. Indeed the inclusion of the second-order 
Moller-Plesset (MP2) correction gives, in compari¬ 
son to the Hartree-Fock level, variations of +12%, 
+ 88% and +100% for the polarizability, the first 
and second hyperpolarizability, respectively. How¬ 
ever, most of the ab initio calculations including 
the electron correlation effects have focused on 
small systems, and we refer the reader to the 
works of Shelton and Rice [3] and Bartlett and 
Sekino [4] for complete reviews of their 
(hyper)polarizability evaluations. 

To maximize the longitudinal polarizability 
(a L ), first hyperpolarizability (/3 L ), and second hy¬ 
perpolarizability (y L ), a large electronic delocaliza¬ 
tion along the backbone of the molecule is needed; 
so, increasing the length of the linker in push-pull 
molecules leads to larger a L , fi L , and y L values. In 
the case of fi L , the electronic asymmetry of the 
fragment is also of importance. Indeed, it was 
shown experimentally [5] and theoretically [6, 7] 
that /3 l of a symmetric conjugated segment end- 
capped by donor and acceptor groups presents a 
saturation to a constant when the length of the 
linker increases. This is explained by the fact that 
the push-pull strength becomes diluted in the con¬ 
jugated space for large chains. This saturation is 
even reproduced by the simple two-state picture, 
because the energy difference and the dipolar tran¬ 
sition moment between the highest occupied and 
lowest unoccupied molecular orbitals (HOMO and 
LUMO) as well as the difference of dipole moment 
between the ground and excited states become 
constant for very large chains [1]. Nevertheless, the 
saturation regime is only reached for extended 
chains; for example, in the case of a a, co~ 
nitro,amino-polyacetylene systems, 18 carbon 
atoms are necessary in order to reach the inflection 
point (where the increase of begins to slow 
down), whereas fi L is not yet constant for a chain 
containing 28 carbon atoms [7], So the formula, 
/3 l = kN 17 , which is generally used to fit the expo¬ 
nential grow of fi L with chain length (N), is only 
valid for small chain length and, in order to fit the 
global shape of the evolution of /3 L , one has to use 
more elaborated functions [7, 8]. To compensate 
this drawback of a smaller fi L /N (longitudinal first 
hyperpolarizability per unit cell) for large chains. 


one could think of an asymmetric linker which 
will also contribute to the enhancement of jS L . If 
the use of such a segment prevents the saturation 
of (3 L when chain length increases, it is however to 
the prejudice of a smaller electronic delocalization 
along the backbone (as shown, for small oligomers, 
by Tsunekawa et al. [9, 10]), and thus /3 L of 
medium chains is smaller than for the correspond¬ 
ing symmetric linker [7]. Another idea is to bring 
more tt electrons to the chain by using triply 
bonded structures. This has sometimes been pro¬ 
posed to optimize the NLO properties, although it 
has not produced experimental [11] or theoretical 
[1, 12, 13] larger fi L values than the corresponding 
doubly bonded structures with the exception of 
the smallest systems. 

Correlated semi-empirical calculations of the 
evolution with chain length of /3 L of push-pull 
systems have been performed recently; we refer 
the reader to Ref. [13] for a summary of these 
investigations. However, to our knowledge, no 
work has addressed the electron correlation effects 
on ab initio [3 L in a series of oligomers of increas¬ 
ing size. Only a few recent calculations have ad¬ 
dressed the electron correlation effects upon a L 
and y L in oligomeric series. First calculations on 
model hydrogen chains have been carried out 
[14-16] and have shown that, in the case of an 
infinite chain, the ratio between the MP4 and cou¬ 
pled Hartree-Fock (CHF) a L varies between .97 
and .90 with the 6-311G** basis set, while the 
corresponding y L ratio is 1.31. For polyacetylene, 
polyyne, polypyrrole the ratios between the MP2 
and CHF y L are 2.84, 8.0 and 2.2, respectively, with 
the 6-31G basis set [17, 18]. 

Our study is organized as followed. A brief 
presentation of the methods is given and followed 
by a detailed discussion of the results. In particu¬ 
lar, comparisons with the nonsubstituted chains 
are carried out. Finally, we summarize and con¬ 
clude. 


Methodology 

STATIC HYPERPOLARIZABILITIES 

The application of a static uniform electric field 
(E) on a system leads to a reorganization of its 
charge distribution. This reorganization is charac¬ 
terized by representing either the dipole moment 
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(|x) or the energy (e) as a power series in E: 

1 1 

|x = (ulq 4 - ol • E T- —- p* EE + —y : EEE + • • •, 

(1) 

1 1 

* = e o - Po * E - EE “ > EEE -/ 

( 2 ) 


where the subscript characterizes the order of the 
electron-electron interactions, and the superscript 
describes the type of substitutions used in the 
intermediate states; S stands for single substitu¬ 
tions, D for double substitutions, T for triple sub¬ 
stitutions. 6 4 q+r is the energy term coming from 
the disconnected quadruple substitutions and the 
associated renormalization. For P L (P L = a L , fi L , or 
y L ), similar equations may be written: 


where e 0 and jx 0 are the energy and the dipole 
moment of the molecule in the absence of an 
external electric field, respectively; a is the electric 
dipole polarizability tensor which represents the 
linear response of the dipole moment to the exter¬ 
nal electric field. P and y are the first and second 
hyperpolarizability tensors which correspond to 
the nonlinear phenomena. When the molecule is 
extended primarily in one direction, the longitudi¬ 
nal component of the (hyper)polarizabilities is the 
largest because the electrons are primarily delocal¬ 
ized along the backbone of the molecule. This is 
why we have restricted our investigation to this 
component. 

If the Hellmann-Feymann [19, 20] theorem is 
fulfilled, i.e., if the wave functions are optimized 
variationally with respect to all parameters, the 
quantities in Eqs. (1) and (2) (jx 0 , a, ...) are equiv¬ 
alent. This theorem is satisfied by Hartree-Fock 
(HF) wave functions, however it is not the case of 
the Moller-Plesset ones. Therefore, there exist two 
ways (leading to different numerical values) to 
compute the (hyper)polarizabilities; either via a 
differentiation of the field-dependent dipole mo¬ 
ment or via a differentiation of the field-dependent 
energy, the last one being chosen in the present 
work. 

MOLLER-PLESSET PARTITIONING 

Electron correlation corrections have been con¬ 
sidered within the Moller-Plesset partitioning 
scheme [21] which leads to successive corrections 
(MP2, MP3,...) which are consistent with elec¬ 
tron-electron interactions through second, third, 
etc. order. This technique presents the advantage 
to be size-consistent, i.e., to lead to the correct 
behavior when the size of the system increases. 
The total MP4 energy can be decomposed into 
different contributions: 


p — p 4- p D 
1 MP2 1 HF ^ 1 2 / 


^MP3 — P 


_l pD 
MP2 ^ 1 3 ' 


+ P 4 , 


(4a) 

(4b) 

(4c) 


MP4 


= P mp3 + P i + PP + PI + P 4 Q + R , (4d) 


where P MP4 is the MP4 value of P L , while P 4 is 
the fourth-order correction to P L , i.e., the value to 
be added to P MP3 to obtain P MP4 . Equations 4(a-d) 
permit to evaluate the relative importance of each 
type of excitations in the total P MP4 value. It is 
important to stress that all these terms are size 
consistent. The inclusion of the electron correlation 
requires the definition of an active space, i.e., a 
space where the electron substitutions occur. In 
our case, this space is the whole orbital space 
except for the core orbitals. When truncating 
Moller-Plesset schemes in that way, they are no 
more size consistent but, has shown in previous 
studies [22], the computed NLO properties are 
almost unaffected by this approximation. 


FINITE-FIELD PROCEDLRE 

The NLO properties of the various molecules 
have been computed by adopting the finite-field 
(FF) method. This choice has been dictated by the 
lack of analytical techniques for computing the 
(hyper)polarizabilities within the Moller-Plesset 
schemes. In particular, we have considered the 
field dependence of the energy. So, the finite- 
difference expressions of the static (hyper)polariz- 
abilities are given by: 



d 2 e(2 k E L ) 
d(2 k E L f 


e l =o 


(5a) 


: MP4 


f 4- + *: s + c D 4 . 4 - ^Q+ R 

6 HF ^ e 2 ^ e 3 ^ e 4 ^ e 4 ^ e 4 ^ e 4 


(3) 


= lim 


o 


2e(0) - e( -2 k E L ) - e(2 k E L ) 
(2 k E,f 


(5b) 
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0°'* = 


d 3 e(2 k E ,) 


d{2 k E L ) 
e(-2 k+x E ,) - 2e(-2 k E,) 


= lim 

El-0 


+ 2e(2* ; E, ) - e(2 k+1 E ,) 


2(2 k E,) 


y° ,k = 


d i e(2 k E ,) 


(5c) 


(5d) 


(5e) 


d(2 k E L ) 

-e(-2 t+1 E L ) + 4e(2 k E h ) - 6e(0) 


= lim 


+ 4e( — 2 fc £,) - e(2 <:+1 E,) 


0 


(2 k E,y 


(50 


where the basic field amplitude, E h , is a chosen 
constant, k (k = 0,1,...) defines the lowest 
nonzero field amplitude used in each calculation. 
The field amplitudes are following a power of two 
pattern to permit the use of the Romberg proce¬ 
dure (see below). The 0 superscript stands for the 
zeroth iteration within the Romberg procedure. 
These equations are equivalent to those given by 
Sim et al. [2], if one takes into account the different 
definitions of /3 and y in Eq. (2). Indeed, Sim and 
co-workers took an experimentalist's convention, 
developing the energy as: e = e 0 — |ul 0 • E — \a: 
EE - ;EEE-. 

Only the energies corresponding to different 
electric fields are necessary to use these FF formu¬ 
las; however, this technique presents two major 
drawbacks: First, the energy has to be known at 
several different electric fields for the high-order 
hyperpolarizabilities; second, the errors associated 
with the numerical derivative formulas may not be 
negligible. In practice, the electric fields at which 
the energy is computed have to be small enough to 
satisfy the E L —> 0 condition, to avoid SCF conver¬ 
gence problems, and to minimize high-order hy¬ 
perpolarizability contaminations but large enough 
to obtain a sufficient number of significant digits 
in the numerical differentiation which is especially 
critical for the second hyperpolarizability. 

The accuracy of the FF results is improved by 
the use of the Romberg's procedure [23], which 
removes the higher-order hyperpolarizability con¬ 
taminations. Starting with the values obtained by 
using different k in the Eq. (5), one could system¬ 
atically improve the hyperpolarizabilities values 


by applying the iterative formula [16]:* 


pp, k = 




4 p 


pp-1, k + l 


( 6 ) 


where p is the number of Romberg iterations. This 
improves the NLO property estimates to the preju¬ 
dice of additional computational costs. During our 
calculations, we have chosen the following basic 
field amplitude: E L = 8.10 -4 a.u. (1 a.u. of electric 
field = 5.1422 10 11 V mT 1 ) and k takes the values 
0, 1, 2, 3, and 4. This number of electric fields 
allows us to compute up to a 4,0 , /3 3/0 , and y 3/0 . As 
expected, y L is the most difficult quantity to calcu¬ 
late due to the high order of the numerical deriva¬ 
tive used in Eq. (5); while the a L values obtained 
after a unique Romberg's procedure are almost 
clear of any inaccuracies. In addition to this careful 
choice of the electric field amplitudes, the SCF 
convergence criterion has been tightened to 10“ 13 
a.u. in the Gaussian94 program [24], which permits 
us to meet a ~ 10 -2 a.u. precision on the a L , 
~ 1CT 1 a.u. on fi L , and ~ 10 1 a.u. on y L . 


Results and Discussion 

Two conjugated linkers [polyacetylene (PA) and 
polyyne (PY)] and two double-^ basis sets are 
used in this work: 6-31G and 6-31G*. The 6-31G* 
basis is biased toward the representation of elec¬ 
tron correlation rather than (hyper)polarizabilities, 
and generally it gives too small values for the 
(hyper)polarizabilities. It is difficult to predict 
which basis set will provide the more accurate 
correlated (hyper)polarizabilities estimates. It was 
shown in previous studies that the addition of p 
functions on the hydrogens of conjugated 
molecules is not crucial in order to obtain accurate 
estimates of the correlated longitudinal 
(hyper)polarizabilities values [18, 22]. This is re¬ 
lated to the fact that the NLO responses of these 
systems are mainly due to the 7r-electron delocal¬ 
ization. For all systems, the geometry has been 
fully optimized at the Hartree-Fock and at the 
MP2 levels of approximation using thresholds cor¬ 
responding to the tight keyword in the Gaussian94 
program [24]. For the sake of simplicity, we do not 
present here the geometrical parameters obtained 
by those optimizations. We refer to the work of 
Tsunekawa and Yamaguchi [10] for a description 

* In the Eq. (8) of Ref. [16], one should replace the numerator 

4V‘ U “ 7 r hk ~' 1 by4Py£-'‘ k - 
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of the results of the RHF/6-31G (RHF = restricted 
HF) geometries of the doubly bonded structures. 

Our results are displayed in Tables I, II, and III 
for the static longitudinal polarizability, first and 
second hyperpolarizabilities, respectively. Figures 
1, 2, and 3 show the ratios between the different 
components of the total MP4 correction and the 
HF values; these results have been obtained for the 
RHF/6-31G optimized structures with the 6-31G 
basis set. 

POLARIZABILITY 

From Table I, it is clear that, for small chains, it 
is mostly the size rather than the nature of the 
conjugated segment that determines the a L ampli¬ 
tude. Indeed, the differences in a L values noticed 
when using triple bonds instead of double bonds 
is relatively small. Nevertheless, the PA-type linker 
gives always the largest a L . is more sensitive 
to the nature of the conjugated linker than a HF . 
For small chains, this MP2 correction is always 
positive, i.e., it leads to an increase of a L , the 


opposite being true for longer chains. The relative 
importance of the MP2 correction (a^/a HF ) de¬ 
creases with chain length (■— 9, 6 and 4% for the 
PA monomer, dimer, and trimer, respectively), see 
Figure 1. However, it is not clear that this trend 
will go on for longer chains. The MP3 and MP4 
corrections are smaller than the MP2 one for PA 
oligomers; however, for the PY dimer a 4 is the 
largest component of the total Moller-Plesset cor¬ 
rection. If a f is negative (and decreases with in¬ 
creasing chain length, as shown on Fig. 1), the sign 
and the ratio a^/a 4 varies a lot with the nature 
and length of the conjugated linker. In general the 
Hartree-Fock and MP4 results are in quite good 
agreement, and, in the case of the PA trimer, the 
correction is so negligible that it is not interesting 
to compute a MP4 compared to a HF . 

The ratios ot MP4 /a MP2 found by us are very 
close to the ones computed by Toto and co-workers 
[18] on nonsubstituted PA chains. For the dimer 
and trimer they found 0.981 and 0.969, respec¬ 
tively, while we have 0.982 and 0.970 using the 
6-31G basis and the RHF/6-31G optimized struc- 


TABLE I 


Static longitudinal polarizability (a.u.) of NH 2 — R — N0 2 . a 


R 

Geometry 

Basis 

a HF 

a MP2 

a MP3 

a MP4 

X 

o 

II 

X 

o 

HF/6-31G 

6-31G 

70.27 

76.09 

76.22 

75.42 


MP2/6-31G 

6-31G 

75.89 

83.31 

83.23 

81.43 


HF/6-31G 

6-31G* 

70.10 

76.33 

75.38 

75.66 


MP2/6-31G 

6-31G* 

75.53 

82.22 

81.94 

81.63 


HF/6-31G* 

6-31G 

66.54 

71.74 

71.37 

71.44 


MP2/6-31G* 

6-31G 

71.69 

78.06 

77.91 

77.04 


HF/6-31G* 

6-31G* 

66.63 

72.29 

71.08 

71.93 


MP2/6-31G* 

6-31G* 

71.50 

78.22 

77.01 

77.31 

OJ 

X 

o 

II 

X 

o 

HF/ 6-31G 

6-31G 

144.45 

152.99 

150.93 

150.24 


MP2/6-31G 

6-31G 

156.31 

167.00 

164.66 

161.81 


HF/6-31G* 

6-31G* 

134.81 

144.01 

139.67 

142.52 


MP2/6-31G* 

6-31G* 

146.88 

158.26 

154.05 

155.53 

CO 

X 

o 

II 

X 

o 

HF/6-31G 

6-31G 

239.49 

246.74 

235.98 

239.25 


MP2/6-31G 

6-31G 

263.21 

271.92 

260.49 

260.32 


HF/6-31G* 

6-31G* 

221.26 

233.16 

220.36 



MP2/6-31G* 

6-31G* 

248.79 

263.47 

250.50 


(CH=CH) 4 

HF/ 6-31G 

6-31G 

358.96 

354.49 



c=c 

HF/6-31G 

6-31G 

67.80 

71.57 

70.69 

72.46 


MP2/6-31G 

6-31G 

74.60 

78.38 

78.00 

78.99 


HF/6-31G* 

6-31G* 

65.28 

70.46 

68.53 

71.11 


MP2 / 6-31G* 

6-31G* 

71.65 

77.15 

75.38 

77.67 

o 

III 

o 

K3 

HF / 6-31G 

6-31G 

136.06 

137.37 

135.68 

141.80 


MP2/6-31G 

6-31G 

151.14 

150.28 

150.39 

155.23 

CO 

o 

III 

o 

HF / 6-31G 

6-31G 

221.89 

217.27 



a 1 a.u. of polarizability = 1.6448 x 10 41 

C 2 m 2 J-'. 
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ture. However, the values of ot MP2 /a HF are differ¬ 
ent, as, for nonsubstituted PA, the polarizability 
always decreases when the electron correlation is 
included, while for substituted chains it is only the 
case for the tetramer (and most probably for longer 
chains). For the dimer, trimer, and tetramer To to et 
al. [18] found 0.855, 0.831, and 0.816, respectively, 
while we have 1.06,1.03, and 0.987 using the 6-31G 
basis and the RHF/6-31G geometry. Our chains 
are of course too small to show a convergence 
behavior. 

The triple excitations contribute largely to the 
total MP4 correction; and in the case of the trimer, 
a[ is even larger than a 4 , as shown by Figure 1. So 
the MP4(SDQ) values are not more accurate than 
the MP3 ones. This conclusion has been reached 
many times before for conjugated molecules 
[18,22]. 

When optimizing at the MP2 level, the bond 
length alternation (BLA) decreases, i.e., the differ¬ 
ence of bond length between single and double (or 
triple) bonds decreases and a L increases (~ + 7%). 
The addition of d polarization functions on the 


second-row atoms during the optimization process 
leads to an increase of the BLA and a decrease of 
a L (~ —6%). In that respect, it is striking to note 
that for NH 2 —CH = CH—N0 2 , the difference 
between the MP4/6-31G//MP2/6-31G (finite- 
field calculation based on MP4 energies computed 
on the MP2 optimized geometry with the 6-31G 

basis set) and MP4/6-31GV/MP2/6-31G* <*l is 

mostly due to the variation of geometry and not to 
the addition of d functions during the FF calcula¬ 
tions. 

FIRST HYPERPOLARIZABILITY 

Again for the first hyperpolarizability, the MP2 
contribution to the total MP4 correction is by far 
the largest, and the ratio /3 MP2 //3 HF is relatively 
stable for a given conjugated linker, showing only 
small decrease with chain length. For the PA-like 
chains, this ratio is 2.70, 2.44, 2.26, and 2.25 for the 
monomer, dimer, trimer, and tetramer at the 
RHF/6-31G level (Fig. 2). It is striking to note that 
for the infinite chain this ratio is undetermined 


TABLE II_ 

Static longitudinal first hyperpolarizability (a.u.) of NH 2 — R — N0 2 . a 


R 

Geometry 

Basis 

/3hf 

/3mP2 

^MP3 

Pmpa 

X 

o 

II 

X 

o 

HF/6-31G 

6-31G 

-313.4 

-846.9 

-814.1 

-824.0 


MP2/6-31G 

6-31G 

-379.2 

-1095.7 

-1017.2 

-1058.4 


HF/6-31G 

6-31G* 

-246.4 

-650.6 

-600.1 

-643.9 


MP2/6-31G 

6-31G* 

-308.5 

-826.5 

-769.7 

— 848.0 


HF/6-31G* 

6-31G 

-263.8 

-706.9 

-680.8 

-688.0 


MP2/6-31G* 

6-31G 

-323.9 

-902.2 

-853.2 

-874.1 


HF/6-31G* 

6-31G* 

-204.8 

-544.4 

-500.5 

-537.7 


MP2/6-31G* 

6-31G* 

-256.5 

-699.1 

-634.1 

-689.4 

(CH=CH) 2 

HF/6-31G 

6-31G 

-1494.1 

-3649.9 

-3713.5 

-3700.4 


MP2/6-31G 

6-31G 

-1798.1 

-4639.8 

-4630.9 

-4627.1 


HF/ 6-31G* 

6-31G* 

-1025.7 

-2449.0 

-2340.2 

-2495.9 


MP2/6-31G* 

6-31G* 

-1266.2 

-3139.3 

-3022.1 

-3184.7 

(ch=ch) 3 

HF / 6-31G 

6-31G 

-4272.8 

-9635.1 

-9710.6 

-9771.8 


MP2/6-31G 

6-31G 

-5219.2 

-12438.7 

-12513.5 

-12397.7 


HF / 6-31G* 

6-31G* 

-2951.5 

-6558.7 

-6137.6 



MP2/6-31G* 

6-31G* 

-3811.4 

-8844.8 

-8567.7 


(CH = CH) 4 

HF / 6-31G 

6-31G 

-8168.6 

-18459.1 



c=c 

HF/ 6-31G 

6-31G 

-524.7 

-828.5 

-822.8 

-754.8 


MP2/6-31G 

6-31G 

-653.6 

-1112.5 

-1083.2 

-988.1 


HF / 6-31G* 

6-31G* 

-341.9 

-528.9 

-500.2 

-490.8 


MP2/6-31G* 

6-31G* 

-456.4 

-720.5 

-687.9 

-661.3 

CM 

o 

III 

o 

HF/6-31G 

6-31G 

-1550.6 

-2436.4 

-2360.0 

-2233.4 


MP2/6-31G 

6-31G 

-1999.1 

-3311.8 

-3199.5 

-2931.5 

CO 

o 

III 

O 

HF/6-31G 

6-31G 

-3057.0 

-4651.8 



a 1 a.u. of first hyperpolarizability = 3.2063 x 10 53 C 3 m 3 J 2 . 
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TABLE III__ 

Static longitudinal second hyperpolarizability (10 1 a.u.) of NH 2 — R — N0 2 . a 


R 

Geometry 

Basis 

Thf 

y mp 2 

y mp3 

?MP4 

o 

X 

o 

X 

HF/ 6-31G 

6-31G 

742 

1409 

1419 

2116 


MP2/6-31G 

6-31G 

1030 

2035 

2060 

3207 


HF/ 6-31G 

6-31G* 

618 

999 

1014 

1414 


MP2/6-31G 

6-31G* 

939 

1600 

1640 

2317 


HF/6-31G* 

6-31G 

765 

1555 

1466 

1988 


MP2/6-31G* 

6-31G 

866 

1716 

1689 

2429 


HF/6-31G* 

6-31G* 

668 

1217 

1139 

1484 


MP2/6-31G* 

6-31G* 

728 

1248 

1230 

1707 

CM 

X 

o 

II 

o 

HF/6-31G 

6-31G 

6660 

17369 

18092 

21632 


MP2/6-31G 

6-31G 

8652 

24123 

24689 

30515 


HF/6-31G* 

6-31G* 

4950 

11790 

11336 

13605 


MP2/6-31G* 

6-31G* 

5784 

14181 

14056 

17126 

CO 

X 

o 

II 

X 

o 

HF/6-31G 

6-31G 

34321 

91241 

94194 

106614 


MP2/6-31G 

6-31G 

43544 

125193 

129089 

147598 


HF / 6-31G* 

6-31G* 

24463 

59672 

56391 



MP2/6-31G* 

6-31G* 

29629 

76354 

76241 


(CH=CH) 4 

HF / 6-31G 

6-31G 

99351 

269353 



c=c 

HF/ 6-31G 

6-31G 

1982 

3864 

3706 

4146 


MP2/6-31G 

6-31G 

2553 

5405 

5156 

5787 


HF/ 6-31G* 

6-31G* 

1382 

2544 

2270 

2711 


MP2/6-31G* 

6-31G* 

1776 

3270 

3035 

3525 

CM 

o 

III 

o 

HF/6-31G 

6-31G 

11234 

22117 

20392 

22757 


MP2/6-31G 

6-31G 

14162 

31220 

28811 

31577 

(C=C) 3 

HF/6-31G 

6-31G 

31400 

61949 




a 1 a.u. of second hyperpolarizability = 6.2352 x 10 65 C 4 m 4 J 3 . 



FIGURE 1 . Ratios between the Moller-Plesset corrections and the Hartree-Fock static polarizability values for 
a,a>-nitro,amino polyene chains. These results have been obtained with the 6-31G basis set on the RHF/6-31G 
optimized structures. 
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FIGURE 2. Ratios between the Moller-Plesset corrections and the Hartree-Fock static first hyperpolarizability values 
for a^nitro,amino polyene chains. These results have been obtained with the 6-31G basis set on the RHF/6-31G 
optimized structures. The ft® //3 HF ratio has been divided by a factor 2 for the purpose of visibility. 


because the PA has a zero /3 L . For comparison the 
ratio found, by a similar method, is 1.62 for the 
p-nitro-aniline [2] and —4.48 for the polymeth- 
ineimine (PMI) dimer [22]. (These PMI oligomers 
are similar to PA, but half of the carbon atoms are 
replaced by nitrogen atoms.) This ratio slightly 
increases when the geometry optimization is car¬ 
ried out at the MP2 level, as a result of the larger 
electronic delocalization along the backbone but is 
almost the same for both basis sets. For PA, with 
the 6-31G basis set, we have I /3 3 D | > | /3 4 |, but their 
relative sign changes as a function of the geome¬ 
try, the basis set, as well as the length of the 
conjugated segment. However, with the same ba¬ 
sis set, in PY chains the opposite is true: | /3 3 D | < 
|/3 4 |. The contribution of the triple excitations to 
the total MP4 correction is generally large: | /3 4 T | > 
|jS 4 |. Moreover it is striking to note the relative 
stability with chain length of most of the ratios 
drawn at Figure 2. 

Note the importance of the electron correlation 
corrections in the ordering of the doubly versus 
triply bonded structures. Indeed, for the monomer 
and dimer, the j 6 MP4 values order in the opposite 
direction to /3 HF ones; while the ordering given by 
the MP2 or MP3 technique changes as a function 
of the basis set and the geometry used. However, 


for the trimer, it becomes clear, even at HF level, 
that the PA linker leads to larger fi L . 

Under a geometry optimization at the MP2 level, 
the jS L values increase significantly (from +20 to 
+ 40%) compared to the values obtained on the 
corresponding HF optimized geometry. The addi¬ 
tion of polarization functions on the carbon atoms 
leads to a decrease of /3 L (from —30 to —40%). 
This basis set effect can be divided into two com¬ 
ponents: a direct one, coming from the variation of 
the computed property, and an indirect one, com¬ 
ing from the change on geometry. Although, for 
NH 2 —CH = CH—N0 2 both components are im¬ 
portant (which was not the case for the polarizabil¬ 
ity) and have the same sign, the sum does not 
reproduce exactly the full 6-31G* results, showing 
that they are not purely additive. 

SECOND HYPERPOLARIZABILITY 

In the computation of y L , similarly to /3 L , the 
MP2 correction is of more importance, and gener¬ 
ally, one can notice, on Figure 3 that yf > y HF . 
Moreover, the ratio Tmp 2 /Thf increases with chain 
length, which goes in opposition to the nonsubsti- 
tuted PA chains. For the monomer, dimer, trimer, 
and tetramer of nonsubstituted PA, Toto and co- 
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FIGURE 3. Ratios between the Moller-Plesset corrections and the Hartree-Fock static second hyperpolarizability 
values for a,co-nitro,amino polyene chains. These results have been obtained with the 6-31G basis set on the 
RHF/6-31G optimized structure. 


workers [18] found the ratio to be 3.05, 2.20, 2.00, 
and 1.92, respectively, while we have 1.90, 2.61, 
2.66, and 2.71 using the 6-31G basis and the 
RHF/6-31G geometry. For small chains y A has a 
magnitude comparable to yf for the CH=CH 
fragment, but the relative importance of the 
fourth-order correction decreases with increasing 
chain length. Indeed, y 4 /y H F decreases when shift¬ 
ing from the dimer to the trimer, while y^/y H F 
still increases (see Fig. 3). T3 D changes of sign 
according to the linker studied but is smaller by 
one order of magnitude than both yf and y 4 . In 
the case of y L the triple excitations do not con¬ 
tribute largely to the total MP4 correction, and 
thus the relatively cheap MP4(SDQ) method may 
be accurate. 

The substituted PY monomer and dimer present 
large y L values in comparison to the correspond¬ 
ing substituted PA molecules. This means that the 
largest polarizability does not systematically lead 
to the largest second hyperpolarizability; although 
for long chains this is going to be true. The ratio 
y MP2 /y HF (each being computed at the corre¬ 
sponding 6-31G optimized geometry) of the substi¬ 
tuted (nonsubstituted) PY dimer is 2.78 (2.27, see 
Ref. [17]). 

The effects of the geometry and the basis set on 
y L are comparable to the ones noticed for a L and 
j8 L , i.e., the MP2 geometry leads to larger y L val¬ 
ues; the addition of polarization function to smaller 


one. Like /3 L , the amplitude of y L depends on the 
geometry and the basis set used during the FF 
calculations. 


Conclusions and Outlook 

We have carried out Moller-Plesset calculations 
of the static polarizability, first and second hyper¬ 
polarizabilities of polyacetylene and polyyne link¬ 
ers end-capped by nitro and amino groups. The 
MP2 correction, which leads to significant in¬ 
creases of the hyperpolarizability estimates, is gen¬ 
erally much larger than the MP3 and MP4 ones, 
except in the cases of the second hyperpolarizabil¬ 
ity of small PA chains and of the polarizability of 
the PY dimer. If the absolute values of NLO prop¬ 
erties are very different for substituted or nonsub¬ 
stituted chains, the ratios P M p 2 /^hf (and espe¬ 
cially P M P4 /P MP2 ) are quite close, especially for 
medium-size oligomers. PA linkers always lead to 
larger static (hyper)polarizabilities than the PY 
chains, except in the case of y L of small oligomers. 
For a L and /3 L , P M p 2 /^hf decreases with increas¬ 
ing chain length, while for y L this ratio increases. 

Under optimization at the MP2 level, the bond 
length alternation decreases, and thus the 
(hyper)polarizabilities increase; while the addition 
of d polarization functions on the second-row 
atoms decreases the NLO estimates. 
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In our calculations we have neglected the vibra¬ 
tional contribution to the (hyper)polarizabilities; 
however, these contributions may be of impor¬ 
tance, as recently reviewed [25]. For example, it 
was shown that the static vibrational contributions 
to /3 l is more than twice the electronic contribu¬ 
tions for the small N0 2 —(CH=CH) N —NH 2 
chains at the RHF/6-31G level within the double 
harmonic approximation. To our knowledge, no 
Moller-Plesset correction of the vibrational first 
hyperpolarizability has been computed yet for such 
push-pull systems, and it is still impossible to 
know how the relative weight of the vibrational 
contributions will change when electron correla¬ 
tion corrections are included. 

Calculations including the electron correlation 
at the MP2 level of substituted PA oligomers of 
medium and large size are now carried out in our 
laboratory in order to address the saturation 
behavior. 
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ABSTRACT: Static electronic and vibrational longitudinal first hyperpolarizabilities 
(/3£(0) and BIX 0)) of a series of mono- and disubstituted benzenes were calculated at the 
Hartree-Fock 6-31G level by using coupled Hartree-Fock and the double harmonic 
oscillator approximations, respectively. Although the 0)/P[(0) ratio is slightly larger 
than unity and rather constant with respect to the substituent(s), it turns out, for the 
monosubstituted compounds, that the variations of /3£(0) upon substitution can be most 
accounted for by the mesomeric effects, whereas for /3 L T ’(0), the inductive effects are also 
of importance. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 689-696, 1997 


Introduction 

Q uantum chemical evaluations of the polariz¬ 
abilities (a), first and second hyperpolariz¬ 
abilities (fi and y), and their interpretation in 
terms of structure-property relationships help in 
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Contract grant number: D.4511.93. 
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the design of new compounds for nonlinear optical 
(NLO) applications [1]. Recently, particular atten¬ 
tion has been given to the vibrational contributions 
to these properties [2]. In fact, although the appli¬ 
cation of an external electric field induces simulta¬ 
neous distortions of the electron and nuclear mo¬ 
tions, in the clamped nucleus or canonical treat¬ 
ment, one considers that the field acts sequentially 
on the electron and nuclear motions [3-4]. This 
leads to the distinction between the electronic (cd, 
/ 3 e , and y e ) and vibrational (cd, (3 v , and y v ) (hy- 
per)polarizabilities. The present study copes with 
the evaluation of the electronic and vibrational 
static longitudinal first hyperpolarizabilities [ /3 L C (0) 
and BlXO)] of a series of mono- and disubstituted 
benzenes. Since the first investigations have 
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demonstrated that /3j v can be substantial for conju¬ 
gated molecules in the static limit [2], it can be 
optimized to maximize the dc-Pockels (dc-P) and 
optical rectification (OR) responses. At optical fre¬ 
quencies, the vibrational component to the second 
harmonic generation (SHG) response is negligible. 
A similar strategy has already been followed for 20 
years now for the electronic counterpart as re¬ 
viewed by Kanis et al. [5]. 

Zerbi and co-workers [6-8] found that, for many 
second-order conjugated molecules, the calculated 
static electronic and vibrational hyperpolarizabili¬ 
ties are very similar. Using experimental data, they 
concluded that P v /fi e is close to unity. In the 
latter case, the vibrational first hyperpolarizability 
is evaluated from infrared and Raman absorption 
spectra, whereas electric field-induced SHG 
(EFISHG) measurements give the electronic coun¬ 
terpart which—to be correct—would have been 
required to be extrapolated to zero frequency. 
Consequently, they argued that such a coincidence 
is not fortuitous and that f3 v and /3 C are just two 
manifestations of the same underlying physical 
phenomenon. They made the connection with the 
strong electron-phonon coupling along the coordi¬ 
nate which is associated with the variation of the 
bond-length alternation (BLA). A first quantitative 
demonstration of their assumption was recently 
elaborated on by Castiglioni et al. [9]. However, in 
addition to its restriction to the two-state approxi¬ 
mation for computing /3 e , whereas f3 v is evalu¬ 
ated at the double harmonic oscillator approxima¬ 
tion by neglecting all the modes but the BLA 
mode, their approach neglects an important contri¬ 
bution [10], The rationalization of the fi v //3 e ratio 
value is even made more difficult by a few recent 
results concerning push-pull conjugated materials 
which demonstrate the dependence of the fi v //3 e 
ratio upon both the chain length and the nature of 
the linker [2]. 

This lack of satisfactory quantitative justifica¬ 
tion for the relationship between v and /3 e in 
these conjugated systems makes the subject very 
attractive. In the present study, the relationship 
between /3 V and [particularly between P[(0) 
and Pi(0)] is tackled from another side, i.e., the 
P[(0) and 0) quantities are analyzed in terms of 
both the inductive and the mesomeric substituent 
constants. Then, these relationships to simple 
physical concepts are compared. A similar ap¬ 
proach was followed by Hehre et al. [11] to study 
the charge distributions and nuclear magnetic 
shifts in a series of monosubstituted benzenes and 


in m- and p-substituted fluorobenzenes. In addi¬ 
tion, a few investigations of second-order NLO 
materials have demonstrated that a satisfactory 
linear dependence relates fi e to the parameters 
(see below for their description). These experimen¬ 
tal or theoretical investigations of mono- and di- 
substituted benzenes [12], stilbenes [12, 13], 
polyenes [14], and polymethineimines [14] showed 
that this approach could be used by the organic 
chemists to elaborate new promising materials for 
NLO and to predict their properties from the na¬ 
ture and positions of the donors and acceptors. 
This work is organized as follows: The next section 
briefly summarizes the methodological and com¬ 
putational aspects, the third section presents the 
results and elaborates an interpretation, whereas 
we conclude in the fourth section. 


Methodological and Computational 
Aspects 

The /3£(0) values were evaluated at the coupled 
Hartree-Fock (CHF) [15] level with the split-va¬ 
lence 6-31G basis set [16]. This approach, which is 
equivalent to a Hartree-Fock finite-field proce¬ 
dure, takes explicitly into account the field-in¬ 
duced electron reorganizational effects. On the 
other hand, the vibrational counterpart, /3^(0), is 
obtained at the RHF/6-31G level within the dou¬ 
ble harmonic oscillator approximation [17-19] and 
reads 


Pi( 0) = [ /xa] 


0,0 

L; co=0 



l d(JL ^ 

dQ 


n I 0 


da L \ 

\^Q« 



( 1 ) 


where 0,0 indicates that no electrical or mechani¬ 
cal anharmonicity is included. The sum runs over 
all the vibrational normal modes of frequency o) a . 
The (dP L /dQ a ) 0 with P L = f.i L , a L , which are the 
derivatives of the property with respect to the 
normal coordinate Q n/ are evaluated at the equilib¬ 
rium geometry. 

For optical processes, both and f3[ are mod¬ 
ulated, although in very different ways. Indeed, 
the frequency dispersion of (3[ has been shown to 
follow the relation [20] 

fii ( — o ) (T ; at 1 , (o 2 ) = /3[ (0) [ 1 + A a>l + Bcol + * * * ], 

( 2 ) 
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where (o a = (o x + co 2 and col ~ ^ + coj + (o\. A 
and B are parameters depending upon the 
molecule, but only B depends upon the NLO pro¬ 
cess. On the other hand, by adopting the enhanced 
[21] or infinite-frequency [22] approximation which 
relies on the fact that the optical frequencies are at 
least one order of magnitude larger than the vibra¬ 
tional ones, one obtains 

(3j V ( co) co, 0) w-> ra — [3 1 (0; Co, to) <y —> oo 

^[m^l'/U O) 

/3/( — 2(o; (Of co)(o^oo = 0. (4) 

In other words, for SHG, /3/ is negligible, whereas 
for the dc-P and OR effects, it amounts to 1/3 of 
the static quantity. This point is obviously of im¬ 
portance for optimizing the responses according to 
the NLO process. 

All the (3 calculations were performed with the 
Gaussian94 program [23] on the RHF/6-31G-opti- 
mized structures. A tight convergence threshold 
on the residual forces was adopted to meet a 
satisfactory accuracy in computing the (dP L /dQ n ) 0 
values. The 6-31G atomic basis set was employed 
because it is known to be reasonably accurate for 
evaluating the hyperpolarizabilities of push-pull 
conjugated systems [5]. Addressing electron corre¬ 
lation corrections is left for future investigations. 
For these second-order NLO materials, a few cal¬ 
culations have already revealed that the inclusion 
of electron correlation effects leads to a 15-50% 
increase in /3^, whereas, to our knowledge, it has 
not yet been addressed for f3/. Further computa¬ 
tional details on these calculations can be found in 
[24]. The molecules have been oriented such that 
the C 1 and C 4 atoms belong to the z- or longitudi¬ 
nal axis. Since our purpose consists of relating f3 e 
and (3 V to the substituent constants, only the f3 zzz 
= f3 L tensor component was analyzed. It is strik¬ 
ing to note that for weak donor or acceptor groups 
/ 3 l is not necessarily the dominant tensor compo¬ 
nent, whereas in the case of the resonant D/A 
pairs, /3 l is always dominant. 


Results and Discussion 

The i± L , /3/;(0), and (3/(0) values are listed in 
Tables I and II for the mono- and disubstituted 
benzenes, respectively, whereas Table III gives the 
various cr parameters employed in this study. As 
shown in the previous investigations [12-14], dif¬ 
ferent types of cr parameters have proven to be 


TABLE I_ 

RHF/6-31G longitudinal component of the dipole 
moment and the static electronic and vibrational 
first hyperpolarizability as a function of the 
substituent in monosubstituted benzenes. 


R 

Ml 

Pl 

pi 

och 3 

0.044 

126.1 

96.8 

OH 

-0.225 

143.4 

43.1 

N(CH 3 ) 2 

0.688 

297.4 

180.8 

nh 2 

0.573 

259.2 

112.5 

F 

-0.965 

108.3 

-59.1 

ch 3 

0.130 

44.8 

23.4 

ch 2 ch 3 

0.116 

-12.4 

8.0 

chch 2 

-0.059 

-37.5 

-61.1 

CCH 

-0.264 

-23.9 

40.7 

SH 

-0.752 

28.5 

16.1 

Cl 

-1.076 

134.3 

-210.1 

SiH 3 

-0.363 

-119.1 

-236.9 

CN 

-1.914 

-56.1 

-56.0 

cf 3 

-1.606 

35.0 

-205.3 

CHO 

-1.262 

-197.3 

-358.8 

coch 3 

-0.939 

-197.7 

-229.2 

co 2 h 

-0.745 

-205.5 

-308.1 

no 2 

-2.294 

-269.7 

-311.3 


All the values are given in au. (1.0 au of dipole moment = 
8.4783610“ 30 cm = 2.541765 Debye; 1.0 au of first hyper¬ 
polarizability = 3.2063 X 10“ 53 C 3 m 3 J ” 2 = 8.641 X 
10“ 33 esu). 

adequate for reproducing the f3 e variations. Most 
often, the para substituent constants (cr , cr/, a/) 
are employed. The empirical cr p parameters were 
originally related by Hammett to the variation of 
the dissociation constant of p-substituted acids, 
the benzoic acids in water at 25°C being the refer- 

TABLE II_ 

RHF / 6-31G longitudinal component of the dipole 
moment and the static electronic and vibrational 
first hyperpolarizability as a function of the 
substituent in disubstituted benzenes 
(all values in au). 


R, R' 

Ml 

Pl 

Pl 

CHO, CH 3 

1.478 

366.6 

518.1 

CHO, F 

0.282 

335.0 

344.6 

CHO, OH 

1.146 

517.7 

607.8 

CHO, NH 2 

2.124 

928.0 

1014.8 

no 2 , ch 3 

2.548 

500.7 

492.3 

no 2 , f 

1.329 

408.9 

298.5 

no 2 , oh 

2.222 

672.4 

704.6 

no 2 , nh 2 

3.230 

1262.8 

1351.6 
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TABLE III_ 

The inductive ( 07 ) and mesomeric (a£, op") 
substituent constants. 


R 



(T +7_ 

a R 

OCH 3 

0.29 

-0.44 


OH 

0.25 

-0.41 

-1.17 

N(CH 3 ) 2 

0.12 

-0.53 


nh 2 

0.11 

-0.48 

-1.41 

F 

0.51 

-0.33 

-0.58 

ch 3 

-0.03 

- 0.10 

-0.28 

ch 2 ch 3 

-0.02 

-0.12 


chch 2 

0.06 

-0.07 


CCH 

0.20 

0.03 


SH 

0.29 

-0.20 


Cl 

0.48 

-0.21 


SiH 3 

0.05 

0.09 


CN 

0.57 

0.08 


cf 3 

0.42 

0.07 


CHO 

0.25 

0.27 

0.79 

COCH3 

0.26 

0.15 


co 2 h 

0.32 

0.29 


no 2 

0.68 

0.19 

0.56 


The a R are the unbiased values associated with nonconju- 
gated reaction centers, whereas the are the dual 

values which express the enhanced mesomeric interaction 
of a donor (a R ) with an acceptor or of an acceptor (<r R ~) 
with a donor. Values taken from [25]. 

ence [25]. Obviously, these a p parameters involve 
both inductive and mesomeric effects, whereas the 
latter ones are overestimated for medium/strong 
donor substituents because they interact reso¬ 
nantly with the acceptor acid group. This is why 
unbiased and dual (a p , <r~) parameters are 
required to express the extreme behaviors. Indeed, 
on the one hand, the a p parameters characterize 
substituents which do not interact with the reac¬ 
tive center, whereas, on the other hand, the a p 
( a p ) parameters include the enhancement of the 
donor (acceptor) character due to its resonant in¬ 
teraction with an acceptor (donor) substituent in 
the para position [25]. A further refinement of this 
parametrization is accomplished by the splitting of 
the different a p into mesomeric or resonant (a R ) 
and inductive or electrostatic ( 07 ) parts. The 07 do 
not depend upon the presence or not of other 
interacting substituents, whereas there exist unbi¬ 
ased and dual <j r constants [25]. In our case, the 
distinction between the 07 and a R effects is of 
importance. We have considered the unbiased 0 * 
values for the monosubstituted benzenes, whereas 
for the disubstituted constants, the dual 0 p“ 
parameters have been employed. Two important 


comments are needed before deducing 0 -relations 
and comparing these with previous studies. First, 
as mentioned by Exner [25], there are errors in the 
substituent constants which are not always un¬ 
equivocally determined. These errors originate 
from both the experimental data and their subse¬ 
quent theoretical manipulations such as these in 
deriving the cr R constants. Second, the 0 p“ values 
are often considered to be equivalent to up¬ 
values because the resonant effect is dominant, 
neglecting the importance of the inductive effects 
such as in the case of the N0 2 group. 

Figure 1 shows that the /3/( 0)//3[(0) ratio is 
rather constant with respect to the nature and 
number of the substituents, the deviation being 
associated mainly with the monosubstituted com¬ 
pounds. The slightly larger amplitude of j3/(0) 
with respect to 0 ) was first analyzed in terms of 
their dependence upon a R (Figs. 2 and 3). The 
least-squares fittings, 

0) = -51.5 - 572.50*° R = 0.93 (5) 

Pi (0) - -134.3 - 524.60 -* 0 R = 0.84, ( 6 ) 

show that the variations in can be better 

accounted for by the mesomeric effects than can 
the corresponding variations in /3 L z; (0). Indeed, the 
correlation coefficient is closer to unity for 
and the constant term of the line, which obviously 
should be equal to zero, is smaller. The removal of 
this constant term leads to poorer fittings of which 
the correlation coefficient is equal to 0.88 and 0.28 
for p[(0) and fil’i 0), respectively. The correspond¬ 
ing standard deviation of the errors is equal to 66 
and 136 au. This good description of the 
variations in terms of 0 * is in agreement with the 
fact that the charge-transfer (fi ct ) contribution to 
P[(0) is known to be by far the largest component, 
the remaining term ( fi add ) being associated to the 
inductive effects. The largest deviations of the cal¬ 
culated P[(0) from the linear fitting estimates con¬ 
cern the CF 3 and N0 2 groups, i.e., substituents 
with small 0 * with respect to op This dominance 
of P ct is even more pronounced for the D/A- 
disubstituted compounds [26-27]. In what con¬ 
cerns the CH 3 , CCH, Cl, and CN sub¬ 

stituents present the largest deviations from the 
linear fitting. In a similar way, for jjl l , the least- 
squares fitting to 0 - R ° is poor: 

fjb L = -0.80 - 2.040*° R = 0.64. (7) 
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FIGURE 3. RHF / 6-31G double harmonic (0) values as a function of a£. 


With the relation to <x R ° being insufficient in sev¬ 
eral instances, the role of 07 needs to be assessed. 
To make it, one can assume that any property, P, 
depends linearly upon both 07 and or R according 
to 

p = Pj°7 + PrOr- (8 ) 

The pj and p R factors which depend upon the 
investigated property are determined by a least- 
squares fitting procedure. One then obtains 

0) = -127.5(7, - 530.7<x R (9) 

Pi(0) = -376.10-; - 424.1 cr R ° (10) 

ji L = - 2.8807 - 1.56o* R °. (11) 

These results support our first idea that the induc¬ 
tive effects have a larger impact on j3 r ( 0 ) than on 
0), whereas they are dominant for p, L . They also 
indicate that for such monosubstituted benzenes 
PlX 0 ) and (3[(0) are not determined by the same 
underlying phenomena. The standard deviation of 
the errors which measures the goodness of these 
fittings is equal to 61, 87, and 0.27 au for P[(0), 
Pi(0), and /x L , respectively, showing a clear im¬ 
provement over the simple linear fitting for j 8 L y ( 0 ) 
and pb L , whereas for /3f(0), both fittings are of 
equivalent quality. Most of the significant devia¬ 
tions from Eqs. (9)-(ll) can be attributed to the 
particular nature of the substituents. For the CHO 


and SiH 3 groups, the /3 L Z ’(0) values are poorly 
reproduced as could be understood from their 
large intrinsic contributions. For Cl but also SiH 3 , 
a part of the deviation can certainly be attributed 
to the lack of d-functions on the Cl and Si atoms. 
Moreover, the /3 L z; (0) estimates for the cyano- and 
acetylenobenzenes are of relatively poorer quality 
than for the other substituents. In the case of /3/(0), 
the estimates for the CF 3 - and CH 2 CH ^sub¬ 
stituted benzenes are particularly bad due to both 
a large deviation and a wrong sign. Obviously, 
improvements of these fitting relations would be 
obtained by adopting higher-level quantum chem¬ 
ical methods, i.e., including electron correlation 
corrections and using a larger atomic basis set. 

Nevertheless, these global double linear rela¬ 
tionships deserve to be considered in the design of 
NLO materials and particularly for their help in 
determining the factors which influence the vibra¬ 
tional and electronic first hyperpolarizabilities. In¬ 
deed, there was no a priori reason to expect a 
linear relation between the /3 L (0)'s and the sub¬ 
stituent constants which are related to variations 
of chemical reaction constants. 

Then, an interesting point consists of estimating 
from Eqs. (9)—(11) the j8f(0), jS L z, (0), and p, L values 
of the substituted compounds and comparing these 
to the calculated ones. For the disubstituted ben¬ 
zenes, the substituent effect is represented by the 
difference of their individual constants, the coop- 
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erative or resonant effects being accounted for by 
taking in the place of <x K °. One can directly 
see inconsistencies between these estimates and 
the calculations. In particular, the /3 L (0)'s of the 
NH 2 -substituted compounds are much larger than 
these of the parent OH-substituted systems al¬ 
though the corresponding changes in cr^/crj are 
rather small. In addition, for the p[(0) of the CH 3 - 
and F-substituted systems, these estimated values 
overestimate the calculated ones. It is our feeling 
that these prediction errors are related to the diffi¬ 
culty of knowing Indeed, there is no reason 
why these values would be the same for the 
solvolysis of cumyl chlorides in 90% aqueous ace¬ 
tone and for ft quantities. In other words, contrary 
to the unbiased constants, the dual parameters can 
be easily considered dependent upon the strength 
of the resonantly interacting group. In this regard, 
it seems obvious that the NH 2 group resonantly 
interacts much more strongly with the acceptor 
group than does the OH group. 

The nonnegligible deviations from the linear 
relation obtained by Ulman [13] and the interpre¬ 
tation given by Cheng et al. [12] of the cr-depen- 
dence of the EFISHG ft values demonstrate also 
that the dual parameters are not determined in an 
absolute way. Indeed, Cheng et al. obtained linear 
relations with different slopes for the para-sub¬ 
stituted nitro- and methoxybenzenes. On the other 
hand, Jacquemin et al. [14] obtained a very good 
correlation constant for polyenes and polyme- 
thineimines substituted by strong or weak donors 
and acceptors. However, it is striking to note that, 
in the latter, the same standard geometry was 
adopted for the linker in order to address the 
electronic effects of the substituents, neglecting, 
therefore, the consequences on the linker geome¬ 
try. Accordingly, the N0 2 /NH 2 pair significantly 
transforms the aromatic benzene ring into a par¬ 
tially quinonoid structure [24]. 


Synopsis and Outlooks 

This RHF/6-31G study points out that the 
A?(0)/ft(0) ratio for mono- and disubstituted ben¬ 
zenes is rather constant with respect to the nature 
of the substituent(s) and is slightly larger than 
unity. With the lack of electron correlation and 
anharmonicity corrections preventing us from 
making a better estimate, these results are in 
agreement with the observation of Zerbi and co¬ 


workers [6-8] who found experimentally and the¬ 
oretically that and — co, a)) are of the 

same amplitude. However, the analysis of the de¬ 
pendence of and P[(0) upon the mesomeric 

and inductive substituent constants in the mono- 
substituted benzenes exhibits different behaviors. 
P[(0) can mostly be described by the unbiased 
mesomeric constants ( cr } ®), whereas its vibrational 
counterpart also depends upon the inductive con¬ 
stants (tr 7 ). Moreover, the intrinsic contribution of 
the substituent is generally larger for /3 L l ’(0) than 
for fi[(0 ). 

Nevertheless, it is important to note that, at 
optical frequencies, this 0)/P[(0) ratio de¬ 
creases to about 0.3 as a consequence of both the 
dispersion-induced increase of and the de¬ 
crease of fif by a factor of 3 as determined in the 
infinite-frequency double harmonic oscillator level 
of approximation. On the grounds of the recent 
results concerning a, co nitroamino polyenes, poly- 
ynes, and polythiophenes which show that the 
P^(0)/P[(0) ratio ranges between 1.1 and 2.8 ac¬ 
cording to the length and nature of the conjugated 
linker, further investigations of these substituent 
effects would deserve to be carried out to address, 
from the chemical point of view, the value of the 
PH(0)/P[(0) ratio. 
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ABSTRACT: Explicit expressions of polarizability and hyperpolarizability are 
formulated by a path integral method in terms of the generalized coherent states of the 
Thouless parametrization. A novel scheme is shown for calculating the molecular 
properties including total energy, and the Monte Carlo method using the metropolis 
algorithm is utilized for performing the actual calculations. As a simple example, the 
hydrogen molecule is examined to demonstrate performances of the present scheme. We 
discuss improvement of the accuracy of the (hyper)polarizabilities depending on the 
number of the division for the partition function and also the behaviors of the 
(hyper)polarizability for the external magnetic field. We compare the present results with 
those obtained by the conventional molecular orbital theory. © 1997 John Wiley & Sons, 
Inc. Int J Quant Chem 65: 697-707, 1997 


Introduction 

I n the context of quantum chemical calculations 
of the static (hyper)polarizabilities, we have 
two main approaches, i.e., the perturbation tech¬ 
niques and the derivative techniques. In the per- 
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turbation techniques, the perturbation energy is 
usually expressed as the summation over elec¬ 
tronic states (sum over states [SOS]). The hyperpo¬ 
larizability has been calculated by the perturbative 
approach with various approximate models [1-5]. 
The most useful advantage of the SOS technique in 
terms of the analysis of the nonlinear optical re¬ 
sponse is that it allows one to obtain a physical 
picture as to which excited states play a significant 
role in the response. On the other hand, the deriva¬ 
tive techniques rely on the fact that the static 
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(hyper)polarizability tensor components can be ob¬ 
tained as first- (second-, third-) order derivatives 
of a given dipole moment component with respect 
to the electric field. Those derivatives can be eval¬ 
uated numerically or analytically within the cou¬ 
pled Hartree-Fock scheme. The finite-field (FF) 
approach is used for the numerical procedure, 
originally proposed by Cohen and Roothaan [ 6 ], 
where the molecular Hamiltonian explicitly in¬ 
cludes a finite electric-field-dependent term. Sev¬ 
eral Hartree-Fock self-consistent-field or post- 
Hartree-Fock calculations need to be performed 
for different numerical values of the electric field. 
The total energy or dipole moment as a function 
of the different field values is then differentiated 
numerically. 

Many calculations of static (hyper)polarizabili- 
ties of several interesting materials have been re¬ 
ported. These results suggest that it is important to 
calculate them considering the effect of electron 
correlation using sufficiently large basis sets [7]. In 
particular, the second hyperpolarizability value, y, 
is very dependent on these factors. However, an 
accurate and practical computational method for 
determining the sensitive physical properties to 
the electron correlation of quantum many-body 
systems is still under study. 

In the last decades, there have been several 
attempts to apply the path integral method to 
calculating physical properties, with success for 
simple systems. The path integral was first intro¬ 
duced by Feynman [8-10]. The theory has been 
successfully applied to many problems [11-15] in 
quantum mechanics. Because the path integral is 
expressed as multidimensional integrals with a 
huge number of variables, we cannot directly com¬ 
pute this integration by ordinary ways of numeri¬ 
cal integrations. Therefore, a statistical method 
such as the Monte Carlo (MC) method [16-22] is 
attempted for the integrations. The standard ap¬ 
proach to the Monte Carlo simulation of field 
theory has been established through Feynman's 
path integral formulation. It is necessary to work 
at an imaginary time so that the integrands of the 
path integrals do not oscillate. It is also necessary 
to work with systems having a finite extent in both 
space and imaginary time. This is equivalent to 
finite temperature statistical mechanics. 

Recently, we also applied the path integral 
method to the many-electron systems in quantum 
chemistry [23-27]. The path integral formulation 
of the partition function was presented in terms of 
the LCAO coefficients or the Thouless parameteri¬ 


zation. We presented numerical calculations 
[28-31] of the ensemble average of the total energy 
and the magnetization by means of the MC 
method. We already reported briefly the numerical 
results of the calculations of the (hyper)polarizabil- 
ity for a H 3 system [32]. 

In this article, we show dependencies of the 
(hyper)polarizability on the external magnetic field 
for a hydrogen molecule using our path integral 
expressed by the generalized coherent state, where 
our model Hamiltonian includes the ab initio 
Hamiltonian and the electric- and magnetic-field- 
dependent terms. We also discuss relations be¬ 
tween (hyper)polarizability and spin states. 


Theory 


To apply a path integral method to molecular 
systems, the path integral formulation of the parti¬ 
tion function is derived by using the generalized 
coherent states. The Slater determinant | Z ) is writ¬ 
ten in terms of complex coefficients Z P T as 


|Z> = det(l + Z + Z) 1 


X exp 


a, p — (x, p 


M-N a N p 

Y T 7 ap a + a 

La La ^ph U pa U hp 

p = 1 h = 1 


\<t>) 


(i) 


= det(l + Z + Z) 


N p / 


x n n 

p~ a, p h = 1 


M-N„ \ 

<+ E E Z ph a pa l°>' 

<x= a, P p=\ I 

( 2 ) 


where | <f>) is a Slater determinant composed of N 
orbitals (hole-orbitals), N a means the number of 
hole-orbitals with a spin cr, and N is N a + Np. M 
is the number of the base. The p labels denote the 
particle orbitals that are orthogonal to the hole 
orbitals h, and Z is a matrix whose elements are 
Z c p r f r a+ a ( a i<T ) is the creation (annihilation) operator 
for the z-th orbital. These operators also obey the 
anticommutation relation. 

The partition function H is written as 

S - Tr[e^ H ]. (3) 

The symbol Tr indicates a trace operation, and 
/3 = ( k B T)~ l with the Boltzmann constant k B and 
a temperature of T Kelvin. 

A path integral representation of the partition 
function in terms of the generalized coherent states 
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yields 

H = / fl dg(Z k XZ k \Z k _ 1 ) 

J k= 1 

n 

I 

X exp 

with 


-tEH(Z,*,Z w ) 

/=1 


IZ 0 > = |Z„>, 


where 


(5) 


Kj = / drcpf a (r) 


■2 N„ 


2 m a _i lr - RJ 


•P/o-W/ 


( 12 ) 


(4) V$T = / dr 9«v(r)^ ; * (r')i^3^7i^ p (r')^ /( ,(r), 


(13) 


and 


A ,7 = / dr <pf a (r)p<p jlT ( r). (14) 


m-n„ Np rfZT* dZff 

w-cn n n -^h;— 

orp p= 1 Zl = l 


x|det(l + Z + Z)| N ( 6 ) 


and 


mZ*,Z k _ x ) = (ZJHIZ,.,). (7) 

r = /3/ft. ft is the number of the divisions, which is 
used as the parameter of the actual calculation of 
physical properties. From expression (4), we will 
derive an expression of the ensemble average of an 
operator such as the total energy or the dipole 
moment. 


MODEL HAMILTONIAN 

To apply the path integral expressed by the 
generalized coherent states to the quantum chem¬ 
istry, we consider the ab initio Hamiltonian in¬ 
cluding electric- and magnetic-field dependent 
terms for molecular systems. In the Born- 
Oppenheimer approximation, the Hamiltonian is 
written as 

H = H 0 + H 7 + H u , ( 8 ) 

where 


In Eq. (9), the first term represents the kinetic 
and the nuclear-electron interaction part and the 
second one represents the electronic interaction 
part. Hj and H n are the electric-field-dependent 
term and the Zeeman Hamiltonian, respectively, ijl 
is the dipole moment, and F indicates the electric 
field. fi Q and h 0 indicate the magnetic moment 
and the static external magnetic field, respectively. 
Here, we introduce an orthogonal basis set {<p/r)} 
such as a Hartree-Fock (HF) molecular orbital 
(MO). Therefore, a 7. (a ia ) becomes the creation 
(annihilation) operator for the z-th MO. S iz indi¬ 
cates a z-component of the spin angular momen¬ 
tum at the z-th site. In Eq. (13), R a is a spatial 
coordinate of the a -th nucleus, and N a , the number 
of the nucleus. Z a indicates the positive electron 
charge of the a-th nucleus. 

We derive the matrix elements (Z k \H\Z k _ 1 ) of 
the Hamiltonian in Eq. ( 8 ): 

H(Z*, Z t _a = H 0 (Z?, Z k _,) + H,(Z*, Z t _a 

+ H„(Z?,Z t _ 1 ). (15) 


ENSEMBLE AVERAGE 

The ensemble average of any operator O can be 
calculated: 


M 


h 0 = E E 

cr=a, (B i, j= 1 


M 


+ 


E 

E 

T7- crppcr + + 

v ijkl a ia a jp a kp a la 

, (9) 

a, p= a, (B i, 

j,k,l= 1 



H,= ~ 

E F 


( 10 ) 

<J 

= 0L, (B 




M 


H u = ~ 

- 2 ^o H 

o X! 

( 11 ) 


«o» = 


Tr[Oe _/3ff ] 
Tr[ e -/3H ] 


(16) 


f fl dttz k xz k \z k _ 1 


> 


1 " 


Xe-^HUtz t _o_ £ < Z jO|Z m _ 1 > 

/= 1 ft i 

m = 1 


i = 1 


(17) 
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In this article, we numerically calculate the ensem¬ 
ble average of the total energy ((H)) or that of 
the dipole moment (( fx)). 

It is impossible to directly calculate the ensem¬ 
ble average of physical properties in Eq. (17) with 
the matrix elements of the Hamiltonian in Eq. (15), 
because a huge amount of integral variables is 
generated. Therefore, the above integral is approxi¬ 
mated by other numerical methods such as the 
Monte Carlo method. 


Static (Hyper)polarizabilities 

We consider a Hamiltonian written as 

H — H' — F • /x, (18) 

where H f is the addition of the ab initio Hamilto¬ 
nian and the Zeeman Hamiltonian. If the second 
term is the perturbation term applied to the molec¬ 
ular system, the (hyper)polarizabilities are derived 
in terms of the perturbation procedure. Let us 
consider the molecule symmetric with respect to 
the x-direction. Then, the expression of the ensem¬ 
ble average of the dipole moment, ((fx v )( (v = 
x , y, z), becomes 


« IK» 


Tr[ /x„e pH '] 
Tr[e- pH '] 


(19) 


Similar to the above procedure, we finally have an 
explicit expression of the hyperpolarizability y xxxx : 


«0 3 .v» 


E Fdx f*d\' f y d\" 

0 0 

xTr[e-^- A) %e- (A - A '> H ^, ; 


Xe -( A ’- A ")H^ e -A»H^] 

XF v F V2 F„y Tr[e-* H '] 

- £ j P d ATr[e-^- A >Xe“ A V] 

V\, v 2 , 0 

xF Vi /Tr[e~ pH "] J^dX dX' 


xTr [ e -0- A )H^ e -( A - A ')H^ e -VH'] 


XF V2 F„ 3 /Tr[e- pH '], 


1 d\(0 3x )) 

« 7 » = - 
Fx:xxxf/ s 


6 dF x dF x dF x 


( 22 ) 

(23) 


It may be easily shown that these expressions 
are equal to those obtained from the conventional 
perturbation theories of, e.g., the SOS method and 
so on. 


Numerical Example 


According to the procedure of the linear response 
theory, we obtain an expression for ((O u )) to the 
first order in the dipole moment ijl v : 


«o lx »= E 

v= x, y, z 


J / 3 rfATr[e" (/ 3 - A)H ^e” AH )A I ] 


Tr[c (5H '] 


-p L 

v= x, y, z 


Tr[e pH 'ix„\ Tr[e pH 'n x ] 
Tr[e _/3H ] Tr[ e~ pH ] 


In this section we discuss the performances of 
numerical calculations of the ensemble average of 
the physical property by a Monte Carlo method 
based on this method. A novel numerical scheme 
is presented. The simplest example of a hydrogen 
molecule is examined. Results by this scheme are 
compared with those obtained from the conven¬ 
tional MO method. 


CALCULATING SCHEME AND MONTE CARLO 
PROCEDURE 


( 20 ) 


Therefore, the polarizability a xx is derived by 


«“rr» = 


*<o lx » 

dF, 


( 21 ) 


Figure 1 illustrates a numerical scheme to calcu¬ 
late the ensemble average of the physical property 
by a Monte Carlo method. We need the matrix 
elements, hf and V i ^f ,or / between orbitals in each 
term of Eqs. (12) and (13). Because our initial 
coherent state is constructed by the HF orbitals, 
matrix elements based on the HF orbitals must be 
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Selecting Basis sets 


Total Energy and Dipole Moment Without 
External Magnetic Field 


_1 

(hondo95.3) 

r 

Hartree-Fock(HF) Molecular Orbitals 
Molecular Integrals based on Basis sets 


(^Orthogonalizing ^ 
r 

Molecular Integ 

rals based on HF-MO 
— 


Division, Temperature 
Magnetic Field 


^Monte Carlo method 
^ (Metropolis algorithm) J 


t _ 

Total Energy 
Magnetization 
Polarization 
(Hyper)polarizability 


FIGURE 1. Scheme for calculating physical properties. 


generated. First, the HF molecular orbitals and the 
molecular integrals based on a basis set are ob¬ 
tained from HF calculations using the H0ND095.3 
program package [33]. Second, we convert them 
into the molecular integrals based on the HF or¬ 
bitals orthogonal to each other. Finally, the ensem¬ 
ble average of the physical property is calculated 
using the above molecular integrals, the number of 
the division n for the partition function, and a 
temperature based on a metropolis algorithm. The 
random number with the density |det(l + Z + Z)| N 
of the probability is generated by the rejection 
method. To avoid the negative sign problem, a 
reweighting method is attempted [34]. 

EXAMPLE 

As a numerical example, first, the total energy 
is calculated for a hydrogen molecule. The results 
are compared with those obtained from the con¬ 
ventional molecular orbital theory. Next, the calcu¬ 
lation results of the (hyper)polarizabilities are 
shown for the hydrogen molecule. Here, the de¬ 
pendence of the values on the number of the 
divisions is discussed. Dependence on the external 
magnetic field is also discussed. 


We consider a hydrogen molecule with an 
atomic distance of 1.46 au as shown in Figure 2. 
All calculations were carried using the 4-31G basis 
set. Figure 3 and Table I show the ensemble aver¬ 
age of the total energy vs. the number of the 
divisions for the partition function at temperature 
of 1 K. For 10 4 divisions, the ensemble average of 
the total energy becomes —1.1406 ± 0.0033 au. As 
shown in Table I, the CCSD total energy, which is 
equivalent to the full-CI energy in the case of the 
two-electron system, obtained by Gaussian 92 [35] 
is —1.15127 au. The total energy is expected to be 
equal to that of full-CI when the number of divi¬ 
sions becomes large. We also show the ensemble 
average of the dipole moment under the same 
condition in Figure 4. These results suggest that 
for the infinite divisions for the partition function 
at lower temperature calculations by the MC 
method cause errors. We expect that the ensemble 
average of the dipole moment value for each divi¬ 
sion number n will vanish at higher temperature. 
Note that the accuracy of calculations by the MC 
method is generally proportional to the square 
root of the number of the divisions. It means that 
precise calculations by the MC method takes us a 
long time. For this reason, the calculations for 10 4 
divisions were carried out to obtain qualitative 
effects of these physical properties. It is necessary 
to investigate further these effects with larger 
number of the divisions, n, and temperature for 
the quantitative study of these physical properties. 


z 



FIGURE 2. Employed model system of a hydrogen 
molecule. 
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FIGURE 3. Total energy vs. the number of divisions for the partition function. 


(Hyper)polarizabilities Without External 
Magnetic Field 

From Eqs. (21)-(24), a xx and y xxxx are calcu¬ 
lated in terms of the MC method. The present 
calculations were performed using the 4-31G basis 
set under a condition of a temperature of 1 K. The 


a xx and y xxxx values are also calculated for the 
H 2 molecule by the FF method with Gaussian 92. 
These calculations were made using the numerical 
derivative of the total energy with respect to the 
applied electric field. The results at HF and other 
correlation levels are shown in Tables II-IV. The 
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TABLE 

1 



Total energy. 


MC method 

MO method 

n 

Total energy (au) 

Method 

Total energy (au) 

1 

-1.1256 ± 0 

HF 

-1.126 

10 

-1.1264 + 0.0009 

MP2 

-1.143 

100 

-1.1285 ± 0.0014 

MP3 

-1.149 

1000 

-1.1336 ± 0.0024 

MP4DQ 

-1.150 

10,000 

-1.1406 ± 0.0033 

MP4SDQ 

-1.150 



CCSD 

-1.151 


polarizability a xx for each division number in the 
MC method is shown in Figure 5 and Table II. For 
the division number of 10 4 , the a xx value becomes 
6.675 ± 0.180 au. On the other hand, the CCSD 
method gives 6.407 au. As for the total energy 
value, we can also expect that by using a larger 
division number the polarizability value will vary 
from that with the HF calculation to that with the 
full-CI. Similarly, the hyperpolarizability y xxxx also 
shows a tendency similar to other physical proper¬ 
ties, which becomes the exact value at the limit of 
infinite division number as expected from Figure 6 
and Table III. 

(Hyper)polarizabilities Under External 

Magnetic Field 

We calculate the (hyper)polarizability of the hy¬ 
drogen molecule under the external magnetic field. 
These calculations were carried out using 4-31G 
basis set under a condition of division number of 
10 3 and temperature of 1 K. First, the variation of 
the a xx value with the magnetic field is shown in 
Figure 7 and Table IV. When the magnetic field is 
in the region from 0.0 to 1.5 au, the a xx value 


TABLE II_ 

Polarizability. 



MC method 

FF method 


n 

a xx (au) 

Method a xx (au) 

1 

6.962 ± 0.000 

HF 

6.924 

10 

6.988 + 0.121 

MP2 

6.665 

100 

6.845 ± 0.151 

MP3 

6.516 

1000 

6.785 ± 0.091 

MP4DQ 

6.440 

10,000 

6.675 + 0.180 

MP4SDQ 

6.449 



CCSD 

6.407 


TABLE III_ 

Hyperpolarizability. 


MC method FF method 


n 

%xxx (au) 

Method 

Txxxx (au) 

1 

-22.603 + 0.000 

HF 

-22.00 

10 

-19.456 ± 1.976 

MP2 

-17.39 

100 

-12.492 + 3.150 

MP3 

-12.79 

1000 

-10.979 ± 1.207 

MP4DQ 

-9.59 

10,000 

-9.707 ± 1.069 

MP4SDQ 

CCSD 

-10.21 

-8.72 


becomes about 6 au. However, when the magnetic 
field is over 2.0 au, the a xx value suddenly de¬ 
creases to about 3 au. This result can be explained 
as follows: As shown schematically in Figure 8, 
triplet energy level splits into three distinct levels 
by the external magnetic field (Zeeman splitting). 
Over the critical magnetic field, the lowest one of 
the three levels becomes lower than the singlet 
energy level (spin crossover). The sudden decrease 
in the polarizability described above may reflect 

TABLE IV_ 

Polarizability: (a) polarizability vs. magnetic field; 

(b) polarizability for the singlet or triplet spin state. 

(a) 


MC method 


HZ 

a xx (au) 

0.0 

6.899 + 0.036 

0.5 

7.175 ± 0.429 

1.0 

6.576 + 0.295 

1.5 

6.763 ± 0.610 

2.0 

3.207 ± 0.505 

2.5 

3.370 ± 0.295 

3.0 

3.303 ± 0.155 


(b) 

FF method 

a xx (au) 


Method 

Singlet 

Triplet 

HF 

6.924 

3.322 

MP2 

6.665 

3.297 

MP3 

6.516 

3.288 

MP4DQ 

6.440 

3.285 

MP4SDQ 

6.449 

3.285 

CCSD 

6.407 

3.283 


INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY 


703 






NAGAO ET AL. 



FIGURE 5. Polarizability vs. division number. 


this transition of the ground state. This explanation y xxxx value is around -11 au. On the other hand, 

is also supported by the results of independent the y xxxx value becomes larger at the region over 

calculations of a xx values for singlet and triplet the magnetic field of 1.0 au. The singlet and triplet 

states by the FF method shown in Table IV. Calcu- J xxxx values by the FF method are - 8.72 and 

la ted results for the hyperpolarizability y xxxx are 42.24 au, respectively, as shown in Table V. The 

shown in Figure 9 and Table V. When the mag- increase in the y xxxx values also can be considered 

netic field is in the region from 0.0 to 1.0 au, the to be the result of the transition of the ground state 


-8 


^ -10 
£ - 12 



-14 

-16 

-18 

-20 

-22 


-24 



FIGURE 6. Hyperpolarizability vs. division number. 
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H 

z 


FIGURE 7, Polarizability vs. static magnetic field. 


TABLE V_ 

Hyperpolarizability: (a) Hyperpolarizability vs. 
magnetic field; (b) hyperpolarizabilities for the 
singlet and triplet spin states. 


(a) 

MC metho 

Hz 

7xxxx 


0.0 

-11.492 ± 1.214 


0.5 

-13.672 ± 4.288 


1.0 

-11.004 ± 2.940 


1.5 

-8.797 ± 4.677 


2.0 

-1.922 ± 6.991 


2.5 

5.047 ± 10.684 


3.0 

8.239 ± 14.021 


(b) 

FF method 

Yxxxx (au) 


Method 

Singlet 

Triplet 

HF 

-22.00 

39.47 

MP2 

-17.39 

41.24 

MP3 

-12.79 

41.95 

MP4DQ 

-9.59 

42.15 

MP4SDQ 

-10.21 

42.02 

CCSD 

-8.72 

42.24 


between different spin states. These results quali¬ 
tatively show that the (hyper)polarizability under 
the external magnetic field is explained in terms of 
the relative stability of the singlet and triplet states. 
Note that the y xxxx value by the MC method in 
the region over the magnetic field of 2.5 au is less 
than that of the triplet state by the FF method 
because of using the division number of 10 3 . For 
more precise calculations, the division number 
should be larger than 10 3 . 


Concluding Remarks 

We presented a path integral formulation and 
the scheme to calculate the ensemble average of 


Energy 
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FIGURE 9. Hyperpolarizability vs. static magnetic field. 


physical properties in terms of the Monte Carlo 
method. We applied our method to calculations of 
total energy, dipole moment, and static 
(hyper)polarizability of a hydrogen molecule. The 
qualitative dependence of those physical proper¬ 
ties on the number of divisions for the partition 
function was shown, which indicates that various 
properties will converge to an exact value at the 
limit of larger division number. The qualitative 
dependence of physical properties on the external 
field is also shown. It is found that the static 
(hyper)polarizability value varies around the spin 
crossover. Our method can be a useful tool for 
investigating such phenomena as the property 
change under the transition process. In next step, 
we will reform our scheme to yield more quantita¬ 
tive results for investigating larger systems, chemi¬ 
cal catastrophe (dissociation), and so on. 
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ABSTRACT: Monte Carlo simulations of liquid formamide, N-methylformamide 
(MF), and N, N-dimethylformamide (DMF) have been performed in the isothermal and 
isobaric ensemble at 298 K and 1 atm, aiming to investigate the C—H-**0 and 
N—H O hydrogen bonds. The interaction energy was calculated using the classical 

6-12 Lennard-Jones pairwise potential plus a Coulomb term on a rigid six-site molecular 
model with the potential parameters being optimized in this work. Theoretical values 
obtained for heat of vaporization and liquid densities are in good agreement with the 
experimental data. The radial distribution function [RDF, g(r)] obtained compare well 
with R-X diffraction data available. The RDF and molecular mechanics (MM2) 
minimization show that the C—H ••• O interaction has a significant role in the structure 
of the three liquids. These results are supported by ab initio calculations. This interaction 
is particularly important in the structure of MF. The intensity of the N—H — O hydrogen 
bond is greater in the MF than formamide. This could explain some anomalous properties 
verified in MF. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 709-717, 1997 

Key words: Monte Carlo simulation; amides; hydrogen bond; radial distribution 
functions 


Introduction 

L iquids have particularly interested the chem¬ 
istry community due to their fundamental 
role in chemical and biological processes. There¬ 
fore, the study of their properties have attracted a 

International Journal of Quantum Chemistry, Vol. 65, 709-717 (1997) 
© 1997 John Wiley & Sons, Inc. 


good deal of attention, in spite of the difficulties 
involved. In the last decade, computational meth¬ 
ods such as molecular dynamics and Monte Carlo 
(MC) have become popular tools in this task [1-4]. 
These methodologies have shown to be specially 
useful to investigate some structural aspects re¬ 
lated with the liquid state. Nevertheless, the suc¬ 
cess of the investigation is strongly dependent on 
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the availability of intermolecular potential func¬ 
tions that yield a reasonable model for the liquid. 
In recent years, we have undertaken a theoretical 
investigation on the liquid properties emphasizing 
the liquid structure and solute-solvent interac¬ 
tions in binary mixtures [5]. The concept of hydro¬ 
gen bond (H bond) has been fundamental in un¬ 
derstanding the interaction between molecules, 
especially in aqueous systems. However, as yet, 
many aspects of this type of nonbonded interac¬ 
tion are not well known, in spite of all the work 
that has been done in that area. One point of 
dispute is the possibility of the C—H ••• O interac¬ 
tion be considered as H bond [6]. Taylor and 
Kennard have provided conclusive evidence of the 
existence of this interaction in crystals, in which 
the lengths of the C O distance are between 3.0 
and 4.0A, and the C—H---0 angle ranges be¬ 
tween 90° and 180° [7]. The energy of this interac¬ 
tion is small (2.0-20 kj/mol) [6], but there is little 
doubt that it has important implications in chem¬ 
istry, biochemistry, and crystal engineering, espe¬ 
cially when strong O—H • ♦ • O and N—H • • • O H 
bonds are absent [6, 8]. C—H***0 bonds may 
even distort the geometry of the strong H bond in 
specific instances. Anomalies in strong H bond 
patterns of certain crystal structures are explained 
on the basis of the interplay of strong and weak H 
bonds [8]. In general, the strength and effective¬ 
ness of C—H - OH bond depends on the C—H 
carbon acidity [9] and on the O atom basicity [10], 
which is enhanced by cooperative effects. The cri¬ 
teria for the significance of a particular C—H • • • O 
geometry have also been the subject of some dis¬ 
cussion. Recent analysis has shown that angular 
characteristics of the larger C—H • • • O contacts 
(C---0 3.5-4.0 A) and their effects on crystal 
structures resemble those with shorter contacts 
(3.0-3.5 A) [6]. It should be emphasized that the C 
—H * * O bond is not really a van der Waals con¬ 
tact but is essentially electrostatic. Therefore, since 
its strength falls off slower with distance than the 
usual van der Waals interaction, it is conceivable 
at distances equal to or larger than the van der 
Waals limit [8]. Thus, even long C ••• O separations 

o 

(about 4.0 A) must be considered, and one can 
expect that these weak H bonds have orienting 
effects on molecules, influencing in liquid struc¬ 
ture. 

Another type of H bond that plays a master role 
in biological chemistry is the N—H--’0 = C H 
bond. However, some aspects of its structure are 


not yet well understood. Bertolasi and co-workers 
have shown that the range of the N * • • O distance 
is between 2.7 and 3.3 A, while the N—H---0 
angle ranges between 144° and 175° [11]. The au¬ 
thors have concluded that the resonance is the 
most important factor determining the strength of 
the interaction and is responsible by the strong H 
bonds formed. 

The main goal of this study is to investigate the 
patterns of this interactions in liquid phase using 
computer simulations. For that purpose we have 
performed MC simulations [12] of the simplest 
liquid amides, like formamide, N-methylforma- 
mide, and N,N-dimethylformamide (DMF). Those 
molecules contain a carbonyl C—H group that 
could form C—H * • • O bond, and two of them also 
have a 0 = C—N—H group that could form an N 
—H-**0 = C bond. In the latest case the main 
feature is the capability of this group to present 
resonance. Besides, it is possible to investigate the 
role of the substitution on the donor capability of 
the amine group. 


Molecular Models 

A MC investigation using optimized potential 
for liquid simulations (OPLS) force field parame¬ 
ters [13] on a five-site model and a molecular 
dynamics simulation with test-particle model po¬ 
tential [14] on a six-site model has been published 
for formamide. Monte Carlo [13] and molecular 
dynamics [15] simulations have been performed 
for a five-site model of DMF using OPLS poten¬ 
tials. More recently, molecular dynamics simula¬ 
tion on the three liquids has been published using 
six-sites model in which the structure factors for 
the liquids were explored and compared with the 
experimental ones [16]. Having in mind our main 
purpose in this work, we have optimized a six-site 
model for those three liquids (the methyl groups 
are treated as a monatomic site) such that the 
H'-'O interaction in C—H---0 can be studied. 
This parametrization is also important to investi¬ 
gate the behavior of the aldehydic hydrogen in the 
amide-water solutions. In order to clarify the 
site-site intermolecular correlation obtained by the 
plots of the radial distribution function, we have 
investigated the dimer formation in the gas phase 
at molecular mechanics (MM2) level [17]. The 
molecules were modeled as rigid structures and 
therefore intramolecular relaxation effects are not 
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considered. As it is known these molecules do not 
present rotation around the C—M bond due to the 
resonance verified in the peptide bond. The molec¬ 
ular geometry parameters for the three molecules 
were taken from the Jorgensen and Swenson (JS) 
study [13]. The potential parameters were opti¬ 
mized for formamide and DMF and accordingly 
used in MF. The Lennard-Jones parameters used 
for aldehydic hydrogen were those developed by 
Flagler and Liphson [18] while the rest were opti¬ 
mized using those reported by JS [10] as a starting 
trial set. The comparison with the experimental 
values of density and heat of vaporization (A H v ) 
was the criterion used to optimize these parame¬ 
ters. The charge distribution was optimized to 
yield a dipole moment around 15-20% greater 
than the one in the gas phase. This procedure has 
been used as a criterion to compensate for neglect¬ 
ing polarization effects in fluid simulations [13,19, 
20]. In formamide we have started with the charges 
published by Hagler and Liphson [18] (the point 
charges on amine sites are very similar to those 
used by JS [13]). For the DMF we have used the 
aldehydic charges obtained for formamide and the 
charges published by JS for the amine group [13]. 
The optimization of the charges was made such 
that the aldehydic and amine groups were set to 
be zero. I kept the charge distribution on the alde¬ 
hydic group the same for all studied molecules 
and changed only the charges on the amine group. 
All the approximations above are criteria to mini¬ 
mize the number of potential parameters. In Table 
I, I present the potential parameters used for the 
simulations of the three amides. The results from 
this work attest the quality of the Lennard-Jones 
potential parameters reported by JS [13]. 

A calculation of the partial charges was made 
with 6-31G* basis set using the CFiELPG proce¬ 
dure included in the Gaussian 92 program [21] as 
discussed elsewhere [22], maintaining fixed our 
molecular geometries. In Table II, I present the 
results obtained in this work. 

It is noticed that the charges calculated with the 
CHELPG method are somewhat different from 
those I have used in the simulation. The main 
difference is in the charges of the carbonylic car¬ 
bon and the nitrogen. However, the dipole mo¬ 
ments are quite similar in both calculations. There¬ 
fore, I conclude that the dipole moment plays an 
important role in the experimentally observed A 
and liquid densities. It is interesting to notice that 
the derivation of partial charges from molecular 
orbital calculations depends on the various as¬ 


sumptions in the molecular orbital theories, while 
that optimized by MC simulations depends on the 
fitting of experimental results with the intermolec- 
ular force field used. 


Computational Aspects 

INTERMOLECULAR POTENTIAL FUNCTION 

To determine the potential topology, the energy 
E ah between molecules a and b has been repre¬ 
sented by a sum of Coulomb and Lennard-Jones 
potentials centered on the sites: 

E ab = E [ A tj /r]f - Bjj/rfj + , (1) 

y 

where r x - ] is the distance between site i in a and 
site j in b and and q- are fractional point 
charges located on i and j molecular sites (it has 
been supposed that the electrostatic contribution 
to the intermolecular potential is suitable repre- 


TABLE I_ 

Potential parameters for the amides used in 
this work. 


Sites 

9 

a (A) 

e (kcal / mol) 


Formamide ( 

/x = 4.2 D) 


C 

0.34 

3.75 

0.105 

0 

-0.46 

2.96 

0.21 

H(C) 

0.12 

2.75 

0.038 

N 

-0.830 

3.25 

0.17 

H(N) 

0.415 

0.00 

0.00 


MF (n = 

4.3D) 


c 

0.34 

3.75 

0.105 

0 

-0.46 

2.96 

0.21 

H(C) 

0.12 

2.75 

0.038 

N 

-0.70 

3.25 

0.14 

Me 

0.285 

3.80 

0.13 

H(N) 

0.415 

0.00 

0.00 


DMF (fi = 

= 4.4D) 


c 

0.34 

3.75 

0.105 

0 

-0.46 

2.96 

0.21 

He 

0.12 

2.75 

0.038 

N 

-0.57 

3.25 

0.14 (0.17) 

Me 

0.285 

3.80 

0.13 (0.17) 


The values between brackets are those reported by Jor¬ 
gensen and Swenson [13]. The unit of t± in Debye (the 
values are those ones obtained with the charges presented). 
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TABLE II_ 

Partial charges located on the amides sites 
obtained with 6-31G* basis functions. 


Formamide 
(fi = 4.34) 

MF 

(/x = 4.34) 

DMF 

(fi = 4.35) 

C = 0.752 

C = 0.565 

C = 0.463 

O = -0.630 

O = -0.586 

0= -0.595 

H(C) = -0.003 

H(C) =0.017 

H(C) = 0.051 

N = -0.970 

N = -0.544 

N = -0.143 

H(N) = 0.401 

H(N) =0.303 

CH 3 = 0.153 

H(N) = 0.449 

CH 3 = 0.246 

CH 3 = 0.072 


Unit of jjL in Debye. 


sented by fractional charges located on the molecu¬ 
lar sites). For each site k, the parameters A kk and 
B kk were given by A kk = 4s k a k 12 and B kk = 4 s k a k 6 , 
where s k and a k are the Lennard-Jones parame¬ 
ters for the fcth site. Parameters A }j and B,- ■ for a 
nondiagonal interaction [i,j] were obtained using 
the geometric combining rules A ; --) 1/2 

and B /; = (B u [23]. This approach has been 
frequently used in this field of study. In an earlier 
work we have shown that the difference in the 
results obtained using the geometric combining 
rule above and well-know Lorentz-Berthelot rule 
is negligible [5]. 

MONTE CARLO SIMULATIONS 

MC simulations were carried out in the isother¬ 
mal and isobaric ensemble at 298.15 K and internal 
pressure 1 atm, employing Metropolis importance 
sampling [24] and cubic box boundary conditions. 


All the systems investigated have consisted of 256 
monomers in a cubic cell. In our earlier work we 
have demonstrated that the number of molecules 
in the box, does not imply significant changes in 
the results obtained [5]. In the calculation of con¬ 
figurational energy using Eq. (1), the Coulombic 
term was calculated with Ewald sum [23]. Starting 
from the initial distribution of monomers, a new 
configuration was generated by randomly translat¬ 
ing and rotating a randomly chosen molecule along 
Cartesian coordinates. Ranges for translations and 
rotations of the monomers and volume changes 
were adjusted to yield an acceptance/trial ratio of 
about 0.45 for new configurations. After a volume 
movement the coordinates of the center of mass of 
all molecules in the reference box were scaled in 
an usual way [23]. Each calculation started with an 
equilibration phase of 1.2 X 10 6 configurations fol¬ 
lowing by an average segment consisting of 8 X 10 6 
configurations. Statistical uncertainties were calcu¬ 
lated from separate averages over blocks of 2 X 10 5 
configurations. The calculations were carried out 
on Silicon Graphics Power 4D and IMB RISC 6000 
workstations. 


Results and Discussion 

THERMODYNAMIC 

In Table III, I show the thermodynamic results 
obtained for the three amides. The results agree 
very well with those reported by JS for formamide 
and DMF [13] and a good agreement is noted 
between theoretical and experimental values. As 
already discussed above, the potential parameters 


TABLE III_ 

Values of thermodynamics properties of the pure liquids. 3 


Liquid 

5 

<1 

-E, 

V 

P 

This work 

Formamide 

14.79 ± 0.02 

14.20 ± 0.02 

40.93 

1.10 + 0.003 

MF 

13.77 ± 0.02 

13.18 ± 0.02 

59.32 

1.00 ± 0.003 

DMF 

11.40 ± 0.03 

10.81 ± 0.03 

78.54 

0.93 ± 0.003 

Experimental values 5 

Formamide 

14.70 

14.11 ± 0.02 

39.89 

1.13 

MF 

13.77 

13.18 + 0.02 

57.56 

1.02 

DMF 

11.10 

10.51 ± 0.03 

77.66 

0.94 


a Heat of vaporization (A H v ) and intermolecular energy (E,) in kcal / mol. The heat of vaporization was estimated by A H v = + RT. 

Mole volume (V) in cm 3 /mol (the experimental one is calculated with experimental, density and molecular weight). Density (p) in 
g/cm 3 . 

b AH v and p obtained from Ref. [25]. 
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were optimized for formamide and DMF and used 
in methylformamide. As one can notice, these pa¬ 
rameters reproduce well the experimental thermo¬ 
dynamic properties of the liquids. 

RADIAL DISTRIBUTION FUNCTIONS AND 
HYDROGEN BOND 

Formamide 

The radial distribution functions (RDF) com¬ 
puted for formamide are shown in Figure 1. Quali¬ 
tatively, our results are similar to those reported 
by JS [13]. The small differences in the height of 
some peaks can be attributed to differences in the 
charges located on the sites. The closest inter- 
molecular correlation verified in the RDF is be¬ 
tween the oxygen and the hydrogen atoms bonded 

0 

to nitrogen located at 1.9 A. This correlation has 
been verified by different experimental techniques 
and reported earlier by Kalman and co-workers 
[26]. It is also very close to the smaller distance 
reported by Bertolasi and co-workers to the H 
bond in the molecules investigated by them [11]. 
In the same way, the N—O correlation at 2.9 A 
also agrees with the distance determined experi¬ 
mentally by Kalman and co-workers [26], and it is 
in the range of distances reported by Bertolasi and 
co-workers to the N—FI 0 = C FI bond [11]. Ac¬ 
cording to these authors, these distances are char¬ 
acteristic of a strong H bond between the molecules 
and are verified preferentially in molecules that 
present resonance. It is clear from the height and 
the peak positions of the two amine hydrogen that 
there is no strong preference between them in the 
hydrogen bond formation. 


Figure 2 shows the only two dimers found with 
the minimization procedures that agree with the 
g(r) plots. The legend show the most important 
geometric parameters to our discussion. As one 
can observe, the g(r) plots are a combination of 
these dimers. This is an indication of the existence 
of both of them in the liquid. This coexistence is 
stressed by the 0/H(C) RDF which presents two 
significant peaks at 2.9 and 4.9 A. The form of the 
N/N g(r) is also due to the coexistence of both 
dimers in the liquid. The N—H ♦*• O angle in dimer 
a is roughly 157°, which agrees with the values 
reported by Bertolasi and co-workers for the N ■ • • O 
distance verified in this correlation [11]. Ohtaki 
and co-workers have optimized formamide dimers 
with ab initio LCGO-MO-SCF with a minimum 
base set (2s/lp) [27]. They have obtained that the 
cyclic dimer is the most stable, which is also the 
case in our study, but the authors have concluded 
that the trans dimer is more probably in the liquid 
because this dimer favors the formation of open 
chains, which stabilize the liquid. Those authors 
have also optimized the dimer forming H bonds 
between oxygen and aldehydic hydrogen and have 
founded that this dimer presents a small stability 

o 

with the equilibrium distance O—FI of 3.2 A, 
which is very close to the first maximum of the 
0/H(C) RDF observed in Figure 1. The C - O 
lengths and C—H • • • O angles reported here also 
agree with the values presented by Desiraju [6] as 
characteristic of C—H---0 H bonds. Then the 
results are consistent with the existence of the 
C—H ■••OH bond in the liquid formamide. This 
correlation must contribute to increases the stabil¬ 
ity of the trans dimer (dimer a). Then, our six-site 




FIGURE 1. Plots of radial distribution function for formamide. H c and H ( are the aminic hydrogens c/s and trans to 
the oxygen, respectively. 
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FIGURE 2. The two formamide dimers optimized at MM2 level that agree with g(r). Geometric parameters of dimer a: 
r(0/Ht) = 2.0 A; r(0/Hc) = 3.6 A; r(0/N) =3.0 A; r(0/C) =3.5 A; r(0/H(C)) = 3.0; COH angle = 152°; NHO 
angle= 148.6°; CHO angle = 102.6°. Geometric parameters of dimer b: r(0/Ht) = 3.9 A; r(0/Hc) = 2.0 A (for the two 

H-bond); r{ 0/ N) = 3.0 A; r(0/C) = 3.8 A; COH angle = 120° (for the two H-bond); NHO angle = 178°. H c and H, are 
the aminic hydrogens cis and trans to the oxygen, respectively. 


model can identify the two types of correlation 
discussed in the introduction present in liquid 
formamide. Based in this work and in the early 
ones [13, 27], we consider that the structure of the 
liquid formamide consists of the two dimers 
showed in the Figure 2. Given the discussion 
above, I consider the position that the formamide 
is formed of flexible chains of dimer a with cyclic 
dimers "immersed" in it [27] seems to be reliable. 

A^,iV-Dimethylforniamide 

Unlike the formamide, no distinct peak has been 
verified in X-ray study of DMF [28]. As a conse¬ 
quence, there is more interest in simulating this 
liquid since, of course, it forms a condensed phase. 
In Figure 3 is shown the RDF computed for the 
liquid. Qualitatively, the g(r) graphs obtained here 
are equivalent to those obtained by JS [13]. As in 
the case of formamide, there are little differences 


in the height of the peaks, which can be attributed 
to differences in the charges located on the sites. 
The present g(r) results show a greater definition 
for the shoulders in the oxygen-carbon correlation 
then in the JS study [13], as consequence of the fact 
that I recognize C and H as different sites instead 
of a single site. The g(r) plots show that the 
shortest intermolecular distance in liquid phase is 
not between oxygen and methyl groups, as was 
reported in the JS study [13], but between oxygen 
and the aldehydic hydrogen. Figure 4 presents the 
two dimers found for which the atom-atom dis¬ 
tances better reproduce the g(r) plots obtained. 
From these dimers it is clear that there is a signifi¬ 
cant correlation between oxygen and the hydrogen 
bonded to the carbonyl group. This correlation 
must contribute for the correlation between oxy¬ 
gen and trans methyl to be more intense than 
oxygen and cis methyl. The first peak in the corre¬ 
lation oxygen-carbon occurs in the dimer involv- 




FIGURE 3. Plots of radial distribution function for N, A/-dimethylformamide. Me(c) and and Me(t) are the aminic 
methyls cis and trans to the oxygen, respectively. 
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FIGURE 4. The two DMF dimers optimized at MM2 level that agree with the g(r) plots. Geometric parameters of the 
dimer a: r(0 / Me(c)) = 3.5 A; r(0 / C) = 5.4 A; r(O / H) = 6.5 A; r(N/C) = 5.0 A- r(N / N) = 5.7 A- r (O / N) = 4.7 A* 
C(CH 3 )0 angle =158°. Geometric parameters oHhe dimer b: r(0/Me(t)) = 3.5 A; r(0/C) = 3.8 A; r(0/H) = 2.8 A; 
r( N/C) = 5.3 A; r(N / N) =5.8 A; r( O/N) = 4.0 A; COH angle = 143°; CHO angle = 144°. Me(c) and Me(t) are the 
aminic methyls c/'s and trans to the oxygen, respectively. 


for liquids with ab initio results for the gas phase. 
The existence of these two dimers in the liquid 
phase agrees with the structure proposed by Ohtaki 
and co-workers [28, 29] of a random distribution of 
molecular dipole orientation. 

/V-Methylformamide 

In the simulation of these molecules we have 
two possibilities for the position of the methyl and 
hydrogen sites of the amine group. Each one can 
be cis or trans to oxygen. The hydrogen trans 
conformer is predicted to be 4.2-5 kj/mol more 
stable than the cis one. Experimentally the relative 
concentration of the cis conformer is found to be 
around 10%, which is in agreement with the stabi¬ 
lization energy [30]. In this work, I have modeled 
the molecule with the hydrogen trans to the oxy¬ 
gen. Figure 5 shows the RDF graphs obtained for 


ing the oxygen-hydrogen correlation while the 

o 

peak at 5.5 A is due to the dimer involving the cis 
methyl. The second peak in the O/C correlation, 
more intense than the first peak, is a clear indica¬ 
tion of the existence of two different dimers in the 
liquid bulk. Also in this case, the C * • • O lengths 
and C—H • * • O angles reported for dimer b agree 
with the values presented by Desiraju as being 
characteristics of C—H---0 hydrogen bonds [6]. 
Ohtaki and co-workers [28] have made an ab initio 
MO-SCF calculation for the dimer bonded by 
C = 0—H(C) and found a stabilization energy of 
— 4.3 kcal/mol and a bond distance O—H of 3.2 
A. Based in the Desiraju study [6] and in the 
hydrogen bond energy of water (7.2-9.6 kcal/mol), 
this energy seems to be too high. Our results show 
that the distance of this interaction is also lower 
than that reported by those authors [28]. One 
should keep in mind that we are comparing results 



INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY 


715 








CORDEIRO 


H 


& 






& 


H 


H 



c 

■ 

jt!r 

te* - 


fe j 

J : 

■sU? 

N 

T? 

SfSiv,. ‘ 

v 


r 




ch 3 

% ■ / 


CHL 




N 


H 


& 


O 


FIGURE 6. Methylformamide dimer optimized at MM2 
level. Geometric parameters of the dimer: r(0-H(N)) = 
2.0 A; r(0-CH 3 ) = 3.6 A; r(O-C) =3.4 A; r(0-H(C)) = 
3.0 A; r(N-O) = 2.9 A; r(N-C) = 3.9 A, 4.7 A; N-H-0 
angle = 156°; C -H • • • 0 angle = 104°. 


this liquid, and Figure 6 shows the only dimer that 
we succeeded to optimize that agrees with the 
RDF graphs presented. 

Thus for this liquid, the intermolecular interac¬ 
tion between oxygen-amine hydrogen with hydro¬ 
gen bond formation prevails. The 0-**H(N) and 
N O distance and N—H ••• O angle agree with 
this [8]. The weak correlation between oxygen and 

'o 

hydrogen bonded to the carbonyl around 3 A must 
contribute to the dimer stabilization. The O • • • H(C) 
length and C—H---0 angle agree with that re¬ 
ported by Desiraju for the H bond between this 
atom [6]. It is interesting to notice that the substi¬ 
tution of a hydrogen by a methyl group from 
formamide to MF make the aminic hydrogen in 
MF more acidic than in the formamide, due to the 
stabilization of the methyl group in the change of 
nitrogen from sp 3 to sp 2 . Then the O--H(N) hy¬ 
drogen bond must be stronger. This dimer agrees 
with the structure proposed by Ohtaki and co¬ 
workers [31] and Neuefeind and co-workers [32] of 
linear chains with preferred parallel orientation of 
the molecular moment. 


Conclusions 

Clearly the properties of formamide and N- 
methylformamide are a consequence of the hydro¬ 
gen bond between oxygen and aminic hydrogen. 
DMF does not present this correlation. N-methyl- 
formamide presents an O H(N) H bond stronger 
than formamide, with the consequence than the 
aminic hydrogen is more acidic in the first one. So, 
MF must be more structured around this hydrogen 


bond than formamide. Other properties to be no¬ 
ticed are the melting and boiling points of for¬ 
mamide, MF, and DMF which are 2.55 and 111°C, 
-3.86 and 185°C, and —60.48 and 155°C, respec¬ 
tively [25]. Then, clearly MF presents an anoma¬ 
lous behavior, a consequence of a greater struc¬ 
turation of the liquid phase in which the O • • • H(N) 
plays a very important role. On other hand, for¬ 
mamide and DMF are completely soluble in water, 
while MF is not [25]. The more intense correlation 
in DMF is between the oxygen and methyl group 
located in the trans position with respect to oxy¬ 
gen, but the O • • • H(C) is the closest. Ab initio 
calculations on the dimer stabilization energy of 
oxygen-aldehydic hydrogen interaction from for¬ 
mamide and DMF give values of -2.0 and -4.3 
kcal/mol, respectively [26, 27]. This correlation in 
DMF is more stable than in the other two liquids 
and must have a greater significance in the liquid 
structure. In all studied liquids, the length, angles, 
and energetic analysis made on the O • • * H(C) per¬ 
mit the conclusion, based in the Desiraju study [6], 
that there is hydrogen bond formation in the liq¬ 
uids. Liquid formamide is made of flexible chains 
in which are immersed cyclic dimers. Liquid DMF 
is constituted of two dimers roughly dispersed in 
the liquid phase. MF is constituted of linear chains 
of one dimer only. 
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ABSTRACT: The molecular mechanism for the gas-phase addition of 
organomagnesium reagents: CH 3 MgCl, 2CH 3 MgCl, (CH 3 ) 2 Mg / and (CH 3 ) 2 Mg plus 
Cl 2 Mg, to 2-hydroxypropanal as a model of chiral a-alkoxy carbonyl compounds is 
investigated at the 6-31G* basis-set level of calculation. An extensive exploration of the 
reactive potential energy surface was carried out in order to locate and characterize the 
stationary points. The geometry of stationary points and the harmonic vibrational 
frequencies, transition vectors, and electronic structure of the transition structures were 
obtained. The theoretical results are analyzed, discussed, and compared with previous 
theoretical and available experimental data. The first step corresponds to the exothermic 
formation of the chelate complexes without an energy barrier. These stationary points 
correspond to puckered five-membered rings, determining the stereochemistry of the 
global process, which is retained throughout the reaction pathway. For the reactions of 
one equivalent of an organomagnesium compound [CH 3 MgCl or (CH 3 ) 2 Mg], the second 
step for the intramolecular mechanism is associated to the C—C bond formation via 
1,3-migration of the nucleophilic methyl group from the organomagnesium compound to 
the carbonyl carbon and the corresponding transition structure can be described as a 
four-membered ring, the anti attack being the most favorable pathway. CH 3 MgCl is a 
more powerful quelant agent than is the (CH 3 ) 2 Mg system. Therefore, the reaction 
pathway associated to the nucleophilic attack of CH 3 MgCl + 2-hydroxypropanal presents 
a larger barrier height than that of (CH 3 ) 2 Mg + 2-hydroxypropanal addition. The 
inclusion of a second equivalent corresponding to the 2CH 3 MgCl and (CH 3 ) 2 Mg + 

Cl 2 Mg systems yields an intermolecular mechanism, the barrier height decreases, and 
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the process can be considered as an assisted intermolecular mechanism: The first 
equivalent forms the chelate structure and the second one carries out the nucleophilic 
addition to the carbonyl group. The most favorable pathway corresponds to an 
intermolecular mechanism via an anti attack for the addition of 2CH 3 MgCl. © 1997 John 
Wiley & Sons, Inc. Int J Quant Chem 65: 719-728, 1997 

Key words: chelate complex; organomagnesium reagents; chiral a-alkoxy carbonyl 
compounds; enantioselective additions; intra- and intermolecular mechanisms 


Introduction 

D uring the last several years, there has been a 
notable increase in research on organomag¬ 
nesium alkylation reactions [1-3]. Detailed knowl¬ 
edge of such reactions is relevant not only for the 
understanding of the addition mechanism but also 
for the development of enantioselective synthesis 
via the presence of chiral centers, since the latter is 
necessary to avoid formation of undesirable di- 
astereoisomers. Face-selective nucleophilic addi¬ 
tion of Grignard reagents to a trigonal carbon such 
a carbonyl group may create a new chiral center of 
high diastereoisomeric or enantiomeric purity with 
practical significance in synthetic organic chem¬ 
istry [3-7]. However, there is still no general 
agreement concerning the nature of the molecular 
mechanism for this type of chemical reaction 
[1-3,8-10]. 

Explaining experimental behavior has been a 
challenging problem in theoretical physical chem¬ 
istry. The quantum mechanical techniques are used 
to characterize the potential energy surface (PES) 
representing a chemical reaction [11,12]. The anal¬ 
ysis of this PES provides molecular geometries, 
vibrational frequencies for reactants, products, in¬ 
termediates, and transition structures (TSs). From 
these data, the barrier heights and rate constants 
for the reaction pathways connecting the reactants 
with the products via the corresponding TSs can 
be calculated. An understanding of the nature of 
the TS for a reaction can be valuable for practical 
as well as theoretical reasons. We are engaged in a 
research program to explore new possibilities in 
asymmetric synthesis from both experimental and 
theoretical viewpoints. We describe herein our the¬ 
oretical work on the analysis of nucleophilic addi¬ 
tion of organomagnesium reagents [CH 3 MgCl, 
2CH 3 MgCl,(CH 3 ) 2 Mg and (CH 3 ) 2 Mg plus Cl 2 Mg] 
to 2-hydroxy prop anal in the gas phase as a model 


of chiral a-alkoxy carbonyl compounds. The main 
objective is the characterization of the reactive 
potential energy surface, to obtain the nature of 
the molecular mechanism for these chemical reac¬ 
tions and to rationalize experimental observations. 


Methods and Models 

The calculations were performed with the 
GAUSSIAN94 program [13]. The ab initio calcula¬ 
tions were made at the HF/6-31G* [14-18] level. 
This basis-set representation yields reasonable re¬ 
sults in related theoretical studies [4,19-22]. 

The exact characterization of the TSs was 
achieved by using a simple algorithm [23-26], in 
which the coordinates describing the system are 
separated into two sets: the control space, which is 
responsible for the unique negative eigenvalue in 
the respective force constant matrix associated with 
the transition vector (TV) [27], and the comple¬ 
mentary space, which includes the remaining coor¬ 
dinates. The geometry optimizations were carried 
out alternatively on each subspace, one at a time, 
until a stationary structure was obtained. Finally, a 
complete analytical optimization was achieved for 
the whole space of all variables. The Berny analyti¬ 
cal gradient optimization routines [28,29] were 
used. The nature of each stationary point was 
checked by diagonalizing the mass-weighted Hes¬ 
sian matrix to determine the number of imaginary 
frequencies (zero for local minima and one for the 
TS). The TSs were verified to be the right ones by 
tracing the intrinsic reaction coordinate (IRC) [30] 
path from each TS to the two lower-energy struc¬ 
tures it connects by using the second-order 
Gonzalez-Schlegel integration method [31,32]. In 
the present work, the CH 3 MgCl (I), 2CH 3 MgCl 
(II), (CH 3 ) 2 Mg (III), and (CH 3 ) 2 Mg + Cl 2 Mg (IV) 
organomagnesium reagents were selected while 
the 2-hydroxypropanal compound was employed 
to represent the chiral a-alkoxy carbonyl com¬ 
pounds. 
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Results and Discussion 

To discriminate between alternative reaction 
channels, a characterization of the PESs including 
the location of stationary points is essential. An 
extensive exploration of the different PESs has 
rendered two minima (chelate complexes and 
products) and one TS. The addition of CH 3 MgCl 
to the carbonyl group of 2-hydroxypropanal has 
rendered two competitive reaction channels: the 
anti and syn attacks. The corresponding chelate 
complexes, ICha and IChs, TSs associated with the 
1,3-migration of the methyl group, ITSa and ITSs, 
and the corresponding products, IPa and IPs, re¬ 
spectively, are localized on the PES. A schematic 
representation of the structures of these stationary 
points is shown in Figure 1. 


The stationary points for the addition of two 
equivalents of organomagnesium reagents, 2CH 3 - 
MgCl, to 2-hydroxypropanal are depicted in Fig¬ 
ure 2. The addition processes take place along an 
intermolecular mechanism via attack on anti and 
metal-chelated anti conformations, IlChaa, IITSaa, 
and IlPaa; attack on anti and metal-chelated syn 
conformation, IlChas, IITSas, and IlPas; attack on 
syn and metal chelated anti conformation, IlChsa, 
IITSsa, and UPsa; and attack on syn and metal- 
chelated syn conformation, IlChss, IITSss, and 
IIPss, respectively. 

The pathways corresponding with the addition 
process of (CH 3 ) 2 Mg to 2-hydroxypropanal render 
six stationary points along anti and syn chelation 
processes, IHCha, IHChs, IIITSa, IIITSs, IUPa, 
and IIIPs, as shown in Figure 3. Finally, an inter¬ 
molecular process corresponding to the addition of 



IChs 

ITSs 

IPs 

-26.4 

-4.2/22.1 

-54.7 


FIGURE 1. Reactants, chelate complexes, TSs, and product structures for the anti (upper path) and syn (lower path) 
addition of CH 3 MgCI to 2-hydroxypropanal. Relative energies, including zero-point energies and thermal corrections, in 
kcal/ mol, to reactants (total energy = -965.439246 au) and barrier heights, in kcal/mol (after the slash). The arrows 
on the TSs represent the reaction coordinate vectors. The atom labels are included. 
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HPaa 


- 95.6 


IlPas 


- 101.8 


nPsa 


- 98.7 


IIPss 


- 97.5 


FIGURE 2. (a) Reactants, chelate complex, TSs, and products for the anti and syn addition of two equivalents of 
CH 3 MgCI to 2-hydroxypropanal. Relative energies, including zero-point energies and thermal corrections, in kcal/mol, 
to reactants (total energy = -1664.166056 au) and barrier heights, in kcal/mol, (after the slash). 


(CH 3 ) 2 Mg + Cl 2 Mg to 2-hydroxypropanal was 
considered. In this case, the molecule of Cl 2 Mg 
forms the chelate structure and the molecule of 
(CH 3 ) 2 Mg carried out the nucleophilic attack on 
the carbonyl group. The corresponding stationary 
points are presented in Figure 4: IVCha, IVChs, 
IVTSa, IVTSs, IVPa, and IVPs. 

A schematic representation of the structures of 
the stationary points is shown in Figures 1-4. The 
completely optimized geometries are available 
from the authors on request. An analysis of the 
results reveals slight differences depending on the 
model system. The chelate structures are puckered 
five-membered rings; the metal is coordinated to 
the lone pair of the carbonyl oxygen and the 
methoxy oxygen is not pyramidalized. Similar 
structures have been found for related chelates 
using X-ray methods [33] and related theoretical 
calculations [4,34,35]. There is a correlation be¬ 


tween reactivity and stereoselectivity that demon¬ 
strates the mechanistic importance of chelation, the 
chelate structure being maintained throughout the 
reaction path. 

The intramolecular conversion of the chelate 
complex to the product takes place via the corre¬ 
sponding TS with retention of the chelate configu¬ 
ration. For CH 3 MgCl and (CH 3 ) 2 Mg systems, the 
nucleophilic methyl group of the organomagne- 
sium compound is located over or above the 
chelate ring in the TS structure. The steric interac¬ 
tions with the a-substituted group, CH 2 R 3 , are 
avoided in the case of anti attack, which is consis¬ 
tently favored. 

The inclusion of a second CH 3 MgCl molecule 
yields an assisted intermolecular mechanism, 
where the first molecule forms the chelate struc¬ 
ture and the second CH 3 MgCl molecule carries 
out the nucleophilic addition to the carbonyl group. 
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IITSaa 



IITSas 



IITSsa 



FIGURE 2. (b) A schematic representation of the TSs for the anti and syn addition of two equivalents of CH 3 MgCI to 
2-hydroxypropanal. The arrows on the TSs represent the reaction coordinate vectors. The atom labels are included. 


This fact is in agreement with experimental data 
reported by Reetz et al. [36], 

The addition of CH 3 MgCl to 2-hydroxypro¬ 
panal presents a barrier height of 21-22 kcal/mol, 
the products being the most stable species (28-30 
kcal/mol below the chelate complexes). In this 
case, both methods render that the anti attack is 
the most favorable pathway. The ratio between the 
product obtained by means of an anti attack with 
respect to that obtained by a syn attack was calcu¬ 
lated in our early work [37] assuming a Boltz¬ 
mann's distribution of the TS leading to the two 
diastereomers. However, the presence of a transi¬ 


tion structure controlling the interconversion be¬ 
tween both chelate complexes, TSc, on the PES 
opens up the possibility of an equilibrium between 
both chelate complexes, and according to the 
Curtin-Hammett principle [38-40], the product 
ratio is therefore not determined by the relative 
energies of the TSs with respect to their corre¬ 
sponding chelate complexes but, instead, depends 
primarily on the relative energy of the two TSs 
with respect to each other. In this case, a complete 
kinetic analysis gives [22,41] the following prod¬ 
uct ratio equation: 


[IPa] 

Tips! 


k ia (l + (E e A E ich«/ R T/Y, e AElclis ^ RT )) + k s T, e AE icii«/ RT /Y,e ^ E ic’'^ RT 
k ns ( 1 + (E e~ A E i c '>»/ Rr /£ e - &E ici, s / RT )) + ^ 


( 1 ) 


INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY 


723 













OLIVA ET AL. 



IIITSa 


- 1 . 9 / 14.7 


IllPa 

- 54.6 



IHChs 

- 16.6 - 1 . 1 / 15.5 - 52.9 

FIGURE 3. Reactants, chelate complexes, TSs, and products for the anti (upper path) and syn (lower path) addition 
of (CH 3 ) 2 Mg to 2-hydroxypropanal. Relative energies, including zero-point energies and thermal corrections, in 
kcal / mol, to reactants (total energy = “545.435254 au) and barrier heights, in kcal / mol (after the slash). The arrows 
on the TSs represent the reaction coordinate vectors. 






To do such calculations, the rate constants k a , k s , 
k as , and k sa (Fig. 1) must be found, this can be 
achieved by means of the TS theory (TST) [42-44], 
assuming that the transmission coefficient is equal 
to 1. A pressure and temperature of 1 atm and 
195.15 K was taken in the S and k calculations, 
respectively. The product ratio for the diastere- 
omers obtained by means of the anti attack of the 
organomagnesium reagent methyl group with re¬ 
spect to those obtained by means of a syn attack is 
89:11. Using MP2/6-31G*, the activation barriers 
associated to ITSa and ITSs are significantly re¬ 
duced from the HF/6-31G* results (ca. 5-6 
kcal/mol), having very similar values. As a conse¬ 
quence, a decrease in stereoselectivity is sensed 
(68:32). This is corrected by using the correlation 
energy obtained at the MP3 level (87:13). This fact 
can be related to the known tendency of MP2 
calculations to overcorrect the too high HF results 
and shows that MP2 calculations overestimate the 
effect of the electron correlation energy on the TSs 


with respect to that of the reactants, in agreement 
with previous studies of related Diels-Alder reac¬ 
tions [45-51]. 

The TV renders very concisely the essentials of 
the chemical process under study. The correspond¬ 
ing components of TV for the variables defining 
the reaction coordinate vectors are depicted in 
Figures 1-4 and are available from the authors 
upon request. The TSs are well associated with 
forming/breaking bonds, as can be seen from the 
main components of the TVs. The results provide 
good hints for a qualitative and semiquantitative 
analysis. 

An analysis of the TV shows that the major 
components are Cl—C(Mg), C(Mg)—Mg, Cl-Ol, 
and Mg—Ol distances, the Cl—C(Mg) distance 
being the most important. The minimal set of 
coordinates capable of producing the TS are these 
variables. This result is obtained after reducing the 
size of the control space followed by the diagonal- 
ization of the corresponding Hessian matrix. It is 
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IVChs IVTSs IVPs 


-46.32 -3B.8/7.5 -109.2 

FIGURE 4. Reactants, chelate complexes, TSs, and products for the anti (upper path) and syn (lower path) addition 
of (CH 3 ) 2 Mg plus CI 2 Mg to 2-hydroxypropanal. Relative energies, including zero-point energies and thermal corrections, 
in kcal/mol, to reactants (total energy = -1664.157711 au) and barrier heights, in kcal/mol (after the slash). The 
arrows on the TSs represent the reaction coordinate vectors. 


important to note that while the atoms participat¬ 
ing in the control space are always the same, those 
in the complementary space are not. If these latter 
do not modulate in first order the force constants 
in the control space, the TV is invariant. The in¬ 
variance is to be understood in terms of preserva¬ 
tion of the reactive fluctuation patterns associated 
with the chemical interconversion processes. One 
of the interesting results obtained is a sort of 
geometric invariance of the fragments participat¬ 
ing in a TS geometry. This fact has been found by 
us [25,26,52] and other workers [53]. Structural 
invariance of the TS structure fragments is an 
important result from the technical viewpoint. It 
helps setting up the search of TSs. 

In the case of the addition of two equivalents of 
CH 3 MgCl to the carbonyl group of the 2-hydroxy¬ 
propanal, the barrier heights are predicted to be 
lower than in the 1:1 stoichiometry case (3-9 
kcal/mol), the anti attack on the anti chelate being 
the most favorable pathway. The ratio between the 


products obtained by means of anti attacks with 
respect to those obtained by syn attacks can be 
calculated assuming a Boltzmann distribution of 
the TSs leading to the two diastereoisomers, by 
using the following equation [54]: 

anti/syn = ^ e ^ Ea/RT jH, e~ AEs/RT , (2) 

where anti and syn refer to the TS conformation 
and A E a and AE S are the relative energies (with 
the zero-point vibrational energies and tempera¬ 
ture corrections) of the TSs for the anti and syn 
attacks, respectively, with respect to the corre¬ 
sponding chelate complexes. The product ratio ob¬ 
tained (> 99:1, anti: syn) agrees very well with the 
experimental ratio, obtained by Carda et al. [55]. 
The corresponding TVs for the variables defining 
the control space are shown in Figure 2(b). These 
TVs are well associated with forming/breaking 
bonds. Cl—C(Mg), C(Mg)—Mg, Cl—Ol, and Mg 
—Ol, as can be seen from their main components. 


INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY 


725 






OLIVA ET AL. 


The barrier heights for the addition of (CH 3 ) 2 Mg 
to 2-hydroxypropanal are lower than in the case of 
the addition of 1 equiv of CH 3 MgCl but higher 
than in the intermolecular processes (2 equiv of 
CH 3 MgCl). The barrier heights are 14-16 
kcal/mol, the products being the most stable 
species (36-38 kcal/mol below the chelate com- 
plexes). In this case, we find that the anti attack is 
the most favorable pathway, with a lower energy 
barrier and a larger stability for the products than 
for the syn attack. The product ratio is calculated 
by Eq. (2), yielding 89:11 anti/syn. The corre¬ 
sponding TVs for the variables defining the control 
space are shown in Figure 3. These TVs are well 
associated with forming/breaking bonds. 
Cl —Cl(Mg), Cl(Mg)—Mg, Cl — Ol, and 
Mg—Ol, as can be seen from the main compo¬ 
nents. 

Since Grignard reagents are an equilibrium mix¬ 
ture of CH 3 MgCl, (CH 3 ) 2 Mg, and Cl 2 Mg, all of 
which can react with chiral a-alkoxy carbonyl 
compounds, we have considered the addition with 


(CH 3 ) 2 Mg + Cl 2 Mg. In this case, the latter system 
forms the chelate intermediate and the former one 
carries out the nucleophilic attack. The barrier 
heights for the addition of (CH 3 ) 2 Mg + Cl 2 Mg to 
2-hydroxypropanal are lower than in the case of 
the addition of 1:1 CH 3 MgCl and (CH 3 ) 2 Mg, but 
higher than in the intermolecular processes (2:1 of 
2CH 3 MgCl). The barrier height is 7-16 kcal/mol, 
the products being the most stable species (62-63 
kcal/mol below the chelate complexes). In this 
case, the syn attack is the most favorable path 
way with a lower energy barrier and greater stabil¬ 
ity for the products than for the anti attack. The 
product ratio is calculated by Eq. (2) and is > 99:1 
syn/anti. The corresponding TVs for the variables 
defining the control space are shown in Figure 4. 
The TVs are well associated with the forming/ 
breaking bonds. Cl—Cl(Mg), Cl(Mg)—Mg, 
Cl—Ol, and Mg—Ol, as can be seen from the 
main components. 

In Figure 5, the stationary points belonging to 
the different reaction pathways of the addition 



FIGURE 5. A schematic view of the reaction pathways studied in this work. R1 corresponds with the addition of 
CH 3 MgCI to 2-hydroxypropanal while R2 is associated with the reaction of (CH 3 ) 2 Mg plus CI 2 Mg and 2-hydroxypropanal. 
The names of the different stationary points appearing along the reaction pathways are defined in the text. IllCh 
represents IllCha and IllChs, which have the same energy and roughly the same structure. 
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processes are schematically depicted. In this repre¬ 
sentation, a global perspective of the reactive PES 
is summarized in order to compare the different 
molecular mechanisms found in this work. 


Conclusions 

Quantum chemical characterization of TSs may 
be rationalized to discuss molecular mechanisms 
for organic reactions. The theory of nucleophilic 
addition of organometallic reagents to carbonyl 
compounds is still a much debated subject, partic¬ 
ularly with respect to the origin of stereoselectiv¬ 
ity. The following factors are found to be dominant 
in the molecular mechanism for the carbonyl addi¬ 
tion of organomagnesium reagents to chiral a-al- 
koxy carbonyl compounds: 

(i) The formation of syn and anti chelate 
complexes is an exothermic process in the 
gas phase, corresponding to the first step 
in the addition process, and takes place 
without an energy barrier. The magne¬ 
sium is coordinated to the lone pair of the 
carbonyl oxygen and to the methoxy oxy¬ 
gen. The chelate complexes can be de¬ 
scribed as puckered five-membered rings. 

(ii) syn and anti chelate intermediates deter¬ 
mine the stereochemistry of the addition 
process, maintaining the conformation 
throughout the reaction pathway. 

(iii) The C—C bond-forming stage, corre¬ 
sponding to the 1,3-intramolecular migra¬ 
tion from chelate complex to products, is 
the second and rate-limiting step for the 
addition process. The corresponding tran¬ 
sition structures are four-membered rings 
and the most favorable pathway is the anti 
attack. 

(iv) The inclusion of a second molecule of an 
organomagnesium reagent decreases the 
barrier height and the process can be de¬ 
scribed as an assisted intermolecular 
mechanism: The first molecule forms the 
chelate structure and the second molecule 
carries out the nucleophilic addition to the 
carbonyl group. 

(v) The reaction pathway for the nucleophilic 
attack of CH 3 MgCl + 2-hydroxypropanal 
has a barrier height larger than the 
(CH 3 ) 2 Mg + 2-hydroxypropanal addition. 


CH 3 MgCl is a more powerful quelant 
agent than is (CH 3 ) 2 Mg. 

(vi) The most favorable pathway corresponds 
to an anti attack via an intermolecular-as- 
sisted mechanism for the addition of 
2CH 3 MgCl. These theoretical results are 
in agreement with the experimental data 
of Reetz et al. [36] and with the di- 
astereoisomeric excess obtained by Carda 
et al. [55]. 

(vii) The inclusion of the solvent effects into 
the calculation can lead to a better (quali¬ 
tative) description of the product ratio. 
However, studies with similar systems [4] 
show that polarity effects on the structures 
and energies of the reactions are rather 
small. 
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ABSTRACT: The molecular mechanisms for the ring openings of cyclopropanone, 

2.2- dimethylcyclopropanone, fnms-2,3-di-fcrf-butylcyclopropanone, and 
spiro(bicyclo[2.2.1]heptane-2.T-cyclopropan)-2'-one systems were studied at the PM3 
semiempirical level in the gas phase and including solvent effects. The behavior of the 
solvent polarity was considered by using the SCRF polarizable continuum method. Six 
solvents were selected: hexane, ether, tetrahydrofuran, pyridine, acetone, and acetonitrile. 
An extensive exploration of the potential energy surface using analytical gradient 
techniques allows the characterization of stationary points associated to the 
stereomutation conversion. Along a disrotatory ring-opening mechanism, cyclopropanone, 

2.2- dimethylcyclopropanone and fnms-2,3-di-fer/-butylcyclopropanone are intraconverted 
via an oxyallyl intermediate. The epimeric forms of the spiro(bicyclo[2.2.1]heptane-2.T- 
cyclopropan)-2'-one intraconvert along two competitive pathways correspond to two-step 
processes by a disrotatory ring-opening mechanism. Two oxyallyl intermediates and four 
transition structures were obtained and the corresponding transition vectors are associated 
to the carbon-carbon bond-breaking process and the dihedral angle measuring the 
conrotatory movement of the plane defined by the three carbon atoms of the 
cyclopropanone ring. The oxyallyl intermediates and the transition structures for the four 
model systems present similar structures and energies and they are located on a rather 
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flat region. The analysis of the theoretical results shows that the solvent reaction field 
decreases the energy barriers for the ring-opening processes and a stabilization of the 
oxyallyl intermediates takes place. The calculated relative barrier heights are in good 
agreement with the experimental data available, and the trends in the kinetics can be 
explained primarily by steric interactions. Nevertheless, for the 

spiro(bicyclo[2.2.1]heptane-2T-cyclopropan)-2'~one system, it is necessary to include a 
specific interaction of a discrete molecule of the nucleophilic solvent on the quantum 
mechanical representation to explain the experimental behavior. © 1997 John Wiley & 
Sons, Inc. Int J Quant Chem 65: 729-738, 1997 


Introduction 

Stereomutation rearrangements involving cyclo- 
propanone systems has been the subject of contin¬ 
uous interest from experimental and theoretical 
viewpoints [1-7]. The molecular mechanism corre¬ 
sponding to the ring-opening process via oxyallyl 
intermediates is a common reaction with impor¬ 
tant applications in organic synthesis [8-10]. Nu¬ 
merous theoretical studies have been reported on 
this type of stereomutation [5, 11-16], many of 
them for the purpose of elucidating the nature of 
the ring-opening process [1, 2, 4, 5, 7, 11 16]. In 
particular, the cyclopropanone ring can be postu¬ 
lated as a reactive intermediate in the Favorskii 
rearrangement [10, 17-21]. Recently, we reported 
quantum chemical calculations on the mechanism 
of the Favorskii reaction [22]. In addition, some 
efforts have been devoted in our laboratory to the 
theoretical treatment of solvent effects in chemical 
reactivity studies [23-25]. The results indicate that 
the activation energies and relative rate constants 
can be estimated with good accuracy at a high 
level of theory, but more important for organic 
chemists, even simple calculations at moderate 
levels of theory, such as semiempirical procedures, 
give useful information of the solvent effects on 
the molecular mechanism for different chemical 
rearrangements. Calculations of the properties of 
stationary points located on the reactive potential 
energy surface (PES) of great complexity can be 
characterized by using semiempirical molecular 
orbital theory. We tried to determine the reaction 
mechanisms and how the solute-solvent interac¬ 
tions can affect the different barrier heights and 
the relative energies of reactants, possible interme¬ 
diates, transition structures (TSs) and products on 
PES. 

Solvent effects or, more generally, environmen¬ 
tal effects can be accounted for by two different 


approaches: adding discrete solvent molecules 
around the subsystem of interest, the super¬ 
molecule approach, or placing them in a cavity 
surrounded by a continuum approach [26-28]. In 
the first approach, specific solvent effects are suc¬ 
cessfully represented, while in the second, nonspe¬ 
cific and long-range interactions are appropriately 
included. In the continuum models [26], the solute 
is embedded in a cavity, while the solvent, treated 
as a continuous medium, characterized by some 
macroscopic properties, e.g., the same dielectric 
constant as that of the bulk liquid, is incorporated 
into the solute Hamiltonian as a perturbation. In 
this reaction field approach, the bulk medium is 
polarized by the solute molecules. The electronic 
distribution of the solute polarizes the continuum 
which generates an electric field inside the cavity, 
which, in turn, affects the solute's geometry and 
electronic structure. This is a convenient model for 
describing electrostatic interactions due to the sol¬ 
vent polarity while short-range interactions are not 
correctly represented by this method. The imple¬ 
mentation of this interaction scheme is achieved 
through the self-consistent reaction field method 
developed initially by Tapia et al. [29, 30]. 

Kinetic data for stereomutation on cyclo¬ 
propanone systems derive from experimental re¬ 
sults in the solution phase [1-4, 6, 7], which is 
often obtained with a great deal of effort. Substitu¬ 
tion on the cyclopropane ring has subtle influences 
on the kinetics for ring cleavage and the origin of 
these kinetic effects is not clear. We describe herein 
a complementary theoretical analysis carried out 
with a PM3 semiempirical procedure to study the 
ring opening of various substituted cyclopropan¬ 
one systems, which have generated considerable 
interest among organic and physical chemists due 
to important mechanistic roles that have been dis¬ 
cerned but not entirely elucidated [1-7, 12-17]. It 
is of considerable interest to know the energy 
profiles of the corresponding reaction pathways in 
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the gas phase and in the presence of different 
solvents. This computational investigation con¬ 
tributes to a better understanding of the stereomu¬ 
tation via a ring-opening process, being the main 
objective the characterization of the reactive poten¬ 
tial energy surface in the gas phase including the 
environmental effects and thus to obtain the na¬ 
ture of the molecular mechanism and to rationalize 
the experimental observations. 


Computational Methods and Models 

PM3 semiempirical calculations [31] were car¬ 
ried out with the GAUSSIAN94 package of pro¬ 
grams [32]. The PESs were calculated in detail to 
ensure that all relevant stationary points were lo¬ 
cated and properly characterized. The exact loca¬ 
tion of the TSs was achieved by using an algorithm 
[33-35] in which the coordinates describing the 
system are separated into two sets: the control 
space, which is responsible for the unique negative 
eigenvalue in the respective force constants matrix, 
and the remaining coordinates set, which is called 
complementary space. The geometry optimizations 
were carried out alternatively on each subspace, 
one at a time, until a stationary structure was 
obtained. Finally, a complete analytical optimiza¬ 
tion of the TS was achieved with an "eigenvalue 
following" optimization method [36, 37] for all 
variables. 

The intrinsic reaction coordinate (IRC) [38] path¬ 
ways, from the TSs down to the two lower-energy 
structures, were traced using the second-order 
Gonzalez-Schlegel integration method [39, 40] in 
order to verify that each saddle point links the two 
putative minima. The requested convergence on 
the density matrix was 10 9 atomic units and the 
threshold value of maximum displacement was 
0.0018 A and that of maximum force was 0.00045 
Hartree/bohr using the Berny analytical gradient 
optimization routine [41, 42]. The nature of each 
stationary point was checked by diagonalizing the 
Hessian matrix to determine the number of imagi¬ 
nary frequencies (zero for the local minima and 
one for the TSs). The unique imaginary frequency 
associated with the transition vector (TV) [43] of 
the different TSs was analyzed. 

The energies of the stationary structures on the 
PES including solvent effects was calculated at the 
PM3 semiempirical level using the Conductor-like 
Screening Model (COSMO) option [44] included in 


the MOPAC93 package program [45]. This COSMO 
model, proposed by Klamt and Schuurmann [44], 
calculates the electrostatic solvation energy by rep¬ 
resenting the solute charge distribution as a set of 
point charges and dipoles in the neglect differen¬ 
tial diatomic overlap formalism. The cavity chosen 
for the calculations was the solvent-excluded sur¬ 
face obtained by overlapping spheres. The liquid is 
assimilated to a continuum characterized by its 
dielectric constant. Six solvents were selected: hex¬ 
ane, ether, tetrahydrofuran, pyridine, acetone, and 
acetonitrile. 

Four systems were studied: cyclopropanone (C), 
2,2-dimethylcyclopropanone (D), tran$-2,3-di-tert- 
butylcyclopropanone (T), and spiro(bicyclo[2.2.1] 
heptane-2.1'-cyclopropan)-2'-one (S) in the gas 
phase and in six different solvents: hexane, ether, 
tetrahydrofuran, pyridine, acetone, and acetoni¬ 
trile. 


Results and Discussion 

An extensive exploration of the PES by means 
of the PM3 method was carried out in order to 
asses the different stationary points related to the 
ring-opening process. The geometries of cyclo¬ 
propanone (C), 2,2-dimethylcyclopropanone (D), 
and trafts-2,3-di-terf-butylcyclopropanone (T), as 
well as the corresponding oxyallyl structures and 
transition structures are illustrated in Figures 1-3, 
respectively. For the spiro(bicyclo[2.2.1]heptane- 
2.T-cyclopropan)-2'-one system (S), two reaction 
pathways depicted corresponding to the two pos¬ 
sible disrotatory ring-opening processes for this 
steromutation were characterized; for this system, 
two oxyallyl intermediates and four TSs were lo¬ 
cated. The structures associated to the stationary 
points of both channels are illustrated in Figure 4. 
The labels of the structures corresponding to TSs 
and oxyallyl intermediates are designed by the 
letters TS and I, respectively. The stationary points 
are distinguished from each other by appending 
the label C, D, T, or S. For the S system, the two 
reactive channels are designed by the letters A and 
B, and the numbers 1 and 2, for both steps of the 
two stepwise mechanisms. Thus, the localized sta¬ 
tionary points are: S, TS-SA1, TS-SB1,1-SA, I-SB, 
TS-SA2, TS-SB2, and S'. 

The most relevant geometrical parameters of the 
stationary points obtained for the different systems 
are depicted in Figures 1-4. Optimized geometries 
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(- 0 . 48 ) 



(- 0 . 48 ) 



I-C 


<C2C1C3 = 62.48 
<C1C2C3 = 58.75 
jo. = 0.87 D 

E = -0.005724 (0.00) 


C2-C3= 2.353 
<04-Cl-C2-H5= 0.0 
<H7-C3-C2-H5= 0.0 
H= 1.97 D 
E = 0.085156 (57.03) 


C2-C3= 2.354 
<04-Cl-C2-H5 = 0.49 
<H7-C3-C2-H5 = 0.0 
|i = 1.97 D 
E = 0.085155 (57.03) 


FIGURE 1 . Representation of the optimized geometries for the stationary points: cyclopropanone, TS for disrotatory 
ring opening, and oxyally! intermediate. Distances in angstroms and bond angles in degrees. Values in italics (in 
parentheses) are net atomic charges. 


of all structures are available from the authors on 
request. For C, the optimized structure agrees with 
the experimental data obtained by Nelson and 
Pierce [46] from a microwave analysis and the 
computed geometries obtained by Lim et al. [5] at 
the (4/4)CASSCF/6-31G* calculation level. The 
TSs and the oxyallyl intermediates are located on a 
very flat region of the PES. On going from cyclo¬ 


propanone to the TSs or oxyallyl intermediate, the 
electronic charge redistribution shows that an ex¬ 
cess of negative charge is developed on the car¬ 
bonyl carbon atom and to a lesser extent on the 
peripheral carbon atoms (see Figs. 1-4). Some in¬ 
teresting observations emerge from the Mulliken 
population analysis [47]. There is no change in the 
Mulliken charges between the TSs and oxyallyl 



D 



TS-D 



<C2-C1-C3 = 62.68 
<04-Cl-C2-H5 = 
<H5-C2-C1-H6 = 
fj. = 0.88 D 
E =-0.029321 (0.00) 


C2-C3 = 2.351 
<04-Cl-C2-H5= 10.62 
<H5-C2-C1-H6= 181.17 
ji = 2.07 D 

E = 0.042899 (45.32) 


C2-C3 = 2.355 
<04-Cl-C2-H5 = 0.35 
<H5-C2-C1-H6= 180.22 
ji = 2.15 D 

E = 0.042741 (45.22) 


FIGURE 2. Representation of the optimized geometries for the stationary points: 2,2-dimethylcyclopropanone, TS for 
disrotatory ring opening, and oxyallyl intermediate. Distances in angstroms and bond angles in degrees. Values in 
italics (in parentheses) are net atomic charges. 
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H31 



H27 


H26 


H20 



TS-T 



<C3-C1-C2 = 62.70 
<04-C1-C2-H5 = 285.08 
<H5-C2-C1-C6 = 213.00 
fit = 0.90 D 
E=-0.078467 (0.00) 


C2-C3 = 2.387 
<04-CI-C2-H5= 180.01 
<H5-C2-C1-C6 = 179.97 
1.98 D 

E = 0.001436 (50.14) 


C2-C3 = 2.420 
<04-Cl-C2-H5 = 179.82 
<H5-C2-C1-C6= 180.06 
11= 1.97 D 
E = 0.001126 (49.95) 


FIGURE 3. Representation of the optimized geometries for the stationary points: frans-2,3”di-terf-butylcyclopropanone, 
TS for disrotatory ring opening, and oxyallyl intermediate. Distances in angstroms and bond angles in degrees. Values 
in italics (in parentheses) are net atomic charges. 


intermediates. The imaginary frequency for the 
TSs TS-C, TS-D, and TS-T are 131.8i, 65.4i, and 
70.3i cm” 1 , respectively. For model S, the TSs 
TS-SA1, TS-SB1, TS-SA2, and TS-SB2 present the 
following imaginary frequency values: 64.4i, 54.9i, 
78.0i, and 49.Oi cm -1 , respectively. The corre¬ 
sponding TVs are associated to a disrotatory ring 
opening, as expected. These four TSs yield similar 
optimized geometrical parameters and energies. 
The predicted energy barrier between the cyclo- 
propanone systems and TSs are in the range of 
45.3-57.0 kcal/mol. For the simple model C, calcu¬ 
lations at high level of theory [5] show a barrier 
height of 32.4 kcal/mol. As expected, the PM3 
results tends to overestimate this barrier. How¬ 
ever, our goal was to rationalize the experimental 
behavior of this stereomutation process for more 
complex systems in various solvents. 

The TSs are energetically and structurally very 
similar to oxyallyl intermediates. The principal dif¬ 
ferences are the rotation of a-carbon atoms of the 
cyclopropanone rings, in the range 0.1-20.4°, out 
of plane in the TS, and the C—O distance of the 
carbonyl group that increases in the range of 
0.052-0.060 A, on going from cyclopropanone sys¬ 
tems to oxyallyl intermediates. The energy differ¬ 
ences between the TSs and oxyallyls intermediates 
are less than 0.2 kcal/mol. These structures can be 
described as diradicals with a strong carbon-oxy¬ 
gen double bond and an electron mainly localized 
on the terminal carbon atoms. An analysis of the 


barrier energies shows that the rate of the intercon¬ 
version processes follows the order S > D > T > 
C. This order can be explained by two additive 
effects—(i) Hyperconjugative effects: Alkyl substi¬ 
tution strongly stabilizes the TS or the oxyallyl 
intermediate for the ring-opening process. This hy¬ 
perconjugative stabilization is more effective by 
the norbornane, S system, than by the tert -butyl 
groups, methyl groups, and hydrogen atoms in the 
T, D, and C systems, respectively, (ii) The size of 
the steric hindrances of this reaction on going from 
the cyclopropanones to the TSs corresponding to 
the ring-opening process: In a disrotatory mecha¬ 
nism, ring opening of the T system forces a tertiary 
alkyl group into a 1,3-ds ally lie interaction with a 
hydrogen atom in the ring-opened oxyallyl inter¬ 
mediate. For the D system, this repulsive interac¬ 
tion is less hindered and takes place between two 
hydrogen atoms. In contrast, due to the presence 
of the rigid norbornane ring in the S system, the 
rotation movement around C2-C3 and C3-C4 
bonds is smaller than are those of freely rotating 
analogs and the 1,3-ds-allylic interaction involves 
the ring hydrogen and the oxyallyl hydrogen. Con¬ 
sequently, a substantial promotion of the stereo¬ 
mutation process can be expected. 

The next step in our investigation is the study 
of the solvent effects on the basic features of the 
stereomutation reaction and we are now in a posi¬ 
tion to compare our results with those of Greene 
et al. [1, 2] for the T system, Cordes and Berson [4, 
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(-0.49) 


I-SA 


C2-C3 = 2.368 
<04-Cl-C2-H5 = 2.30 
<H6-C2-C1-H5 = 178.70 
H = 2.21 D 
E = 0.061616 (46.48) 




<C1-C2-C3 = 59.02 
<C2-C1-C3 = 62.43 
<04-Cl-C2-H5 = 287.5£ 
<H6-C2-C1-H5 = 144.41 
- 0.89 D 

E =-0.012548 (-0.01) 


IB 

(a) 



<H6-C2-C1-H5 ss 179.23 
H = 2.14D 
E = 0.061786 (46.59) 

TS-SA1 



TS-SA2 


Cl-C2= 1.411 
C2-C3 = 2.324 
<C2-C1-C3 = 109.32 
<04-Cl-C2-H5= 194.01 
<H6-C2-C1-H5 = 179.99 
ji = 2.16 D 
E= 0.062908 (47.29) 



TS-SB1 


(b) 


Cl-C2= 1.410 
C2-C3 = 2.344 
<C2-C1-C3 = 110.74 
<04-Cl-C2-H5= 190.12 
<H6-C2-C1-H5= 180.57 
H = 2.16D 
E = 0.064073 (48.16) 



TS-SB2 (0.15) 


C2-C3 = 2.360 
<04-Cl-C2-H5 = 174.18 
<H6-C2-Cl-H5 = 178.51 
= 2.23 D 

E = 0.064309 (48.17) 


FIGURE 4. Representation of the optimized geometries for the stationary points: 

spiro(bicyclo[2.2.1]heptane-2.1'-cyclopropan)-2'-one, TS for disrotatory ring opening, and oxyallyl intermediate. 
Distances in angstroms and bond angles in degrees. Values in italics (in parentheses) are net atomic charges. 
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7] for the S system, and the theoretical work of 
Lim et al. [5] on the C and D systems. The relative 
barrier heights vs. the solvatochromic parameter 
E t ( 30) (Dimroth-Reichardt parameter) [48-51] for 
the six solvents are depicted in Figure 5. As could 
be expected from the electrostatic data, TSs and 
oxyallyl intermediates present identical solvation 
effects; however, preferential solvation of these 
stationary points would be expected due to that 
they are more polar than are the cyclopropanone 
ground-state systems. It should be noted that the 
dipole moments are increased in the range of 
1.07-1.34 D upon the ring opening of the corre¬ 
sponding cyclopropanones for the four model sys¬ 
tems. This is an indication that solvation influ¬ 
ences the stability of these structures to a different 
extent. An analysis of the results shows that the 
negative of the relative barrier height increases on 
going from the nonpolar hexane to the polar ace¬ 
tonitrile solvent. A change to a more polar solvent 
will increase the reaction rate due to that TSs are 
more polar than are the initial cyclopropanone 
systems. The low values of these relative energies 
for the steromutations in solution reflect the dirad¬ 
ical rather than the zwitterionic nature of the 
oxyallyls, while the presence of a zwitterion is 
accompanied by much larger solvent effects [49]. 



FIGURE 5. Relative barrier heights (kcal/mol) for the 
ring-opening process vs. solvent parameter E r (30) 
(Dimroth -Reichardt term). The ordinate shows the 
negative of the barrier height relative to that in hexane 

taken as zero. (-) Cyclopropanone; (-) 

2,2-dimethylcyclopropanone; (•*••) 

trans-2, 3-di-ferf-butylcyclopropanone; (-—) 

spiro(bicyclo[2.2.1]heptane-2.1'-cyclopropan)-2'-one. 


Additionally, our results are in reasonable agree¬ 
ment with the Monte Carlo simulations reported 
by Lim et al. [5] for the C and D models. This fact 
supports the use of the PM3 semiempirical proce¬ 
dure to study the steromutation processes in solu¬ 
tion. 

In Figure 6, a comparison between our results 
and experimental data for the S [4, 7] system is 
given. The values of the relative barrier heights vs. 
the E T (30) for the spirocyclopropanone S are not 
correlated with the experimental data for the T 
system or with the values theoretically predicted 
for the more simple D and C models. In particular, 
Cordes and Benson [7] found that the rate of stere¬ 
omutation in the S system is slower in ether and in 
acetone than in hexane and in methylene chloride. 
This deceleration was explained by Cordes and 
Benson [7] in terms of a nucleophilic association of 
these solvents with the carbonyl group in the cy¬ 
clopropanone ring, leading to a ground-state stabi¬ 
lization. Reaction field representation of solvent 
effects is a convenient model for describing elec¬ 
trostatic interactions due to the solvent polarity. 
Flowever, short-range interactions are not correctly 
represented by this method. Therefore, to under¬ 
stand the role played by specific and nonspecific 



FIGURE 6. Relative barrier heights (kcal / mol) for the 
stereomutation process vs. solvent parameter E r (30) 
(Dimroth -Reichardt term) for the 
spiro(bicyclo[2.2.1]heptane-2.1'-cyclopropan)-2'-one 

system. (-) Experimental values reported by Cordes 

and Berson (these authors used the free energy of 

activation) [7]. (-) Theoretical data including specific 

interaction in the nucleophilic solvent. 
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solute-solvent interactions in the stereomutation 
process of the S system in solution and under¬ 
standing how nucleophilic solvents alter the bar¬ 
rier height for the ring opening, we studied the 
reaction including a single discrete molecule of the 
nucleophilic solvents. 

The apparent contradiction between experiment 
and theory could be overcome taking into account 
a single molecule of nucleophilic solvents in the 
quantum mechanical treatment. Thus, to explore 
this aspect, specific interactions between the ni¬ 
trogen or oxygen lone pairs of the nucleophilic 
solvents (ether and acetone) and the vulnerable 
carbonyl carbon of the cyclopropanone ring were 
considered. Following the extensive crystal struc¬ 


ture studies of Biirgi et al. [52-54] and previous 
analysis by Cordes et al. [7, 55], a model in which 
the distance between the oxygen atom and the 
carbon atom of the cyclopropanone ring, O * • • 
C = 0, is fixed to a value of 2.6 A (near van der 
Waals contact) and 0***C = 0 angles of 98° (a 
reasonable Burgi-Dunitz angle) was selected. The 
minima and the TSs were also located in the pres¬ 
ence of this additional solvent molecule with an 
arrangement depicted in Figure 7. The analysis of 
the results, presented in Figure 6, shows that this 
extra stabilization (2.8 and 4.0 kcal/mol for ether 
and acetone solvent, respectively) is more favor¬ 
able in the cyclopropanone ring systems than in 
the TSs or oxyallyl intermediates, because the con- 



FIGURE 7. A schematic view of the optimized geometries for the spiro(bicyclo[2.2.1]heptane-2.1'-cyclopropan)-2'-one 
system interacting with a single molecule of the nucleophilic solvent: (a) ether; (b) acetone. The distance between the 
oxygen atom of the solvent and the carbonyl carbon of the cyclopropanone ring was fixed at 2.6 A and the angle 
(< oxygen atom-carbonyl carbon-carbonyl oxygen) was fixed at 98°. Distances in angstroms and bond angles in 
degrees. 
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centrated charge in the ground state becomes dis¬ 
persed in the TSs. This result is in good agreement 
with the experimental data. In contrast, the racem- 
ization of the T system has not an anomalous 
behavior in nucleophilic solvents due to the pres¬ 
ence of terf-butyl groups, preventing the formation 
of this interaction. 


Conclusions 

In this work, we carried out a PM3 theoretical 
study of the solvent effects along the reaction 
profiles on the corresponding PES for the ring¬ 
opening process of the cyclopropanone, 2,2-di- 
methylcyclopropanone, frans-2,3-di-fert-butylcyc- 
lopropanone, and spiro(bicyclo[2.2.1]heptane-2.r- 
cyclopropan)-2'-one systems. The polarizable con¬ 
tinuum model and the inclusion of a discrete 
molecule of nucleophilic solvents are used to intro¬ 
duce solvent effects. The continuum and discrete 
solvent models offer a complementary view of the 
nature of solute-solvent interactions. The specific 
details of the process may change at higher levels 
of theory (e.g., ab initio, inclusion of correlation 
energy), but despite the approximate procedure of 
the calculations employed here, some important 
features were clarified. The following conclusions 
can be drawn from the results reported in this 
study: 

(i) The stereomutation processes take place via 
an oxyallyl intermediate along a disrota- 
tory ring-opening mechanism. The corre¬ 
sponding TSs and oxyallyl intermediates 
are structurally and energetically related. 

(ii) The oxyallyl intermediates and the TSs for 
the four model systems are shallow min¬ 
ima on a rather flat region, indicating the 
proximity of the oxyallyls to the transition 
structures for the ring-opening processes. 

(iii) The solvent reaction field decreases the en¬ 
ergy barriers for the stereomutation pro¬ 
cesses and a stabilization of the oxyallyl 
intermediates takes place. A good agree¬ 
ment between the theoretical results and 
experimental data is found. 

(iv) For the stereomutation of the spiro(bicyclo 
[2.2.1]heptane-2.T-cyclopropan)-2'-one sys¬ 
tem, the inclusion of one discrete molecule 
of the nucleophilic solvents forming an 
specific interaction between the lone pairs 


of ether, acetone, acetonitrile, pyridine, or 
tetrahydrofuran molecules and the car¬ 
bonyl group of the cyclopropanone ring is 
able to explain the experimental data. This 
discrete molecule plays an active role in 
the stereomutation process, showing that 
this interaction is more favorable with the 
carbonyl group of the cyclopropanone sys¬ 
tem than is the TS for the ring opening or 
the oxyallyl intermediate. 
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ABSTRACT: As a model of olefin polymerization for a heterogeneous Ziegler-Natta 
catalyst, the mechanisms of the insertion of ethylene and propylene into R-Ti- 
(C1 2 A1H 2 )J(R = CH 3 , C 3 H 7 ) are studied by ab initio self-consistent field (SCF) methods 
and many-body perturbation theory. The structures of the reactant, the intermediate, the 
transition state, and the product are optimized by the Hartree-Fock molecular orbital 
calculation level. The reactions are classified into two steps: the first forms of olefin-Ti it 
complex, and the second step is the carbon-carbon bond formation through the pull-push 
mechanisms. The cocatalysis play a role in facilitating the pull-push mechanism with the 
Al-Cl bonds alternation. For the insertion reaction of propylene, the four-type 
conformations for attacking of propylene (two primary forms: methyl-substituted carbon 
attacks to the carbon side of Ti-C bond; two secondary forms: methyl-substituted carbon 
attack to Ti atom) are studied. For the complex formation, the secondary forms are more 
stable in energy than the primary forms. For the transition state, the primary forms are 
more stable in energy than the secondary forms. © 1997 John Wiley & Sons, Inc. Int J Quant 
Chem 65: 739-747, 1997 


Introduction 

I t is widely known that Ziegler-Natta catalysis 
polymerization is one of the most important 
industrial polymerization reaction, and many ex¬ 
perimental studies have been reported [1-10]. The 
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manufacture for Ziegler-Natta type polymeriza¬ 
tion is based on heterogeneous catalyst which have 
become highly selective and efficient. Classical 
Ziegler catalysts are heterogeneous with the poly¬ 
merization taking place on dislocations and edges 
of TiCl 4 crystals. Heterogeneous titanium-based 
Ziegler catalysts are more versatile; they are able 
to polymerize propene with a very high degree of 
stereoselectivity and catalyze the copolymerization 
of ethylene with higher alk-l-enes such as hex-1- 
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ene, important for the production of flexible, 
nonbrittle polymers for use as films and in packag¬ 
ing. Although there are successful processes which 
give rise to a wide range of polymer products, the 
diversity of active sites in heterogeneous catalysts 
leads to an uneven degree of comonomer incorpo¬ 
ration, with a high incorporation rate in short 
chains, and little incorporation in the high-molecu- 
lar-weight fraction, a drawback where metal¬ 
locenes offer particular promise. In 1959 Bres- 
low and Newburg [11] suggested for their 
[TiCl 2 Cp 2 ]AlClEt 2 system that the transition metal 
is first alkylated and then forms a halide-bridged 
binuclear complex which is capable of reaction 
with ethylene. In spectroscopic study of the forma¬ 
tion of ion pair in Cp 2 TiCl 2 and MeAlCl 2 system, 
both groups of Long [12] and Eisch [9] did not 
observe the ions (Cp 2 TiCH 3 and A1C1 4 ). The pos¬ 
sibility that Cp 2 TiCl 2 :Cl 2 AICH 3 might be the ac¬ 
tive catalysts is ruled out by the ultraviolet (UV) 
studied of Long [12] and the kinetic studies of 
Fink [13]. Eisch and co-workers [14] by H, C, 
and A1 nuclear magnetic resonance (NMR) spec¬ 
troscopy, also concluded that the ion pair 
[(Cp 2 TiCH 2 SiMe 3 ) + —A1C1 4 : included Ti-Cl-Al 
bridged bond] in Cp 2 Ti(Cl)CH 2 SiMe 3 and A1C1 3 
system are effective catalysts for the polymeriza¬ 
tion of ethylene. 

For the generic mechanism of olefin insertion of 
the Cossee mechanism [10] has been accepted 
widely as the most plausible. The first step is 
olefin coordination to a vacant site of Ti atom. In 
the second step olefin is inserted into the Ti-C 
bond through a four-membered cyclic transition 
state. Many theoretical studies of Ziegler-Zatta 
catalysis polymerization have been reported. Most 
of these studies discuss the reaction mechanism in 
Ziegler-Natta polymerization based on Cossee's 
model. Cossee's model was treated with semiem- 
pirical self-consistent field (SCF) molecular orbital 
(MO) methods by Armstrong and co-workers [15] 
and by Novaro and co-workers [16]. Their results 
suggested that the Ziegler-Natta mechanism might 
be explained by a concerted motion of the olefin 
and alkyl group that would bring about a transi¬ 
tion from octahedral coordination to a trigonal-bi- 
pyramidal coordination for the Ti atom. This view 
was also confirmed with the ab initio SCF MO 
method by Clementi and co-workers [17, 18], Bal- 
azs and Johnson [19] demonstrated the reaction 
CH 3 TiCl 4 + C 2 H 4 by using a SCF-X a -SW MO 
method. The interaction between C 2 H 4 and 
CH 3 TiCl 2 was examined by ab initio MO methods 


[20, 21] and by density functional methods [22]. In 
the more realistic systems of homogeneous cataly¬ 
sis (including Cp), hydrozirconation of ethylene 
and acetylene by Cp 2 Zr(H)Cl and (or) Cp 2 TiCH 3 
was studied by ab initio MO methods [23, 24]. 
The insertions of ethylene into Cp 2 ZrCH 3 and 
(SiH 2 Cp 2 )ZrCH 3 were also treated by ab initio 
MO method [25] and by density functional meth¬ 
ods [26-28]. The stereotacticity of zirconium met¬ 
allocene Ziegler-Natta propylene polymerization 
catalysis was studied using a combination of ab 
initio MO method and empirical force field molec¬ 
ular mechanics techniques [29-34]. However, 
above theoretical studies did not reported for the 
mechanism of cocatalysis of Ziegler-Natta poly¬ 
merization. The present author has proposed the 
fundamental carbon-carbon bond formation 
mechanisms with a simple model of Cossee type 
in a previous study [35]. The proposed mecha¬ 
nisms were on the basis of electronic pull-push 
process: the electrons in the Ti-R bond move to 
the region of new carbon-carbon bond formation, 
and 77 electrons of ethylene move to the region of 
new Ti-carbon bond formation. These mechanisms 
were the same as those of alumination reaction of 
olefin as shown in our previous studies [36, 37]. In 
the study [35], the role of cocatalysis for the Zieg¬ 
ler-Natta type reaction was also reported on the 
basis of the classical model (CH 3 TiCl 4 AlH 2 + 
H 2 C = CH 2 ). For this model, it was clarified that 
the cocatalysis play a role in facilitating the pull- 
push mechanism through the Al-Cl bond alterna¬ 
tion. Although we clarified entirely the role of 
cocatalysis for the model, we could not explain the 
stereotacticity for polymerization on the basis of 
the model. Consequently, for the explanation of 
the role of cocatalysis and the stereoreguration for 
the heterogeneous Ziegler-Natta polymerization, 
the present author proposes a new model as the 
following: 


H 

*N, 

Q -A1^ H 





Cl 
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This model is more realistic compared with the 
previous one. For our model of olefin polymeriza¬ 
tion, a heterogeneous as well as homogeneous 
catalysis has been adopted. 

In this study, two subjects are reported. The 
first is the role of cocatalysis for olefin insertion 
mechanism. The second is the stereoregulation of 
propylene polymerization on the basis of our 
model. 


Method of Calculation 

The basis sets used for Ti atom through the 
study were 3-21G of Dobbs and Hehre [38]. For C, 
H, Cl, and A1 atoms the standard 3-21G basis 
functions [39, 40] were used. The set is denoted at 
the following sections as BS-I. For better energies 
the 6-31G(d) basis functions [41, 42] for carbon 
atom and the (8s/4p/3d) contracted functions [43] 
for Ti atom were used, denoted as BS-II. All equi¬ 
librium- and transition-state geometries were de¬ 
termined by the use of analytically calculated 
energy gradients with Hartree-Fock (HF) wave 
functions. The stationary points were identified as 
the equilibrium or the saddle point by examining 
the calculated normal vibrational frequencies. The 
force constant matrix and thereby the vibrational 
frequencies were calculated by analytical second- 
derivative procedures [44, 45], and zero-point vi¬ 
brational energy corrections were obtained at this 
level. Additional calculations were performed to 
obtain improved energy comparisons—the calcu¬ 
lations at the HF-optimized structures with elec¬ 
tron correlation incorporated through the second- 
and third-order Moller-Plesset (MP2 and MP3) 
perturbation theory [46-50]. 

Ab initio molecular orbital calculations were 
carried out by using the GAUSSIAN94 program 
[51]. 


Results and Discussion 

INSERTION OF ETHYLENE INTO 
H 3 CTi(Cl 2 AlH 2 ) 2 + AND ROLE 
OF COCATALYSIS 

The stationary point geometries of reactant, 
complex, transition state, and product for 
(Cl 2 AlFI 2 ) 2 TiCH 3 + H 2 C = CPI 2 reaction are illus¬ 
trated in Figure 1. For all stationary points, Ti, Cl c , 
Cl d , and three carbon atoms are almost in the same 


plane. The stationary point geometry of reactant 
has a C s symmetry. The C s symmetry plane bisects 
each angle of Cl a -Ti-Cl c and Cl b -Ti-Cl d , and is in 
the plane of Ti-C-H a . For the reactant conforma¬ 
tion, there are two possible spaces for approaching 
of olefin: Cl a -Ti-C-Cl c and Cl b -Ti-C-Cl d spaces. 
The space of Cl a -Ti-C-Cl c is more wide than the 
space of Cl b -Ti“C-Cl d from the comparison of the 
angels of Cl c TiC(H 3 ) and Cl d TiC(H 3 ). The Cl c -Ti 
(and Cl a -Ti) bond length is a little shorter than the 
Cl d -Ti (and Cl b -Ti) bond length. The difference 
between Cl c -Ti and Cl d -Ti bond lengths, how¬ 
ever, is only 0.019 A in length. Consequently, the 
space of Cl a -Ti-C-Cl c is favorable for the ap¬ 
proaching of olefin. 

For the it complex, C 13 carbon in ethylene inter¬ 
acts stronger with the Ti atom than the C a carbon, 
because of the repulsion of C“ and Cl c atom (or 
Ti-Cl c bond). In the catalysis part, the bond be¬ 
tween Ti atom and Cl c becomes longer by 0.321 A 
in length than that in the reactant. Because the Cl c 
atom moves to the reverse side of methyl-Ti bond 
for Ti atom during the complex formation. The 
longer Ti-Cl c leads to be the shorter C1 C -A1 bond, 
the longer Cl b -Al bond, and the shorter Ti-Cl b 
bond in comparison with those of the reactant. 
Accordingly, this part (Ti, Cl c , Al, and Cl b ) is a 
complex between C1 C -A1H 2 and Ti-Cl b . On the 
other hand, the variation of Ti-Cl d bond for the it 

o 

complex formation is small (only 0.03 A in length). 
As a result, the one side of cocatalysis is active for 
the 77 -complex formation and the other is not ac¬ 
tive. 

For the transition state, the Ti-Cl c bond be¬ 
comes shorter in length than that of the 77 com¬ 
plex. This behavior facilitates the bond breaking of 
Ti-C(H 3 ). Because the insertion mechanism is the 
pull-push as shown in our previous work [35, 36], 
the electrons of Ti-C(H 3 ) bond move to the region 
of new C-C bond formation and original 77 elec¬ 
trons of ethylene part move to the region of the 
new C-Ti bond formation. Accordingly, the ap¬ 
proach of Cl c to Ti atom facilitates the moving of 
the electrons of Ti-C(H 3 ) bond region. On the 
other side, the Ti-Cl d bond length becomes longer 
than that of the 77 complex. This behavior facili¬ 
tates moving the 77 electrons of the ethylene part 
easily to the new C-Ti bond formation region (the 
reverse side of Ti-Cl d bond for Ti atom). 

For the product, polymer grows up for the more 
wide space of Cl-Ti-Cl. The mechanism of co¬ 
catalysis is shown in Figure 2. From the figure, the 
bonds alternation for Cl-Al and Ti-Cl bond occurs 
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FIGURE 1. HF/BS-I optimized geometries of reactant, i t complex, transition state, and product for reaction 
H 3 CTi(CI 2 AIH 2 ) 2 + H 2 C = CH 2 . Distances and angles are in angstrom and degrees, respectively. 


during the one process of polymerization. It is also 
considered that this is the reaction of olefin inser¬ 
tion into H 3 C-TiCl 2 plus cocatalysis (C1A1H 2 ) 2 . 
Many theoretical works [22, 30] treated the inser¬ 
tion of ethylene into H 3 C-TiCl 2 as a model of the 
homogeneous catalysis of [Cp 2 TiCH 3 ] + . The po¬ 
tential energy surface of H 3 CTiCl 2 ethylene is 
different from that of our system. For Cl 2 TiCH 3 + 
ethylene, the complexation energy at the it com¬ 
plex is over 40 kcal/mol. On the other hand, the 
complexation energy of our system is about 10 
kcal/mol. TheTarge difference of the complexation 
energies comes from the distribution of the posi¬ 
tive charge by the cocatalysis. The influence of the 


complexation energy on the charge distribution 
can be seen for the other homogeneous systems. 
The 7r complexation energy for Cp 2 TiCH 3 + ethyl¬ 
ene was proposed by Wess and co-workers [24]: 

— 5.3 kcal/mol for the HF calculation and -23.7 
kcal/mol for the MP2 calculation level. Yoshida 
and co-workers [25] proposed the 7r complexation 
energy for H 2 SiCp 2 TiCH 3 + ethylene by various 
methods: —13.0 kcal/mol for the HF calculation, 

— 28.6 kcal/mol for the MP2 calculation, —20.9 
kcal/mol for the MP3 calculation, and —21.9 
kcal/mol for QCISD calculation levels. For these 
homogeneous systems, the it complexation ener¬ 
gies by the HF calculation are almost the same to 
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FIGURE 1. Continued. 


the value for our system. Yoshida and co-workers 
pointed out that the calculation values for their 
system are far from convergence in the perturba¬ 
tion series. Accordingly the difference between the 
calculated rr complexation energies by the HF and 
by the MP2 levels is quite large (15.6 kcal/mol) for 
the homogeneous system. For our system, the dif¬ 
ference of the 7 t complexation energies by the HF 
and by the MP2 calculations is only 4.2 kcal/mol. 
The ethylene binding energy for H 2 SiCp 2 TiCH J is 
21.9 kcal/mol (best value), and is the middle value 
of those for Cl/TiCHj and for our system. 

For the activation energy at the transition state 
—the energy is given relative to the 77 complex of 
[TiJ-CHj and ethylene—13.8 kcal/mol for the HF 
calculation and 4.3 kcal/mol for the MP2 calcula¬ 
tion level for Cl/TiCHj system were proposed 
[30]. For the H 2 SiCp 2 TiCH+ system [25], 14.3 
kcal/mol for the HF calculation, 1.3 kcal/mol for 
the MP2 calculation, 10.9 kcal/mol for the MP3 
calculation, and 7.1 kcal/mol for the RQCISD cal¬ 
culation levels were proposed. The activation ener¬ 
gies from the tt complex for our system is 9.3 
kcal/mol for the HF calculation, 0 kcal/mol for 
the MP2 calculation, and 3 kcal/mol for the MP3 
calculation levels. The activation energy barrier 
height for our system is about 7-2 kcal/mol lower 
than those for the other homogeneous systems. It 
is considered that this energy difference comes 
from the energy reduced by the cocatalysis. This 
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FIGURE 2. Mechanism of olefin insertion into Ti-CH 3 
bond of H 3 CTi(CI 2 AIH 2 )J. 


low-energy barrier corresponds to the fact that the 
reaction of heterogeneous catalysis is more active 
than that of homogeneous catalysis. 

ISOTACTIC STEREOREGULATION IN 

PROPYLENE POLYMERIZATION 

For evaluation of regio- and stereoregulatory 
capabilities, we have to compare the four possible 
conformations for tt complex and transition state 
shown in Chart 1 the differences in the geometry 
and the total energy. The four tt complexes and 
four transition-state geometries, primary A, pri¬ 
mary B, secondary A, and secondary B shown 
in Chart 1, for propylene insertion into 
H 3 C—Ti(Cl 2 AlH 2 ) 2 have been optimized by the 
HF/BS-I method. The some geometry parameters 
and the relative energies, relative to primary A for 
all the case, are summarized in Table II. 

For 7r complex, the difference of the distances of 
Ti-C (C“ and C /3 ) bonds are drastic for the confor¬ 
mations of the primary and secondary. This comes 
from the 77 -electron distribution of propylene. 
Namely the unsubstituted carbon in propylene has 
more 7r-electron density than methyl-substituted 
carbon. It is widely known that the 77 complex is 
formed with the interaction between the empty 
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TABLE I 


Relative energies from the reactant for ethylene polymerization and propylene polymerization (kcal/mol). 


Method 

HF/BS-I 

MP2/BS-I 

HF/BS-II 

MP2/BS-II 

MP3/BS-II 

Ethylene +H 3 CTi(CI 2 AIH 2 ) 2 

Reactant 

0.0 

0.0 

0.0 

0.0 

0.0 

77 Complex 

-13.1 

-14.4 

-8.7 

-12.9 

-11.3 

Transition state 

- 2.2 

- 12.1 

0.6 

-13.7 

-8.3 

Product 

-31.6 

-31.6 

-30.4 

-34.5 

-34.3 

Propylene + H 3 CTi(CI 2 AIH 2 ) 2 

Reactant 

0.0 

0.0 

0.0 

0.0 


77 Complex 

Primary A 

-18.5 

- 20.2 

-13.6 

-19.3 


Primary B 

-18.9 

-19.9 

-14.0 

-19.0 


Secondary A 

-20.9 

-20.4 

-16.3 

-19.5 


Secondary B 

- 20.6 

— 20.5 

-15.7 

-19.5 


Transition state 

Primary A 

-5.9 

-16.0 

-1.9 

-18.0 


Primary B 

-6.3 

-16.6 

- 2.0 

-18.1 


Secondary A 

-3.0 

-14.0 

1.2 

-14.1 


Secondary B 

- 2.8 

-14.0 

1.9 

-13.9 



orbital of transition metal and tt orbital of olefin. 
Consequently, the unsubstituted carbon interacts 
stronger with Ti atom than the methyl-substituted 
carbon. The differences between the total energies 
of the primary and secondary conformations for 
the HF and MP2 calculation levels are 2 and 0.2-0.3 
kcal/mol, respectively. The conformations of the 
secondary are a little favorable in energy to com¬ 
pare with those of the primary. The energy differ¬ 
ence, however, is very small, and the stereo-regu¬ 
lation of the product is not decided probably by 
the formation of the tt complex. The conformation 


of the secondary is similar to that of the ethylene 
insertion. The energy difference between A and B 
for both of the primary and secondary is little 
(under 1 kcal/mol). The tt complexation energy 
for propylene is about 7-8 kcal/mol more stable 
than that for the ethylene. The difference of these 
complexation energies for both systems comes from 
the 7r-electron distribution with methyl-substitu¬ 
tion group. As the results, the 77 complexation 
energy have more influence for the electronic part 
of the methyl-substitution group than that for the 
steric part. 


TABLE II_ 

Ti - C bond distance and the reiative energy from primary A geometry for propylene polymerization. 


Bond distance (A) Relative energy (kcal / mol) 



Ti -C“ 

Ti-C^ 

Ti -C(Hg) 


c^-cong) 

HF/BS-I 

MP2/BS-I 

HF/BS-II 

MP2/BS-II 

Complex 

Primary A 

2.355 

3.001 

2.034 

1.361 

3.055 

0.0 

0.0 

0.0 

0.0 

Primary B 

2.348 

3.029 

2.029 

1.361 

3.213 

-0.38 

0.27 

-0.40 

0.30 

Secondary A 

3.008 

2.325 

2.026 

1.369 

3.047 

-2.47 

-0.19 

-2.71 

-0.28 

Secondary B 

3.003 

2.327 

2.025 

1.367 

3.053 

- 2.10 

-0.27 

-2.17 

- 0.20 

Transition state 

Primary A 

2.092 

2.540 

2.139 

1.432 

2.108 

0.0 

0.0 

0.0 

0.0 

Primary B 

2.092 

2.537 

2.141 

1.431 

2.105 

-0.37 

-0.61 

- 0.12 

- 0.11 

Secondary A 

2.195 

2.512 

2.121 

1.406 

2.172 

2.89 

1.92 

3.08 

3.96 

Secondary B 

2.199 

2.511 

2.116 

1.406 

2.174 

3.14 

2.00 

3.79 

4.14 


744 


VOL. 65, NO. 5 











ZIEGLER-NATTA POLYMERIZATION MECHANISMS 


For the transition state, the Ti-C 01 bond lengths 
of the primary conformations are about 0.1 A 
shorter than those of the secondary. From the 
comparison of the geometry parameters between 
the primary and secondary conformations, the pri¬ 
mary conformations are earlier transition states 
than the secondary conformations. The conforma¬ 
tions of the primary are about 3-4 kcal/mol more 
stable in energy than those of the secondary at 
both calculation levels of the HF and MP2. The 
difference of activation energy barrier heights be¬ 
tween the primary and the secondary conforma¬ 
tions comes from the it electrons distribution with 
methyl-substituted group. As described above, the 
unsubstituted carbon in propylene has much tt- 
electron density. Consequently, the reaction path 
of the secondary conformation is unfavorable for 
the pull-push mechanism; the electrons in the 
Ti-R bond move to the region of new carbon-car¬ 
bon bond formation and the electrons of tt bond 
move to the region of new Ti-C bond formation. 
Namely the electron distribution of propylene for 
the secondary conformations has reverse effects for 
the electron movement along the reaction path. 
The energy differences between A and B for both 
primary and secondary conformations are also un¬ 
der 1 kcal/mol. The energy difference between 
primary A and primary B should determine isotac¬ 
tic stereoregulation for the product. From the small 
energy difference, it is concluded that the isotactic- 
ity is not decided for the initiation step of poly¬ 
merization. This conclusion corresponds to the re¬ 
sults [34] for propylene insertion to transition metal 
(Ti, Zr, and Hf)-C bond for homogeneous catalysis 
systems by ab initio MO-MM (molecular mechan¬ 
ics) method. They concluded the stereoselectivity 
is decided after the first propylene insertion from 
the results of the MM calculations. The stereoregu¬ 
lation via direct ligand-olefin interaction for their 
catalyst is not significant. Accordingly the stereos¬ 
electivity is decided by steric energies between 
monomer and polymer. However, their calculation 
is the MM and included only steric effects (not 
electronic effects). 

In order to study the steric and electronic effects 
of polymer by ab initio molecular orbital calcula¬ 
tion level, the transition states for the second step, 
as shown in Figure 3, were calculated. In Figure 3, 
conformation I indicates the polymer part C 8 HM 2 , 
spreads to the outside of the space of Cl b TiCl d C 7 . 
Conformation II indicates the polymer part spreads 
to the inside of the space of Cl b TiCl d C y . For these 


C|a. 



-aih 2 


■ci d 


C|a -AIH 2 

Cl c 



P(Me) 


Conformation (I) Conformation (II) 

FIGURE 3. Conformations of polymer part at the 
transition state for reaction of (CH 3 ) 2 CHCH 2 Ti(CI 2 AIH 2 ) 2 
and propylene. 


two conformations of polymer part, the transition 
states of eight types (primary A-I, A-II, B-I, and 
B-II, and secondary A-I, A-II, B-I, and B-II) were 
calculated. These eight conformations are real tran¬ 
sition states and have one negative eigenvalue for 
their force constant matrix. The relative energies 
for these eight transition state are listed in Table 
III. The energy difference between the primary A-I 
and A-II is only 0.7 kcal/mol at the MP2 calcula¬ 
tion. The energy of the primary B-II is almost the 
same to that of the primary A-I. However, the 
primary B-I is about 5 kcal/mol higher in energy 
than the primary A-I. The value of 5 kcal/mol 
indicates the static repulsion energy between the 
methyl groups of propylene and polymer part. The 
primary B-II is about 5 kcal/mol more stable in 
energy than the primary B-I. If the space of 
Cl b TiCl d is crowded with other catalysis (maybe 


TABLE III_ 

Relative energy from primary A-I conformation 
for the second step of propylene polymerization. 

Relative energy (kcal / mol) 
HF/BS-1 MP2/BS-I 


Transition state 


Primary A-I 

0.0 

0.0 

Primary A-II 

0.88 

0.69 

Primary B-I 

5.27 

4.61 

Primary B-II 

0.51 

-0.23 

Secondary A-I 

1.63 

3.75 

Secondary A-II 

2.32 

4.26 

Secondary B-I 

2.28 

4.01 

Secondary B-I! 

1.97 

3.49 
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this assumption indicates a realistic heterogeneous 
catalysis), the conformation II types are unstable in 
the energy. All secondary types are about 4 
kcal/mol more unstable in energy than the pri¬ 
mary A-I. This reason comes from the electronic 
distribution of propylene as mentioned above. The 
insertion is the pull-push mechanism, and i t elec¬ 
trons of monomer have to move new Ti-C bond 
region through the transition state. Consequently, 
the regioselectivity of reaction is controlled by the 
static repulsion between the methyl groups of 
monomer and of the first unit in polymer for the 
primary types, and is controlled by the electronic 
effects (the it electrons distribution of monomer) 
for the secondary types. As a result, the reaction 
path of the primary A-I type leads to the isotactic 
sequence. 


Conclusions 

The mechanisms of the insertion of ethylene 
and propylene into R-Ti(Cl 2 A1H 2 )J (R = CH 3 and 
C 3 H 7 ) were studied by ab initio MO methods. Our 
treated catalytic system is one kind of a edge 
model of TiCl 4 crystals for heterogeneous 
Ziegler-Natta catalysts. The reactions are classi¬ 
fied into two steps: the formation of olefin-Ti tt 
complex and olefin insertion through the 
pull-push mechanism. For the 7r-complex forma¬ 
tion step, one side of cocatalysis works actively to 
weak the Ti-CH 3 bond. For the olefin insertion 
step, the cocatalysis works to break the old Ti-CH 3 
bond, and the other side of cocatalysis works to 
form a new Ti-C bond. As the results, the cocatal¬ 
ysis play a role in facilitating the pull-push mech¬ 
anism through the Al-Cl bonds alternation. 

For the initial step of the propylene polymeriza¬ 
tion, four types of conformation were studied for 
each 7 T complex and for each transition state: pri¬ 
mary A, primary B, secondary A, and secondary B. 
For the 7r-complex formation, the secondary con¬ 
formations are a little favorable in energy to com¬ 
pare with those of the primary. The energy differ¬ 
ence, however, is only 0.2-0.3 kcal/mol by the 
MP2 calculation and about 2 kcal/mol by the HF 
calculation level. Accordingly, the tt complex does 
not decide probably the stereoregulation of the 
product. For the transition state, the primary con¬ 
formations are favorable by about 4 kcal/mol in 
energy to compare with those of the secondary. 
The energy difference between the primary confor¬ 


mations A and B is only 0.3 kcal/mol by the MP2 
level. Therefore, the isotacticity is not decided at 
the initiation step of polymerization. 

The transition states for the second step of poly¬ 
merization were calculated by the HF/BS-I calcu¬ 
lation level. All secondary types of the transition 
states are about 4 kcal/mol higher in energy than 
the primary A-I. Conformation II for the primary 
types has almost the same energy to the primary 
A-I. However, if the space of Cl b TiCl d C y is 
crowded with other catalysis, all conformations of 
type II are unstable in energy. Accordingly, the 
transition state of the primary B is about 5 
kcal/mol higher than that of the primary A. The 
most stable structure, the primary A-I, leads to the 
isotactic type product. 

As mentioned, the role of the cocatalysis for the 
pull-push mechanism and the isotacticity for 
propylene polymerization could be shown on the 
basis of our model of the Ziegler-Natta type catal¬ 
ysis. It is considered that the role of cocatlysis and 
the mechanisms of the isotacticity polymerization 
can be extended to the case of the homogeneous 
catalysis system. 
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ABSTRACT: A density functional theory study of Au, Au 2 , Au 3 , and Au 4 is 
performed focusing on the study of gold tips or contacts of interest in investigations of 
molecular-scale electronics. The ground state for the four systems corresponds to the one 
with the lowest multiplicity, and the ordering of energies follows the multiplicity in all 
cases. It is found that the tetrahedral geometry of Au 4 is not the preferred one for the 
lowest spin states with neutral charge, but is the preferred one for charged systems. The 
ionization potential and electron affinities for the four systems were also obtained. A 
qualitative interpretation of the Kohn-Sham highest occupied molecular orbital (HOMO) 
and lowest unoccupied molecular orbital (LUMO) orbitals is given based on the present 
calculations. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 749-758, 1997 


Introduction 

The development of scanning tunneling mi¬ 
croscopy (STM) techniques has enormously trig¬ 
gered the possibilities of using organic chemistry 
as the fundamental complement for semiconduc¬ 
tors [1]. This may allow the fabrication of logical 
circuits at the level of single molecules [2]. These 
studies require the precise knowledge of the effect 
of the STM tips or contacts on these molecules. 

One of the major metals used as tips or contacts 
for the development of molecular-level electronic 
circuits is gold. In some cases, small gold clusters 
are the preferred material for measurements of 

Correspondence to: J. M. Seminario. 
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current-voltage characteristics [3], Other interest 
centers on the measurements of electrical resis¬ 
tance of wires composed of single atoms, and 
previous studies have been performed with xenon 
[4], Therefore, in this study we have decided to 
focus on the characteristics of small gold clusters 
that could represent the tips or contacts in experi¬ 
ments of molecular conductance. These contacts 
could be small enough to form one-atom-point 
contacts. In addition, the interactions of gold clus¬ 
ters with single molecules must be studied in 
order to understand and design systems for 
single-molecule electronics. There are several ex¬ 
perimental techniques such as molecular beams 
that allow the synthesis and characterization of 
clusters of a defined size. Several studies have 
been able to determine, theoretically and exper- 
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imentally, dissociation energies, ionization po¬ 
tentials, and chemical reactivity for some gold 
clusters [5-17]. The study of gold clusters is inter¬ 
esting because clusters combine both the proper¬ 
ties of metallic surfaces and metallic compounds. 
The goal of this contribution is to study and deter¬ 
mine the basic properties of the lowest states of 
gold monomer, dimer, trimer, and tetramer using 
the best theoretical tools available at present, and 
at the same time, to determine a proper tool to 
study much larger clusters and gold complexes of 
interest for molecular-scale electronics. 


Theory 

We have used density functional theory (DFT) 
techniques [18-24] combined with effective core 
potentials [25]. Presently, this seems to be the most 
powerful combination of tools to deal with rela¬ 
tively large numbers of big atoms. The basis set 
used for the gold atom is the Los Alamos National 
Laboratory's (LANL) set for effective core poten¬ 
tials (ECP) of double-^ type [26-28]. The use of 
DFT is fully justified due to the fact that is an ab 
initio tool able to deal with a broad variety of 
systems. Several successful applications of DFT 
have been reported using the so-called hybrid 
functionals, where a portion of the exchange is 
calculated as a fully nonlocal functional of the 
wave function of an auxiliary noninteracting sys¬ 
tem of electrons. Since this resembles the exchange 
in the Hartree-Fock (HF) procedure (actually in 
any wave function procedure), it is common to 


refer to this functional, or procedure, as a DFT-HF 
hybrid. However, we have to consider that the 
so-called exchange is being calculated using a non¬ 
interactive wave function whose density, but not 
its wave function, corresponds to the real system. 
A detailed analysis and their theoretical rigor was 
recently reviewed [29]. The functional that we used 
is the B3PW91 [30-33]. All calculations were per¬ 
formed using the Gaussian-94 program [34]. All 
geometry optimizations were performed via the 
Berny algorithm in redundant internal coordinates 
[35]. The thresholds for convergency were 0.00045 
atomic units (a.u.) and 0.0003 a.u. for the maxi¬ 
mum force and root mean square (RMS) force, 
respectively. The self-consistency of the noninter¬ 
active wave function was performed with a re¬ 
quested convergence on the density matrix of 10 -8 , 
and 10 ~ 6 for the RMS and maximum density ma¬ 
trix error between iterations, respectively. These 
settings provide correct energies of, at least, five 
decimal figures and geometries of approximately 
three decimal figures. On the other hand, the use 
of the pseudopotentials with relativistic correc¬ 
tions have been widely demonstrated to be a good 
compromise with the alternative use of full-elec¬ 
tron procedures. This reduces the required compu¬ 
tational effort without loss of accuracy [36]. 


Results and Discussions 

Table I shows the results for the lowest state of 
each spin symmetry of the gold atom. The pre¬ 
dicted ground state is a doublet, and this result is 


TABLE I 


Results for the gold monomer (1 Ha = 27.2114eV). a 

Multiplicity 

Energy (H a ) 

Energy (eV) 

Experimental (eV) 

Doublet (Au) 

-135.48424 (0.00) 



Quartet (Au) 

-135.22896 (6.95) 



Sextet (Au) 

-134.84610 (17.36) 



Octet (Au) 

-134.37500 (30.18) 



Singlet (Au - ) 

-135.56147 



Triplet (Au - ) 

-135.45921 



Singlet (Au + ) 

-135.14100 



Triplet (Au + ) 

IP of Au 

-135.03201 

9.34 b 

9.22 

EA of Au 


2.10 b 

2.31 

KS-HOMO Au 


-6.77 


KS-LUMO Au 


-4.60 


a Relative energies in parentheses (eV). 

b Relativistic MRSDCI/ (13s11p504f) yielded IP = 9.04 eV and EA = 

2.11 eV [9], 
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in agreement with the experimental observation 
[37]. The ordering of the higher spin states follows 
the predicted trend. In order to calculate the ion¬ 
ization potential (IP) and electron affinity (EA) of 
the gold atom, we also studied the positive and 
negative ions of gold. These results are also in 
excellent agreement with the experimental values, 
and they are slightly better than expensive rela¬ 
tivistic multireference single-double configuration 
interaction (MRSDCI) calculations using larger ba¬ 
sis sets [9]. Table I also shows the Kohn-Sham 
(KS) highest occupied molecular orbital (HOMO) 
and lowest unoccupied molecular orbital (LUMO) 
energies of the Au atom. They are plotted in Fig¬ 
ure 1 where the A values are shown from the 
negative of the HOMO level to the IP, and from 
the negative of the LUMO level to the EA. These A 
values are 2.57 and 2.50 eV for the HOMO and 
LUMO, respectively. These values correspond to 
the electron contribution to the energy in the adia¬ 
batic integral [38, 39] of the Kohn-Sham [40] pro¬ 
cedure: 


d\, 

J o 

resulting from the reorganization of the electron 
density after the electron ejection or capture. 

Table II shows the results for the gold dimer. 
The ground state is a singlet with a bond length of 


LOWEST ENERGY STATES OF Au, 


1 Au+ 

A=2.57 


Au KS-HOMO 

‘"'"I 


IP=9.34 


6.77 


EA=2.10 


Au- 


Au KS-LUMO 


A=2.50 


Au 


4.60 


FIGURE 1 . Energy levels for the Au atom (in eV) 


TABLE II_ 

Results for the gold dimer (D e is calculated with respect to two doublet monomers). 3 


Multiplicity 

Energy (H a ) 

Bond Length (A) 

Energy (eV) 

D e (eV) 

Singlet Au 2 ^ 

-271.03845 

2.547 

0.00 

1.90 


(2.472) 


(2.30) 



[2.549] 


[2.22]DHK BLYP 



[2.556] 


[2.19]Pauli BP 



[2.517] 


[2.26]ZORA-BP 



[2.515] 


[2.29]DKH BP 



[2.513] 


[2.54]DKH PW 



[2.54] 


[1,86]ARPR-CEPA 



[2.54] 


[2.13]RPP-QCI(T) 

Triplet Au 2 3 X U 

-270.97384 

2.961 

1.76 

0.15 

Quintet Au 2 

-270.76759 

2.638 

7.37 

-5.47 

Septet Au 2 

-270.49636 

2.502 

14.75 

-12.85 

Doublet Au 2 2 2 u 

-271.11149 

2.695 



Doublet Au 2 2 2 g 

EA of Au 2 

-270.69432 

2.675 

1.99 
(1.94) 


IP of Au 2 



9.36 


a Experimental values are in parentheses. Other calculations [17] are shown in square brackets. 
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2.547 A and an atomization energy of 1.90 eV with 
respect to two gold ground-state atoms. This is 
followed by a triplet state at 1.76 eV of separation 
and with a weak bond of only 0.15 eV. The lowest 
quintet and lowest septet were also calculated. 
They show stable structures with a bond length of 
2.695 and 2.675 A, respectively. In addition, we 
have also analyzed the positive and negative ions 
in order to obtain IP and EA of the ground-state 
gold dimer. These calculations yielded 9.36 and 
1.99 eV, respectively, practically the same values 
as for the monomer. The EA is in excellent agree¬ 
ment with the experimental value of 1.94 eV 
[12-14]. We were not able to find an experimental 
value for the IP of Au 2 . The bond dissociation 
energy ( D e ) of 1.90 eV is in fairly good agreement 
with the reported experimental values of 2.30 eV 
[41] and 2.33 eV [10, 11]. The o bond length is 
slightly overestimated by 0.077 A. However, our 
results are in closer agreement with a direct per¬ 
turbation relativistic DFT procedure [16] which 
reported a D e of 2.04 eV and a bond length of 2.58 
A. Several calculations on Au 2 have recently been 
reported [17] and they are listed in Table II. 

Table III shows the results for the trimer. The 
lowest doublet, quartet, and sextet were calcu¬ 
lated. As with the monomer and dimer, the order¬ 
ing of energies follows the multiplicity. As per our 
goal, resembling the STM tip implies the search for 
local minima with a triangular structure. There¬ 
fore, all our geometry optimizations started with a 
triangular structure. In addition, the search was 
extended to the positive and negative singlets and 
triplets. Except for the negative singlet of An 3 that 
yielded a linear structure, all the other geometries 
were triangular. For the charged ions, the singlets 
were lower in energy than the triplets. The quartet 
was the only neutral structure to have D 3h sym¬ 
metry and correspond to the one with the longest 
bond length. The doublet and triplet are stable 
with respect to dissociation into three ground-state 
Au atoms by 2.79 and 0.48 eV, respectively. The 
sextet is unstable by 7.41 eV. Only the singlet is 
stable with respect to a monomer plus dimer sys¬ 
tem by 0.88 eV. 

Table IV shows the results for the tetramer. The 
search for local minima was started with an almost 
tetrahedral structure with no constraints on its 
symmetry (C x symmetry). The ordering of ener¬ 
gies was again the same as their multiplicities; 
however, no tetrahedral geometry was found for 
the lowest singlet and triplet. The lowest quintet 
corresponds to a tetrahedral structure at 3.16 eV 


above the singlet. The singlet ended up planar in a 
triangular arrangement connecting the fourth gold 
atom in one of the vertices. The triplet afforded an 
almost perfect square with a puckered angle of 
12.9° and 0.81 eV above the singlet, and an addi¬ 
tional search constrained to D 4h symmetry yielded 
practically the same energy as the puckered one. 
The quintet was a tetrahedron. Interestingly, the 
three structures are stable with respect to four 
gold atoms in their ground state by 4.96, 4.15, and 
1.80 eV, respectively. The singlet and triplet are 
stable with respect to two dimers by 1.15 and 0.34 
eV, respectively. The quintet is unstable by 2.01 
eV. The singlet and triplet are also stable with 
respect to a ground-state trimer and a monomer 
while the quintet is not stable. We can also observe 
from Tables II—IV that the bond length of the 
doublet negative tetramer is in very close agree¬ 
ment with the experimental Au-Au gold distance 
in the Au(100) (1 X 1) surface. [15]. 


Structural and Electronic Analysis 

The point contacts of gold clusters are of 
paramount importance for single-molecule elec¬ 
tronic studies. To our knowledge, there are no ab 
initio studies dealing with the use of clusters as 
the contacts for single molecules. Most previous 
studies dealt with surfaces. Shown in Figure 1 are 
the electronic levels for the gold atom in its ground 
state. The ground state of gold is used as the 
reference to compare with the energy of its posi¬ 
tive and negative ions. The negative of the KS 
HOMO and LUMO of the ground state of the gold 
atom are also shown. The KS orbitals correspond 
to a system of fictitious noninteracting electrons 
with a density identical to the real system. There¬ 
fore, the negative of the HOMO energy corre¬ 
sponds to the energy increase of the system when 
this fictitious orbital stops being part of the nonin¬ 
teracting system due to the hypothetically instan¬ 
taneous ejection of one electron in the real system. 
The loss of one electron implies an electron relax¬ 
ation in the real and noninteractive systems that 
has, as a consequence, an increase in the total 
energy of the positive ion. This excess (A) is domi¬ 
nated by the electron-electron repulsion, which 
grows more than the kinetic and nuclear-electron 
energies combined. Similarly, the insertion of an 
electron in the real system causes the LUMO of the 
noninteractive system of the neutral atom to be 
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TABLE III_ 

Results for the gold trimer. 3 


LOWEST ENERGY STATES OF Au 


System 


Energy, H a (eV) 


Geometry (A and °) 


Structure 


Au 3 B 2 C v 


-406.55521 (0.00) 


2.642 

69.4 


2.642 

55.3 


3.007 

55.3 


Au 3 4 A 1 D 3/l -406.47020 (2.31) 


2.966 


Au 3 B,C 2v 


-406.28273 (7.41) 


2.695 

66.9 


2.695 

56.5 


2.972 

56.5 


Au 3 3 A 1 D 3/? -406.64609 (-2.47) 2.787 


Au 3 \D„ h 


-406.69050 (-3.68) 2.623 


Au 1 - Au 2 - Au 3 


Au 3 \D 3h -406.29524 (7.07) 


2.674 


Au s 3 A,C 2v 


-406.19806 (9.72) 


3.090 

50.4 


3.090 

64.8 


2.631 

64.8 


'When electronic state and symmetry of the optimized system do not correspond with the initial state the latter is indicated. 
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TABLE IV_ 

Results for the gold tetramer. 3 


System Energy, H a Energy (eV) Geometry (A and °) 


V\, C 2v Au 4 -542.11914 0.00 1-3,2-3 = 2.761 

1-2 = 2.608 
3-4 = 2.560 


3 A 1g D 4h Au 4 -542.08941 0.81 2.684 b 


Structure 


Au! 

I \ 



Au 2 


Au- 

-Au 

Au — 

-Au 


5 S 1 D zd Au 4 -542.00314 


Doublet AuJ -542.17039 


Doublet Auj -541.81813 


Quartet Au 4 -542.17848 


3.16 


-1.39 


8.19 c 


-1.61 


1-2, 3-4 = 2.979 
1-3,1-4 2-3 2-4 = 2.733 


1-3 = 2.827 

1- 4 = 2.828 

2- 4 = 2.885 


1- 3 = 2.955 

2- 3 = 2.922 
1-2 = 2.626 


1- 3 = 2.848 

2- 3 = 2.851 
1-2 = 2.852 


3-4 2.282 

1- 2 = 2.877 

2- 3 = 2.893 


1 - 4 = 2.943 

2- 4 = 2.912 

3- 4 = 2.626 


1- 4 = 2.851 

2- 4 = 2.852 

3 - 4 = 2.848 



Quartet Au 4 No convergence 

a AII initial geometries were almost perfect tetrahedrons. 

b A similar geometrical and isoenergetic conformation with a puckered angle of 12.9 was also found. 

C A lower energy structure with C 2v planar symmetry (rombic) was found with energy of -541.8205 a.u. 


occupied and, therefore, instantaneously reducing 
the energy by an amount equivalent to the nega¬ 
tive of the LIJMO energy. Since the electrons be¬ 
come denser due to the presence of the excess 
electron, the relaxation of the electrons will tend to 
spread out the electronic charge. As in the case of 


the positive ion, an increase in the total energy 
results. As expected from a physical interpretation 
of the KS orbitals, the subtraction of the HOMO 
energy from the total energy of the neutral system 
will yield an underestimation of the IP. Further¬ 
more, the addition of the LUMO energy to the 
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neutral atom will yield an underestimation of the 
energy of the negative ion. Relaxation effects will 
produce an increase of the energies in both cases. 

Therefore, DFT presents a better qualitative and 
quantitative picture than HF theory where, in most 
cases, the LUMO energy is always positive [42] 
and thus contrary to a real physical situation. Since 
the virtual KS orbitals describe electrons moving 
in the field of N - 1 electrons, they therefore lack 
the artificial upward shift of the Ffartree-Fock 
virtual orbitals [43]. In addition, using an exact 
functional, the HOMO energy is the exact ioniza¬ 
tion energy since this energy depends only upon 
the asymptotic decay of the density [44]. For fur¬ 
ther details on the Kohn-Sham orbitals see, for 
instance. Refs. [21, 23] and the works cited within. 
It is important to mention that relaxation energy 
defined here has contributions from the potential 
and kinetic energies. The potential energy arises 
from the nuclear-electron and electron-electron 
interactions and are incorporated into A, which is 
formed by the contributions of the T, V ee and V ne 
energies. Therefore, when an electron leaves the 
neutral system, at that instant, the V ee and T lower 
the total energy and the V ne increases the total 
energy. Then, in the relaxation process, the V ne has 
more effect than the V ee + T energy. Conversely, 
when an electron enters into the neutral system to 
form the negative ion, at that instant the V ee + T 
energy increases, therefore increasing the total en¬ 
ergy. Accordingly, the V ne increases in its absolute 
value, thus lowering the total energy. In this situa¬ 
tion, the V ee + T term has a dominant effect. These 
ideas are in disagreement with another study us¬ 
ing the local density approximation (LDA), which 
claims that the so-called relaxation and correlation 
effects cancel, thus Koopman's theorem can be 
used for the outer orbitals [45]. 

Table V summarizes the energetic results for the 
four gold systems. The HOMO and LUMO ener¬ 
gies follow the trend and qualitative behavior of 
the monomer. For the dimer and trimer, the effects 
of geometry relaxation have to be considered, and 
basically the IP and EA tend to be in the same 
range of values as the cluster increases in size. This 
has strong implications in the estimation of Fermi 
levels for gold obtained from small clusters calcu¬ 
lations. We also noticed that a tetrahedral geome¬ 
try is not allowed with low spin and neutral charge. 
In addition, results in Table V yield a perfect 
qualitative interpretation of the Kohn-Sham HO¬ 
MO and LUMO, allowing us to express the energy 
of the cation and anion with respect to the energy 


TABLE V_ 

Ionization potentials (IP), electron affinities (EA), 
Kohn - Sham (KS) HOMO and LUMO energies of 
Au, Au 2 , and Au 3 in eV. a 



Au 1 

> 

c 

ro 

Au 3 

IP 

9.34 

9.36 

7.07 

^HOMO 

6.77 

7.19 

5.40 

Ahomo 

2.57 

2.17 

1.57 

EA 

2.10 

1.99 

2.47 

^LUMO 

4.60 

3.79 

3.81 

Alumo 

2.50 

1.80 

1.34 


a We have not used the Au 4 because no ground-state sys¬ 
tem with tetrahedral symmetry was found. 


of the neutral, which can be written in KS-DFT as 
(see for instance Ref. [46]) 

E = T S + V ae + / V a !VJ*a> d\, 

and using the KS orbital energies, as 

E =!>,-+ TWel^) 

1 J 0 

Therefore, the energy of the cation is 

£ i ~ £ homo d- A^Q^Joj 

+ {/Vjf A )dA-A" a o}. 

and the energy of the anion 

} = | H £ i d~ ^LUMO “ ^UJMoj 

Thus 

a _ Anon int _ Aadiaba 

a HOMO — a HOMO a HOMO 

and 

a _ _ Anon int i Aadiaba 

^ LUMO — a LUMO " r ^ LUMO * 

Since A HOMO and A LUMO are always positive, the 
noninteracting changes predominate over the adia¬ 
batic changes when the system looses one electron, 
and the opposite happens when an electron is 
gained by the system. 
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An analysis of the HOMO and LUMO orbitals 
also helps in explaining the geometrical structure 
of the gold tetramers. The dimer and trimer struc¬ 
tures are straightforwardly understood. Figure 2 
shows the HOMO and LUMO orbitals for the 
singlet, triplet, and quintet lowest states of Au 4 . 
We see that in the planar structure of the singlet, 
the HOMO is localized on the gold atom outside 
of the triangle and the LUMO is localized on the 
base of the triangle. Therefore, electron conduction 
will be preferred when one electron enters from 
the base and one electron ejects from the tip. The 
reverse would be less preferred and probably only 


possible through a tunnel effect as in the STM. 
This logical way to interpret conductivity in 
molecules needs to be analyzed further since it 
would be instrumental for the design of molecular 
circuits. The HOMO and LUMO corresponding to 
the ground state of the planar tetramer are also 
show in Figure 2. In addition we have plotted the 
HOMO and LUMO of the nonoptimized tetramer 
with a T d symmetry. Clearly, the HOMO of the 
nonoptimized tetramer indicates that this corre¬ 
sponds to an antibonding orbital and therefore no 
stable structure is found. The D ih point group 
used here can end up on a T d or a D 4h structure on 


C 2v Au 4 singlet 


HOMO 


LUMO 




D 4h Au 4 triplet 


HOMO 


LUMO 



FIGURE 2. HOMO and LUMO orbitals for the optimized lowest singlet, triplet, and quintet states of Au 4 as well as for 
the nonoptimized triplet of T d symmetry. 
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T d Au 4 triplet 

HOMO 



LUMO 



D 2d Au 4 Quintet 

HOMO 

& ^ 


LUMO 





FIGURE 2. Continued 


the other end. The preferred symmetry is the D 4h . 
The HOMO and LUMO of the lowest quintet state 
is also shown in the figure. Although this structure 
has reached a local extremum, it cannot be real¬ 
istically considered because it corresponds to an 
artifact of the procedure; since several of the al¬ 
pha-occupied orbitals are above several of the 
beta-unoccupied orbitals. The orbital analysis al¬ 
lows us to conclude that there is no possibility of 
having a tetrahedral geometry for the gold te- 
tramer due to the instability associated with the 
asymetric electronic occupation of degenerate or¬ 
bitals in the T d structure of Au 4 because there is a 
distortion pathway that will lower the electronic 
energy of the molecule yielding a removal of the 
orbital degeneracy. In this case, the distortion can 
reduce the T d symmetry to C 2v , D 4/j , or D 2d point 
groups. Although strictly associated initially with 
the instability of a degenerate electronic state [47], 
the above distortion is also known as a Jahn-Teller 
effect [48]. All orbitals presented in Figure 2 corre¬ 
sponds to contours of ±0.07/bohr 3/2 , i.e., con¬ 


tours of \% probability per cubic Bohr or in classi¬ 
cal terms, contours corresponding to a electronic 
density of 0.005 electrons per cubic Bohr. 
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ABSTRACT: The results of ab initio post-Hartree-Fock study of the molecular 
structures, relative stabilities, and Hartree-Fock level calculations of IR frequencies and 
intensities for mono- and dihydrated oxo-amino-tautomers of guanine are reported. The 
geometries of the local minima were optimized without symmetry restrictions at the 
second-order Moller-Plesset perturbation theory. The standard 6-31 G(d) basis set was 
used. Local minima were verified by calculations of the matrix of energy second 
derivatives [Hessian at the HF/6-31 G(d) level]. The single-point calculations were also 
performed at the MP4(SDQ)/6-31G(d)//MP2/6-31G(d) and MP2/6-31 + + 

G(d, p)//MP2/6-31G(d) levels of theory. The total energies were corrected for the 
zero-point energy contributions scaled by a factor of 0.9. The structural parameters of 
mono- and dihydrated complexes were analyzed and related to the characteristics of 
hydrogen bonds. We found that the interaction of guanine tautomers with two water 
molecules changes the order of the gas-phase stability: 7GUA ~ 9GUA into the order 
which corresponds to the stability of the guanine tautomers in the polar solvent: 
9GUA » 7GUA. The predicted IR HF/6-31GW) level frequencies and intensities are 
presented. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 759-765, 1997 
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Introduction 

T he structural characteristics of the isolated 
DNA and RNA bases and the prototypic 
molecules have been the subject of extensive spec¬ 
troscopic studies in recent years. Most of the struc¬ 
tural characteristics have been obtained using 
infrared spectroscopy and high-resolution mi¬ 
crowave studies. These investigations are always 
more relevant if they are performed in parallel 
with ab initio quantum chemical calculations, 
which, at the correlated level with sufficiently large 
basis sets, are able to reproduce such structural 
characteristics as geometry and IR frequencies with 
the same accuracy as those of experiments (see 
e.g., [1]). Examples of such spectroscopic investiga¬ 
tions coupled with ab initio quantum chemical 
calculations can be found in [2]. 

Currently, the focus of these combined investi¬ 
gations is strongly shifted toward the study of 
isolated intermolecular complexes with water 
molecules. It is so natural because the hydration of 
the DNA and RNA bases plays a special role in 
determining the three-dimensional structure of 
these kinds of biopolymers. The available litera¬ 
ture on this subject consists of investigations of 
only the pyrimidine type of DNA bases and their 
prototypic molecules [3], To fill the gap formed by 
a lack of investigations of hydrated puring bases, 
we initiated ab initio calculations of the two tau¬ 
tomers of guanine: the oxo-amino form of the 
N(9)H tautomer (9GUA) and the oxo-amino form 
of the N(7)H tautomer (7GUA), which are the 
lowest minima on the potential surface of isolated 
guanine tautomers [4]. 

Before discussing the objective of this article, 
justification for considering the mono- and dihy¬ 
dra ted species of guanine tautomers is in order. 
We are interested only in the cyclic forms of these 
complexes because of the existence of both experi¬ 
mental and theoretical data which support the idea 
that similar structures are the most important min¬ 
ima on the potential surface of the water-DNA 
base complexes: 

(i) According to the X-ray data of guanidine 
monohydrate [5], the water molecule is 
located in the vicinity of the carbonyl 
oxygen; 

(ii) according to the calculations of formamide 
and acetamide-water complexes [6,7], the 


> C=0 • • H 2 0 • • HN < bifurcated struc¬ 
ture is really the global minimum; and 

(iii) recently, it was concluded from the studies 
of rotationally resolved fluorescence excita¬ 
tion spectrum [3a, b] that the mono- and 
dihydrated complexes of pyridone have the 
same geometrical structure as that as¬ 
sumed for the 7(9)GUA • H 2 0 and the 
7(9)GUA -2H 2 0 species in this study. 

The purpose of this article was to provide accu¬ 
rate theoretical data for the studied species. The 
following properties of the mono- and dihydrated 
guanine species are predicted: (1) the geometry 
and rotational constants of the mono- and dihy¬ 
drated 7(9)GUA species; (2) the stability of these 
complexes and a comparison with the stability of 
isolated species; and (3) IR frequencies of mono- 
and dihydrated 7(9)GUA species and a compari¬ 
son with the spectra of isolated components. 


Computational Methods 

The ab initio LCAO-MO method [8] was used to 
study the interaction of guanine tautomers with 
water molecules. The calculations were carried out 
with the Gaussian-92 program [9]. The standard 
6-31GW) basis set was used. All the geometries 
were optimized without symmetry restrictions (C a 
symmetry was assumed) by the gradient proce¬ 
dure [10] initially at the HF level and subsequently 
at the second-order Moller-Plesset perturbation 
theory [11], Local minima were verified by estab¬ 
lishing that the matrix of energy second deriva¬ 
tives (Hessian) [at the HF/6-31G(d) level] has only 
positive eigenvalues. The single-point calculations 
were performed at the MP4(SDQ)/6- 
31G(d)//MP2/6-31G(d) and MP2/6-31 + + 
G(d, p)//MP2/6-31G(d) levels of theory. The total 
energies were corrected for the HF/6-31G(d) level 
with a zero-point energy contribution scaled by a 
factor of 0.9. 


Results 

The calculated geometrical parameters for 
mono- and dihydrated complexes of 7(9)GUA are 
presented in Figure 1. Tables I and II present the 
computed rotational constants along with selected 
intermolecular parameters for 7(9)GUA • H 2 0 and 
7(9)GUA • 2H 2 0. The data corresponding to the 
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A 



FIGURE 1 . Structural parameters of mono- and dihydrated oxo-amino-guanine complexes. (A) Up indexes: 
monohydrated oxo-amino-N(9)H guanine; down indexes: monohydrated oxo-amino-N(7)H guanine. (B) Up indexes: 
dihydrated oxo-amino-N(9)H guanine; down indexes: dihydrated oxo-amino-N(7)H guanine. 


relative stability of isolated and hydrated guanine 
species together with computed dipole moments 
are collected in Table III. The computed IR fre¬ 
quencies are given in Tables IV and V. 


Discussion 

GEOMETRY OF 7(9)GUA * H 2 0 AND 
7(9)GCA 2 H 2 0 SPECIES 

An analysis of the structural parameters in hy¬ 
drogen-bonded systems A—H • * • B, where A and 

TABLE I_ 

MP2/6-31G(d) rotational constant (A, B, C, GHz) 

and selected intermolecular parameters (A) for the 
monohydrated 7(9)-guanine complexes. 


9GUA•H 2 0 7GUA * H 2 0 


A 

1.47278 

1.46761 

B 

0.74504 

0.74764 

C 

0.49581 

0.49647 

RN 10 O 17 

2.822 

2.811 

R0 14 0 17 

2.818 

2.820 


B are heavy atoms which are involved in hydrogen 
bonding, is crucial for an understanding of the 
reasons for proton transfer in DNA bases. Accord¬ 
ing to the classification of nearly linear hydrogen 
bonds [12] (AHB more than 165°), the very low 
values of the proton-transfer barriers take place 
when the A—B distance is less than 2.5 A (A and 
B are oxygens and/or nitrogens atoms). In this 
case, the two heavy atoms are so close to each 

TABLE II_ 

MP2/6-31G(d) rotational constant (A, B, C, GHz) 

and selected intermolecular parameters (A) for the 
dihydrated 7(9)-guanine complexes. 



9GUA•H 2 0 

7GUA•H 2 0 

A 

1.23260 

1.22306 

B 

0.524809 

0.52490 

C 

0.37245 

0.37073 

RNi 0 0 17 

2.868 

2.845 

RO17O20 

2.719 

2.752 

RO 14 o 20 

2.794 

2.804 


The O — O distance in (H 2 0) 2 is 2.915 A at the MP2/ 
6-31 G(d) level of theory. 
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TABLE III_ 

Relative stability (kj mol ~ ^ and dipole moments (D) of guanine tautomers and their mono- and dihydrated 
species. 


Species 

MP4(SDQ)/ 

6-31G (of)// 

MP2/ 

6-31 G(d) 

MP2/ 

6-31 + +G(d)// 

MP2/ 

6-31 G(d) 

MP2/ 

G-31G(d)// 

MP2/ 

6-31 G(d) 


9GUA 

0.0 

0.0 

0.0 

7.0 

7GUA 

1.6 

1.8 

-0.2 

1.6 

9GUA•H 2 0 

0.0 

0.0 

0.0 

5.7 

7GUA * H 2 0 

-4.1 

0.5 

—2.3 

1.6 

9GUA *2H 2 0 

0.0 

0.0 

0.0 

4.9 

7GUA•2H 2 0 

6.8 

2.7 

4.8 

2.8 


Evaluated from MP2-optimized geometry using HF density. 


other that the zero-point energy level could appear 
even under the barrier. In the case of the A—B 
distance between 2.8 and 3.0 A, the value of the 
proton-transfer barrier is much higher. This obser¬ 
vation is the reference point for our analysis of the 
structural data (Fig. 1 and Tables I and II) of 
mono- and dihydrated tautomers of 9(7) guanine. 

According to the data of previous theoretical 
studies and experimental observations [3a, b], the 
influence of the interaction with one and/or two 
water molecules on the bond distances and bond 
angles of prototypic molecules manifests itself 
mainly in the region of intermolecular hydrogen 
bonding. Our data (see Fig. 1 and Tables I and II) 
completely confirm this point. All other structural 
parameters of the heterocyclic rings are vitually 
the same [except for the N 2 —C 6 "single" bond of 
9(7)GUA tautomers which is also under the influ¬ 
ence of hydrogen bonding] and depend only on 
the chemical structure of tautomers. 

The addition of the second water molecule 
makes a noticeable change in the geometrical 
structure of the hydrogen-bonded complexes. All 
of them become much more linear compared to 
the monohydrated complexes; the distance be¬ 
tween 0 14 and O 20 of dihydrated complexes is 
much shorter than the corresponding distance 
0 14 —0 17 of the monohydrated complexes. Also, 
the distance between 0 17 and O 20 is much shorter 
than the interatomic distance in a free water dimer 
(see data in Tables I and II). All the observed 
trends for the 9(7)GUA ■ H 2 0 and 9(7)GUA * 2H 2 0 
complexes are congruous with the available exper¬ 
imental and calculated [3a, b] data of the mono- 
and dihydrated complexes of pyridone. Thus, 


based on geometrical characteristics, one might 
expect that the participation of two water 
molecules in a proton-transfer reaction between 
the N 1 and 0 14 atoms of the guanine molecule 
could be more favorable than is the participation 
of only one water molecule. 

RELATIVE STABILITIES 

The relative stabilities of the isolated and sol¬ 
vated guanine tautomers have been predicted by 
different levels of theory up to an MP4(SDQ)/6- 
311 + +G(rf, p)//MP2/6-31G(d) approximation 
[13]. In Table III, the calculated stabilities of iso¬ 
lated tautomers are in complete agreement with 
the data in [13]. 

Our present discussion will concentrate on an 
evaluation of the relative stability of the hydrated 
complexes. For the isolated tautomers of guanine, 
the following stability pattern is established: 

7GUA ~ 9GUA. (1) 

This relationship (1) corresponds well with the 
available experimental data for the species in the 
gas phase [4]. According to the data of experimen¬ 
tal IR studies, 7GUA is probably the most stable 
tautomer. The tendency for the 7GUA form of 
guanine monohydrates to stabilize exists. Based on 
the data presented in Table III, the same stability 
pattern in obtained for complexes of guanine with 
one water molecule: 

7GUA • H 2 0 « 9GUA • H 2 0. (2) 

The described peculiarity of monohydrated com¬ 
plexes is changed for the dihydrated isolated corn- 
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TABLE IV_ 

Calculated IR frequences (FREQ, in cm - ^ 
and absolute intensities (INT, km mol " 1 ) for 
9-guanine species. 


9-Guanine 
FREQ INT 

9GUA- 

FREQ 

H 2 0 

INT 

9GUA- 

FREQ 

2H 2 0 

INT 

152 

6 

50 

6 

34 

4 

180 

2 

110 

18 

57 

1 

218 

10 

153 

2 

59 

4 

339 

17 

167 

1 

147 

18 

355 

93 

186 

6 

157 

4 

366 

7 

218 

25 

164 

1 

392 

1 

247 

90 

192 

4 

522 

7 

351 

6 

199 

100 

573 

43 

370 

38 

220 

12 

576 

109 

381 

63 

237 

4 

608 

293 

387 

73 

315 

129 

654 

109 

398 

59 

356 

9 

679 

8 

526 

57 

367 

11 

721 

10 

562 

256 

385 

1 

729 

30 

576 

196 

391 

7 

790 

60 

581 

57 

430 

27 

823 

13 

676 

256 

496 

349 

871 

90 

690 

196 

533 

9 

908 

15 

719 

2 

550 

91 

969 

4 

735 

31 

575 

168 

1033 

11 

771 

2 

584 

6 

1147 

13 

815 

48 

680 

176 

1162 

12 

865 

6 

700 

142 

1192 

22 

894 

202 

719 

6 

1251 

21 

909 

40 

736 

54 

1282 

59 

970 

5 

777 

9 

1425 

5 

1032 

13 

819 

22 

1457 

54 

1158 

15 

830 

353 

1478 

62 

1162 

7 

870 

63 

1535 

22 

1206 

10 

905 

26 

1577 

8 

1262 

10 

922 

180 

1687 

26 

1288 

70 

969 

5 

1736 

112 

1428 

11 

1032 

15 

1781 

464 

1466 

11 

1158 

17 

1792 

111 

1513 

189 

1162 

6 

1842 

536 

1536 

16 

1219 

7 

1999 

901 

1578 

9 

1266 

5 

3452 

4 

1702 

45 

1295 

71 

3814 

69 

1743 

76 

1430 

17 

3849 

76 

1787 

305 

1465 

11 

3909 

112 

1797 

148 

1505 

151 

3927 

56 

1840 

638 

1536 

26 



1854 

111 

1572 

16 



1959 

1001 

1704 

44 



3451 

5 

1745 

59 



3732 

305 

1786 

304 



3820 

90 

1804 

144 



3908 

114 

1838 

796 



3937 

61 

1856 

89 


TABLE IV. 
(Continued) 


9-Guanine 
FREQ INT 

9GUA- 

FREQ 

H 2 0 

INT 

9GUA* 

FREQ 

2H 2 0 

INT 


3968 

217 

1866 

99 


4144 

125 

1938 

1073 




3450 

5 




3761 

278 




3811 

168 




3887 

379 




3909 

125 




3930 

637 




3944 

71 




4132 

110 




4145 

135 


plexes which display the quite usual tendency to 
approach the stability of fully hydrated species: 

9GUA • 2H 2 0 > 7GUA • 2H 2 Q. 


IR FREQUENCIES 

The present IR frequencies are intended to guide 
experimental investigators in the study of the title 
species. As can be expected from the calculated 
data, it is clear that the major differences between 
the spectra of isolated species and complexes are 
the intermolecular vibration modes (50-300 cm" 1 ) 
and the valence sym and antisym OH and NH 
vibrations (3400-4100 cm -1 ). The predicted shifts 
of vibrational frequencies upon the complexation 
can be applied to an identification of these species 
in the isolated-matrix experiments. Although the 
HF/6-31G(i) level harmonic frequencies overesti¬ 
mate the experimentally observed unharmonic 
spectrum by approximately 10%, the differences 
between predicted and observed shifts are usually 
not larger than 10-20 cm -1 . 


Conclusion 

In this article, the stability, the harmonic IR 
frequencies, and the structural parameters at the 
post-Hartree-Fock level are revealed. The princi¬ 
pal conclusions from this study are the following: 

1. The structural parameters of mono- and di- 
hydrated oxo-amino-guanine tautomers cal¬ 
culated at the MP2/6-31G* level permits the 
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TABLE V_ 

Calculated IR frequences (FREQ, in cm" 1 ) and 
absolute intensities (INT, km mol }1 ) for 
7-guanine species. 


7-Guanine 
FREQ INT 

7GUA • 
FREQ 

H 2 0 

INT 

7GUA • 
FREQ 

2H 2 0 

INT 

160 

22 

49 

2 

33 

1 

169 

6 

111 

11 

57 

2 

213 

3 

158 

20 

59 

1 

324 

1 

174 

1 

147 

10 

344 

110 

192 

5 

160 

7 

366 

1 

251 

59 

167 

11 

406 

1 

287 

74 

180 

4 

509 

133 

333 

41 

202 

114 

523 

3 

352 

24 

219 

7 

586 

5 

387 

31 

235 

3 

644 

84 

394 

53 

317 

134 

669 

205 

422 

118 

345 

33 

685 

56 

451 

71 

365 

4 

702 

44 

544 

16 

385 

4 

733 

21 

583 

276 

399 

1 

814 

51 

608 

30 

433 

43 

824 

49 

665 

172 

484 

138 

869 

100 

680 

1 

508 

74 

907 

14 

694 

150 

526 

104 

1027 

4 

746 

25 

553 

294 

1041 

1 

778 

1 

594 

13 

1135 

19 

832 

21 

678 

197 

1203 

5 

848 

161 

691 

2 

1223 

40 

896 

87 

699 

104 

1259 

14 

908 

43 

741 

41 

1306 

85 

1027 

6 

798 

9 

1408 

25 

1050 

3 

811 

10 

1452 

19 

1143 

41 

836 

346 

1482 

83 

1194 

57 

869 

63 

1547 

233 

1229 

17 

905 

14 

1607 

64 

1259 

28 

925 

187 

1650 

3 

1336 

78 

1033 

3 

1710 

1 

1369 

65 

1038 

1 

1755 

304 

1461 

113 

1151 

20 

1804 

72 

1504 

186 

1211 

21 

1854 

602 

1551 

208 

1234 

16 

1970 

1003 

1591 

251 

1265 

13 

3450 

6 

1619 

84 

1319 

67 

3799 

56 

1673 

169 

1410 

45 

3851 

79 

1702 

60 

1469 

5 

3908 

49 

1787 

618 

1502 

192 

3913 

142 

1812 

128 

1547 

191 



1832 

156 

1608 

74 



1871 

508 

1661 

66 



3443 

7 

1720 

29 



3825 

51 

1756 

166 



3835 

608 

1811 

46 



3922 

135 

1849 

843 



3944 

46 

1855 

129 


TABLE V _ 
(Continued) 


7-Guanine 
FREQ INT 

7GUA • 
FREQ 

H 2 0 

INT 

7GUA ■ 
FREQ 

2H 2 0 

INT 


3987 

311 

1864 

67 


4144 

156 

1917 

1132 




3447 

7 




3734 

381 




3811 

116 




3893 

348 




3917 

131 




3934 

711 




3937 

10 




4134 

111 




4146 

136 


classification of the intermolecular hydrogen 
bonds as rather weak. 

2. It is found that the interaction of guanine 
tautomers with two water molecules changes 
the order of gas-phase stability (7GUA 9GUA) 
into the order which corresponds to the sta¬ 
bility of the guanine tautomers in a polar 
solvent (9GUA » 7GUA). 

3. The main difference in the calculated IR fre¬ 
quences of isolated, monohydrated, and di~ 
hydrated species are found in the region of 
valence sym and antisym vibrations of OH 
and NH bonds and in the region related to 
the intermolecular vibrations. 
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ABSTRACT: Molecular orbital calculations were carried out for the dimerization step 
in the polymerization process of vinyl acetate and styrene through free radicals and ionic 
mechanisms. The calculations were performed for monomers, dimers, their positive and 
negative ions, and free radicals. The minimum-energy geometry is achieved in all cases, 
the geometrical and electronic parameters are analyzed, a dimerization reaction pathway 
is proposed, and the heats of polymerization obtained are in excellent agreement with 
experimental data. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 767-785, 1997 


Introduction 

I n polymers, the sequence distribution, tacticity, 
and, in general, the intramolecular structure 
are important, because they may suply informa¬ 
tion about the monomer addition process, e.g., 
about the preference of monomers to add in the 
iso- or syndiotactic configuration [1,2]. Moreover, 
knowledge about the intramolecular structure is of 
paramount importance for the understanding of 
the relations between structure and polymer prop¬ 
erties [3] and there is significant interest in charac¬ 
terizing polymeric materials, especially blends at 
the molecular level. This interest is due, in part, to 
the tremendous technological importance of poly- 

International Journal of Quantum Chemistry, Vol. 65, 767-785 (1997) 
© 1997 John Wiley & Sons, Inc. 


mer blends. The bulk, macroscopic properties of 
such materials are determined by the microscopic 
structure that, in turn, is critically dependent upon 
the degree of molecular mixing between the blend 
components. The peculiar properties of polymer 
materials arise from differences between the ther¬ 
modynamic interactions in systems containing 
macromolecules and those systems of only small 
molecules. 

The nature of intermolecular interaction deter¬ 
mines the microscopic structure in any 
condensed-phase system. However, when the sys¬ 
tem contains macromolecules, there is additional 
complexity due the intramolecular considerations; 
the number of configurations available to a poly¬ 
mer chain, although restricted by covalent bond 
geometries, is vast. Thus, the morphology of a 
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chain involves the interplay between the interac¬ 
tions of the polymer with its environment and the 
possible chain conformations. Since the interac¬ 
tions between the chains of a polymer may play an 
important role in defining the ultimate properties 
of these polymers, it is desirable to investigate 
quantitatively the structure of isolated polymer 
chains. The geometry of the dimers may be impor¬ 
tant in defining the interactions between neighbor¬ 
ing chains of the polymer and in the monomer 
addition mechanism. To understand better the ge¬ 
ometry of these dimers, model compounds of 
poly(vinyl acetate) and polystyrene were investi¬ 
gated. 

In this work, we used vinyl acetate (VA) and 
styrene (ST) dimer bond dissociation energies to 
model the reverse reaction of a chain-growth poly¬ 
merization, predicting the heat of polymerization. 
Also, we investigated the evolution of the confor¬ 
mations and the binding process when a monomer 
approaches the active species, an ion or a free 
radical. In other words, we explored a reaction 
coordinate. 


We present results of semiempirical and ab ini¬ 
tio molecular orbital calculations for 2,4-diacetate 
butane as a model compound of poly(VA) and 
2,4-diphenylbutane as model substance of PS. All 
calculations were performed with the system of 
programs Gaussian94 [4]. Two levels of theory 
were used: semiempirical calculations with the 
Austin model 1 Hamiltonian (AMI) [5] and self- 
consistent-field calculations using unrestricted 
open-shell Hartree-Fock wave functions (UHF) [6] 
with two different Gaussian orbital basis sets: 3- 
21G* [7] and 6-31G* [8]. 

Using geometrical parameters obtained with ab 
initio methods for small systems, the geometry 
optimization of bigger molecules could be carried 
out using semiempirical methods like AMI with¬ 
out unduly high computer and memory require¬ 
ments. The AMI method embodies the approxima¬ 
tions that are inherent to the neglect of diatomic 
differential overlap (NDDO) formalism [9] and 
overcome the major weakness of the modified ne¬ 
glect of differential overlap (MNDO) method [10] 
on which it is based. It is known that AMI pro- 



FIGURE 1. Dimer of VA; structure optimized with the HF/6-31G* basis set. The numbering is used throughout 
the text. 
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duces fairly accurate equilibrium geometries, or¬ 
bital orderings, and partial charges [11]. 


Methodology 

The minimum-energy conformation for the 
monomers of VA and ST were obtained in a previ¬ 
ous work [12]. Taking as a basis those geometries, 
model compounds to represent dimers were 
formed; in each case, the initial geometry corre¬ 
sponds to an isotactic all -trans conformation. 

With those geometries, an optimization process 
was initiated. In the first step, full-optimization 
calculations were made for each dimer to find the 
global minimum. With the resulting geometries, 
rotational barriers were calculated, taking as rota¬ 
tion axes the skeletal bonds between heavy atoms. 
The torsion angles were changed with increments 
of 20° in each particular case. For each point, all 
the non-hydrogen geometrical parameters were 
varied simultaneously. 

To form the free radical of each dimer, one 
hydrogen atom was removed from the optimized 
VA dimer and ST dimer models, and by assigning 
a positive or negative charge, the corresponding 
cations or anion were formed. These processes 
were made with the optimization of the geometri¬ 
cal parameters for the atoms nearest to the site 
where the hydrogen atom was removed. In the 
next step, the bond between monomeric units was 
broken; it was done by changing the C 2 —C 3 (see 
Fig. 1) bond distance for the VA dimeric species 
and the C 4 —C 6 (see Fig. 6) bond distance for the 
ST dimeric species. This bond enlargement was 
made in steps of 0.2 A from the equilibrium dis¬ 
tances to 4.5 A. In this step, two types of calcula¬ 
tion were made: In the first, only the geometrical 
parameters of the atoms nearest to the bond-break¬ 
ing site were optimized, and in the second one, all 
the geometrical parameters were optimized. A 
schematic view of the process is 

(a) A—A A—A* + H’ 

(b) A—A' A + A* 

(c) A—A + -> A + A + (1) 

(d) A—A —> A + A , 

where A—A represents a dimer molecule formed 
by two monomers of A, A— A* is the free radical 


of the dimer, A—A + is a cationic dimer, and 
A—A“ is an anionic dimer. 

The minimum-energy conformation and total 
energy for the different monomeric (A, A, A + , 
A") and dimeric (A—A, A— A*, A—A + , A—A~) 
species were found by allowing all atoms to move. 
The bond dissociation energy can be calculated in 
two different ways: as the difference between the 
sum of the calculated total energies for the right- 
side species in its minimum-energy conformation 
and the total energy of the left-side compound also 
in its minimum energy conformation, and from the 
potential energy curves for the dissociation of VA 
and ST dimeric species [Eqs. (Ib)—(Id)], taking the 
difference between the energy of the optimized 
dimeric species (initial point in the curve) and the 
energy value where the curve becomes asymptotic. 


TABLE I_ 

Dimer of VA z-matrix input for the Gaussian-94 
program; the numbers on the atoms correspond 
to Figure 1. 

Dinner of VA 


H 


c 

1 

R2 





c 

2 

R3 

1 

A3 



H 

2 

R4 

1 

A4 

3 

D4 

C 

2 

R5 

1 

A5 

3 

D5 

H 

5 

R6 

2 

A6 

1 

D6 

0 

5 

R7 

2 

A7 

6 

D7 

H 

5 

R8 

2 

A8 

6 

D8 

H 

3 

R9 

2 

A9 

1 

D9 

C 

3 

R10 

2 

A10 

9 

D10 

0 

3 

R11 

2 

All 

9 

Dll 

C 

7 

R12 

5 

A12 

2 

D12 

H 

10 

R13 

3 

A13 

2 

D13 

H 

10 

R14 

3 

A14 

13 

D14 

H 

10 

R15 

3 

A15 

13 

D15 

C 

11 

R16 

3 

A16 

2 

D16 

0 

12 

R17 

7 

A17 

5 

D17 

C 

12 

R18 

7 

A18 

17 

D18 

0 

16 

R19 

11 

A19 

3 

D19 

C 

16 

R20 

11 

A20 

19 

D20 

H 

18 

R21 

12 

A21 

7 

D21 

H 

18 

R22 

12 

A22 

21 

D22 

H 

18 

R23 

12 

A23 

21 

D23 

H 

20 

R24 

16 

A24 

11 

D24 

H 

20 

R25 

16 

A25 

24 

D25 

H 

20 

R26 

16 

A26 

24 

D26 
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The heat of dissociation obtained in this way must 
be the negative of the polymerization heat in terms 
of monomer unit added. The minimum-energy ge¬ 
ometry and the total energy for each radical, cation, 
and anion and monomeric and dimeric species 
were found by full optimization. 


Results and Discussion 

VINYL ACETATE 

In previous studies of the VA monomer [12], we 
reported an energetically more favorable confor¬ 
mation. Using that geometry, the dimer of VA was 
formed. This model compound has the vinyl 
groups in a parallel position perpendicular to the 
skeletal sequence, and this conformation corre¬ 
sponds to an isotactic all-trans. The various atoms 
have been numbered (Fig. 1) and will hereafter be 
referred to by these numbers. With this initial 
geometry, a calculation changing simultaneously 


all the geometric parameters was made. With this 
new geometry and taking the bond C 2 —C 5 as the 
axis of rotation, the rotational barrier was calcu¬ 
lated from 0° to 360° with angle increments of 20°; 
0° corresponds to having the bond C 5 —C 7 cis to 
the bond C 2 —C 3 . In each point, the position of the 
acetate groups atoms was optimized. From this 
rotational barrier, the minimum-energy conforma¬ 
tion was taken as the initial one for the next 
rotational barrier calculation, this time through the 
bond C 2 —C 3 . Optimization of the geometrical pa¬ 
rameters for the atoms neighboring this bond was 
made. Again, with the new minimum-energy con¬ 
formation, a full-optimization calculation was car¬ 
ried out. Table I defines the notation we use for 
the internal coordinates of the VA dimer in the 
format of the z-matrix input for the program 
Gaussian 94. The UHF/6-31G*-optimized values 
of the intermolecular distances, Rj, the bond an¬ 
gles, A], and the dihedral angles, Dj, are given in 
Tables II, III, and IV, respectively. The first two 


TABLE II_ 

Internuclear distances for the different dimeric species of VA, calculated with the UHF/6-31G(d) method. 



Initial 

parameters 

Optimized 

parameters 

Anion 

Cation 

Free radical 

R2 

1.08105 

1.084 

1.0884 

1.0817 

1.0846 

R3 

1.52795 

1.5269 

1.5366 

1.5431 

1.5268 

R4 

1.0815 

1.0835 

1.0898 

1.0932 

1.0885 

R5 

1.52584 

1.5211 

1.4954 

1.473 

1.4959 

R6 

1.0837 

1.0857 

1.0761 

1.0816 

1.0783 

R7 

1.44419 

1.4118 

1.3928 

1.2428 

1.3696 

R8 

1.08037 

1.0826 




R9 

1.08208 

1.083 

1.0866 

1.0839 

1.0831 

RIO 

1.52993 

1.5237 

1.5296 

1.5237 

1.5228 

R11 

1.4502 

1.4202 

1.3786 

1.3976 

1.4196 

R12 

1.36113 

1.3365 

1.3295 

1.5175 

1.3455 


1.08371 

1.0844 

1.0804 

1.0823 

1.0846 

R14 

1.08271 

1.085 

1.0908 

1.0883 

1.0845 

R15 

1.08225 

1.0842 

1.0893 

1.0836 

1.084 

R16 

1.36081 

1.3354 

1.486 

1.3618 

1.3353 

R17 

1.1968 

1.1826 

1.1876 

1.1522 

1.1803 

R18 

1.51217 

1.5117 

1.511 

1.4847 

1.5092 

R19 

1.19753 

1.1832 

1.2647 

1.1745 

1.1832 

R20 

1.51332 

1.5131 

1.5217 

1.5086 

1.513 

R21 

1.07769 

1.079 

1.0801 

1.0794 

1.0792 

R22 

1.08359 

1.0841 

1.084 

1.0834 

1.0848 

R23 

1.08314 

1.0844 

1.0828 

1.0834 

1.0834 

R24 

1.08328 

1.0836 

1.0885 

1.0835 

1.0835 

R25 

1.08254 

1.0847 

1.0948 

1.0851 

1.0848 

R26 

1.07822 

1.079 

1.0875 

1.0787 

1.0791 
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TABLE III_ 

Bond angles for the different dimeric species of VA, calculated with the UHF/6-31G(cO method. 



Initial 

parameters 

Optimized 

parameters 

Anion 

Cation 

Free radical 

A3 

109.20119 

109.3892 

109.5052 

111.3468 

109.4869 

A4 

107.68760 

107.0329 

106.1593 

107.4943 

106.5460 

A5 

109.67980 

109.5263 

110.0410 

110.3933 

109.3951 

A6 

110.86996 

110.6728 

120.5230 

122.9585 

119.9846 

A7 

105.08688 

106.4235 

117.4401 

120.6877 

112.5518 

A8 

111.52025 

111.2892 




A9 

110.00961 

108.8651 

108.8560 

109.0228 

108.8957 

A10 

113.59662 

114.0690 

112.3991 

111.9307 

113.2548 

All 

104.15747 

105.1302 

107.0583 

102.8399 

105.3686 

A12 

124.38530 

123.3404 

122.2633 

119.8762 

123.1970 

A13 

110.80390 

110.8378 

107.6032 

110.7637 

111.0958 

A14 

111.23519 

111.3142 

112.5814 

111.4145 

110.9288 

A15 

109.16586 

110.2305 

110.2342 

110.3348 

109.9922 

A16 

125.74818 

125.2239 

114.661 

125.4188 

125.1943 

A17 

119.24141 

119.1289 

119.7086 

116.5776 

118.7749 

A18 

116.88147 

118.1824 

117.5256 

107.7608 

117.6845 

A19 

119.29680 

119.0126 

116.6171 

117.5297 

119.0181 

A20 

117.31459 

118.8165 

104.436 

118.3378 

118.7979 

A21 

107.87529 

107.9146 

108.1808 

108.3815 

107.9812 

A22 

111.03209 

111.4270 

110.5046 

109.4000 

110.6888 

A23 

110.86750 

111.0104 

111.1130 

109.3901 

111.2064 

A24 

111.64095 

112.0892 

112.1481 

112.2392 

112.1051 

A25 

110.65525 

110.7329 

111.3839 

110.7128 

110.7063 

A26 

107.61335 

107.5731 

109.1456 

107.5398 

107.5774 


column data correspond to the initial (AMI-opti¬ 
mized) and UHF/6-31G*-optimized values for the 
VA dimer, respectively. The last three data 
columns give the UHF/6-31G*-optimized parame¬ 
ters values for the anion, cation, and free-radical 
dimeric species. By adopting the convention given 
by Flory and coworkers [13], the final geometry 
corresponds to a tg conformation. Figure 1 shows 
the VA dimer final conformation. 

With the minimum-energy conformation for the 
dimer obtained in this way, the bond distance 
c 5 -h 8 was changed by increments of 0.2 A from 
the equilibrium value to 5 A. This bond enlarge¬ 
ment is equivalent to the dimer dissociation in one 
hydrogen and one dimer free radical. This dissoci¬ 
ation process was made with optimization of the 
geometrical parameters for the atoms nearest to 
the bond which is being broken. In Table III it is 
possible appreciate how the bond angles C 2 — 
C 5 —0 7 and C 2 —C 5 —H 6 change from almost te¬ 
trahedral in the initial conformation to almost trig¬ 


onal when the system is dissociated, and in Table 
V, how the electronic densities for the atoms 
neighboring C 5 are substantially changed. With 
the dimer free-radical geometry obtained this way 
and the program parameter charge made equal to 
zero, one, or minus one, the geometries and heats 
of formation were obtained for the free-radical, 
cation, and anion dimeric species, respectively. 

For each one of these three species, the C 2 —C 3 
bond distance was changed from the equilibrium 
value to 4.2 A with increments of 0.3 A. Figure 2 
shows the UHF/3-21G* potential energy curves 
obtained for the dissociation of the dimeric species. 
The dissociation products are one monomer and 
one ionic or free-radical monomeric species. 
Around the distance of 2.0 A, the energy is a 
maximum, which means that the dimeric species 
are dissociated and the C 2 —C 3 bond is almost 
broken. However, between 2.0 and 4 A, some 
attractive interactions occur principally between 
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TABLE IV_ 

Dihedral angles for the different dimeric species of VA, calculated with the UHF/6-31G(d) method. 


Atom 

Initial 

parameters 

Optimized 

parameters 

Anion 

Cation 

Free radical 

D4 

-118.03805 

-117.3563 

-116.4926 

-118.5230 

-117.2242 

D5 

124.37077 

125.3241 

125.3115 

128.0308 

124.1003 

D6 

-177.00407 

-176.0425 

-167.6353 

-165.8375 

-166.8072 

D7 

-119.10359 

-119.5823 

-141.3371 

-177.6376 

-141.3146 

D8 

121.47613 

120.4413 




D9 

-179.21453 

-177.4649 

-181.4689 

-176.8852 

-180.4869 

DIO 

-124.41414 

-122.9400 

-119.3468 

-122.5343 

-122.6618 

Dll 

118.08135 

117.7678 

119.0362 

118.2071 

117.8713 

D12 

-178.27760 

-177.9501 

-95.9448 

-182.6225 

-167.3026 

D13 

-175.48810 

-174.8035 

-178.4848 

-176.9629 

-177.8408 

D14 

-120.13487 

-119.7525 

-120.2503 

-119.3246 

-120.1987 

D15 

119.16572 

119.4663 

118.6905 

119.5042 

119.5933 

D16 

-151.51809 

-146.3294 

-145.8997 

-150.9679 

-147.3915 

D17 

-178.93995 

-178.8368 

-182.3227 

-359.1194 

-175.9514 

D18 

-179.80448 

-179.8345 

-180.9907 

-180.0441 

-179.6381 

D19 

-166.59531 

-171.7614 

-28.5634 

-171.9048 

-171.9182 

D20 

-178.31833 

-178.4059 

-134.4832 

-178.1465 

-178.3473 

D21 

-179.42262 

-178.4141 

-179.1716 

-179.9199 

-186.8656 

D22 

119.81004 

120.029 

120.3031 

120.6593 

119.2167 

D23 

-119.65612 

-119.6049 

-120.6860 

-120.6502 

-120.7299 

D24 

-62.28269 

-58.5701 

-51.6634 

-58.0384 

-58.2904 

D25 

120.90497 

120.7807 

120.8520 

121.2944 

120.7116 

D26 

-119.74459 

-120.1251 

-120.4455 

-119.9548 

-120.1626 


O z and the hydrogens H a and H 4 . In Tables II-TV 
are the geometrical parameters obtained for the 
dimer anion, cation, and free radical of the dissoci¬ 
ated VA dimer. Notice the geometry adopted by 
the monomeric species. 

Figures 3 and 4 show the evolution of the Mul- 
liken and electrostatic potential net atomic charges 
calculated for the skeletal atoms as a function of 
the C 2 —C 3 bond distance for the cation of the 
dimer when calculated with the UHF/3-21G(d) 
method. It is necessary to point out the sign and 
magnitude of the charges on the C 3 and C 5 atoms. 
In the final state, C 3 has a positive net atomic 
charge because when the C 2 —C 3 bond is broken a 
monomeric cation with an electron-deficient car¬ 
bon atom is formed. The cationic character is on 
C 3 . On the other hand, the other dissociation prod¬ 
uct is a monomeric unit with a double bond be¬ 
tween C 2 and C 5 . This double bond is evident 
from the net atomic charges on the C 2 and C 5 
atoms which have Mulliken atomic charge nega¬ 


tive values. Figure 4 shows how the atomic charge 
values calculated from the electrostatic potential 
have almost the same behavior in respect to the 
C 2 —C 3 bond distance. The geometrical parame¬ 
ters for the dimeric species, neutral, anionic, 
cationic, and free radical, of the VA are shown in 
Tables II-IV. Notice the monomer geometry which 
is planar and the nonplanar geometry for the 
monomeric species. Figure 5 shows for the free- 
radical dimer the geometry adopted by the dissoci¬ 
ation products. 

The process in the opposite direction must be a 
dimerization process, which may be interpreted as 
the propagation step in a polymerization mecha¬ 
nism. The heats of polymerization calculated for 
VA from the potential energy curves are —18.627 
kcal/mol for the anionic mechanism, -31.778 
kcal/mol for the cationic mechanism, and -23.022 
kcal/mol for the free-radical mechanism; the ex¬ 
perimental value from the literature [14] is -21.0 
kcal/mol. 
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TABLE V_ 

Total atomic charges from Mulliken population analysis for the dimeric species of VA, 
calculated with the UHF/6-31G(c/) method. 


Atom 

VA 

VA anion 

VA cation 

VA free radical 

1 H 

0.205172 

0.147594 

0.287148 

0.287148 

2 C 

-0.332017 

-0.357533 

-0.476120 

-0.476120 

3 H 

0.144219 

0.245764 

0.154746 

0.154746 

4 C 

0.207229 

0.181474 

0.290899 

0.290899 

5 H 

-0.002987 

0.120137 

0.400659 

0.400659 

6 C 

0.159126 

0.173557 

0.318052 

0.318052 

7 H 

-0.609809 

-0.597995 

-0.514816 

-0.514816 

8 C 

0.167823 




9 H 

0.167434 

0.146838 

0.193760 

0.193760 

10 H 

-0.498400 

-0.492737 

-0.511565 

-0.511565 

11 C 

-0.623438 

-0.735056 

-0.634492 

- 0.634492 

12 C 

0.774221 

0.770519 

0.751987 

0.751987 

13 C 

0.176504 

0.218369 

0.223286 

0.223286 

14 H 

0.161408 

0.090427 

0.154930 

0.154930 

15 H 

0.192680 

0.113955 

0.215910 

0.215910 

16 C 

0.776163 

0.509799 

0.769270 

0.769270 

17 C 

-0.526482 

-0.561445 

-0.399745 

-0.399745 

18 C 

-0.621054 

-0.610418 

-0.579832 

-0.579832 

19 H 

-0.528501 

-0.790477 

-0.478013 

-0.478013 

20 H 

-0.624681 

- 0.492259 

-0.629457 

-0.629457 

21 H 

0.229435 

0.196462 

0.276367 

0.276367 

22 H 

0.192066 

0.196666 

0.262972 

0.262972 

23 H 

0.195782 

0.215863 

0.263654 

0.263654 

24 H 

0.190545 

0.100517 

0.197866 

0.197866 

25 H 

0.197787 

0.099525 

0.202732 

0.202732 

26 H 

0.229773 

0.110453 

0.259804 

0.259804 


STYRENE 

Table VI defines the notation used for the inter¬ 
nal coordinates of the ST dimer in the format of 
the z-matrix. The optimized values of the internu- 
clear distances, Rj, the bond angles. A], and the 
dihedral angles, Dj, are given in Tables VII, VIII, 
and IX, respectively. In the first and second 
columns are, respectively, the AMI- and UHF/6- 
31G*-optimized parameter values for the dimer of 
styrene. The last three data columns give the 
UHF/3-21G*-optimized parameters for the anion, 
cation, and free radical of the ST dimer. The atomic 
numbering is presented in Figure 6. 

The curves of the C 4 —C 6 bond distance vs. the 
UHF/3-21G* energy for the dissociation of the 
three species of styrene are shown in Figure 7. 
There, the three curves present a maximum be¬ 
tween 2.1 and 2.4 A, and every one has a different 
form. The curve corresponding to the ST cation 
shows asymptotic behavior from 2.5 A, which 


means that the C 4 —C 6 bond is broken, forming a 
styrene monomer with a double bond between the 
C 6 and the C n atoms and an ST monomer cation. 
Figure 8 shows the dissociation products. The 
cationic character of the dimer can be appreciated 
in Figure 9 by the net atomic charge on the carbon 
atoms 4 and 11 before and after the dimer dissocia¬ 
tion. The free-radical curve reaches its asymptoti¬ 
cal behavior at approximately 2.8 A. The dissocia¬ 
tion products of the dimer anion are a monomeric 
anion and a monomer. The net atomic charge on 
C n is from —0.5 at the equilibrium distance value 
to around -0.2 for distances greater than 2.8 A. 
On the other hand, the net atomic charge on C 4 is 
from almost zero to -0.55. This fact confirms that 
in this case, again, the dissociation is undertaken 
given the above-mentioned products. 

When the C 4 —C 6 bond is breaking, there are 
changes in the bond and dihedral angles of the 
atoms C 8 and H 7 which are bonded to C 4 . Figure 
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1.5 2.5 3.5 

C2-C3 bond Distance(angstroms) 

FIGURE 2. Potential energy curve for the VA dimeric species, calculated with the UHF/3-21G* model chemistry. The 
zero energy corresponds to 606.0 au. 



FIGURE 3. Change of the Mulliken net atomic charges for the main-chain atoms during the dissociation of the VA 
cation of the dimer, calculated at the UHF/3-21G* level of theory. 
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FIGURE 4. Change of the electrostatic potential net atomic charges for the main-chain atoms during the dissociation 
of the VA cation of the dimer, calculated at the UHF/3-21G* level of theory. 



FIGURE 5. Dissociation products of the VA free radical of the dimer, obtained at the UHF / 3-21G* level of theory. 
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TABLE VI_ 

ST z-matrix input for the Gaussian-94 program; 
the numbering of the atoms correspond to 
Figure 6. 


H 


c 

1 

R2 






H 

2 

R3 

1 

A3 




C 

2 

R4 

1 

A4 

3 

D4 

0 

H 

2 

R5 

1 

A5 

3 

D5 

0 

C 

4 

R6 

2 

A6 

1 

D6 

0 

H 

4 

R7 

2 

A7 

6 

D7 

0 

C 

4 

R8 

2 

A8 

6 

D8 

0 

H 

6 

R9 

4 

A9 

2 

D9 

0 

H 

6 

RIO 

4 

A10 

9 

DIO 

0 

C 

6 

R11 

4 

All 

9 

Dll 

0 

C 

8 

R12 

4 

A12 

2 

D12 

0 

C 

8 

R13 

4 

A13 

12 

D13 

0 

H 

11 

R14 

6 

A14 

4 

D14 

0 

H 

11 

R15 

6 

A15 

14 

D15 

0 

C 

11 

R16 

6 

A16 

14 

D16 

0 

C 

13 

417 

8 

A17 

4 

El 7 

0 

C 

12 

R18 

8 

A18 

4 

D18 

0 

H 

12 

R19 

8 

A19 

18 

D19 

0 

H 

13 

R20 

8 

A20 

17 

D20 

0 

C 

17 

R21 

13 

A21 

8 

D21 

0 

H 

17 

R22 

13 

A22 

21 

D22 

0 

H 

18 

R23 

12 

A23 

8 

D23 

0 

C 

16 

R24 

11 

A24 

6 

D24 

0 

C 

16 

R25 

11 

A25 

24 

D25 

0 

H 

21 

R26 

17 

A26 

13 

E26 

0 

C 

25 

R27 

16 

A27 

11 

D27 

0 

C 

24 

R28 

16 

A28 

11 

D28 

0 

H 

24 

R29 

16 

A29 

28 

D29 

0 

H 

25 

R30 

16 

A30 

27 

D30 

0 

C 

27 

R31 

25 

A31 

16 

D31 

0 

H 

27 

R32 

25 

A32 

31 

D32 

0 

H 

28 

R33 

24 

A33 

16 

D33 

0 

H 

31 

R34 

27 

A34 

25 

D34 

0 


10 shows how the bond angles of the C 6/ C g/ H 7 , 
H 10/ and C n atoms change from almost tetrahe¬ 
dral to trigonal values. The variation of the bond 
angle defined by C 2 —C 4 —C 6 is represented by 
the lowest curve in Figure 10. This curve and the 
curves in Figure 11 provide information about the 
trajectory followed by the monomer when it is 
moving away from the other fragment. Such 
changes are because on C 4 and C 6 a hybridization 
change is occurring from SP 3 to SP 2 . Figure 12 
shows the change of the total overlap population 
on the bonds of the main chain as a function of the 


bond distance. There it is possible to appreciate a 
change from a single to a double bond between the 
C 6 and C n atoms. Also, between C 4 and C 6 , there 
is a change from a characteristic value of a single 
bond to a zero total overlap population value 
when this bond has been broken. The change in 
the C 6 bond angle from 112 A to almost 90 A 
shows the trajectory followed by the monomer 
cation when it is approaching the molecular 
monomer. Figure 8 presents the dissociation prod¬ 
ucts. 

The energy difference between products and 
reagents [see Eq. (1)] could be a measure of the 
polymerization heat. From the dissociation curves, 
the calculated values for ST are —16.31, —19.45, 
and -21.34 kcal/mol for the anionic, cationic, and 
free-radical mechanism, respectively. The values 
calculated for the cationic species are in excellent 
agreement with the experimental value which is 
— 16.7 kcal/mol [15,16]. 


Conclusions 

For VA and ST, we optimized the geometry of 
monomers and dimers in their neutral, cationic, 
anionic, and free-radical forms. The heats of poly¬ 
merization obtained from the dissociation process 
are in good agreement with the values reported in 
the literature. With the methods used in this work, 
it is possible to appreciate the rearrangement of 
the electronic density. In all the cases where a 
bond rupture or formation (ionization and dimer¬ 
ization) occurred, changes in the hybridization of 
the atoms directly involved in such processes were 
observed. 

From the energy differences for the process of 
dimerization, it is possible to see that for the VA 
the anionic mechanism seems to be the most feasi¬ 
ble one, followed by the free radicals. The more- 
favored mechanism for ST is the anionic followed 
by the cationic. This order of polymerization is in 
excellent agreement with the Schildknecht classifi¬ 
cation [17]. 

The results enable us to postulate that heats of 
polymerization can be reasonably estimated by ab 
initio quantum mechanical calculations. Many pos¬ 
sibilities may be suggested for the reaction path¬ 
way followed by the monomer in their approach to 
the other species, but only one path is possible in 
the dissociation process. The path which we are 
proposing is the dimerization path for ST and VA. 
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TABLE VII_ 

Internuclear distances for the different dimeric species of styrene, calculated with the UHF/6-31G(d) and 
UHF/3-21G* methods. 



Initial 

parameters 

Optimized 

parameters 

Anion 

Cation 

Free radical 

R2 

1.08313 

1.0845 

1.087351 

1.085147 

1.082979 

R3 

1.08376 

1.0859 

1.084398 

1.082008 

1.084316 

R4 

1.54506 

1.5365 

1.548771 

1.541442 

1.541876 

R5 

1.08437 

1.0847 

1.083480 

1.081592 

1.084130 

R6 

1.54822 

1.5415 

1.565072 

1.592307 

1.560874 

R7 

1.0857 

1.0875 

1.087424 

1.082980 

1.083802 

R8 

1.524 

1.523 

1.512671 

1.517996 

1.521031 

R9 

1.08331 

1.086 

1.083338 

1.079065 

1.083519 

RIO 

1.08414 

1.0871 

1.083540 

1.081076 

1.084238 

R11 

1.5496 

1.5372 

1.510959 

1.476950 

1.509654 

R12 

1.39065 

1.3935 

1.465396 

1.399079 

1.399970 

R13 

1.38684 

1.3893 

1.384125 

1.395863 

1.397356 

R14 

1.08352 

1.0846 




R15 

1.08539 

1.0871 

1.076574 

1.076165 

1.074842 

R16 

1.51699 

1.6146 

1.409728 

1.383034 

1.410167 

R17 

1.38483 

1.3872 

1.389970 

1.393763 

1.393798 

R18 

1.38284 

1.3837 

1.394176 

1.391641 

1.391450 

R19 

1.07168 

1.0757 

1.074704 

1.073076 

1.071672 

R20 

1.07365 

1.0764 

1.082014 

1.073428 

1.073298 

R21 

1.38267 

1.3831 

1.456959 

1.391121 

1.392209 

R22 

1.07227 

1.0758 

1.074224 

1.071714 

1.072019 

R23 

1.07202 

1.0758 

1.080686 

1.071540 

1.072622 

R24 

1.38826 

1.3903 

1.428397 

1.421720 

1.427760 

R25 

1.38713 

1.3908 

1.428047 

1.421996 

1.427277 

R26 

1.07112 

1.0754 

1.075916 

1.070472 

1.072221 

R27 

1.38324 

1.3851 

1.390244 

1.364232 

1.389586 

R28 

1.38389 

1.3856 

1.388108 

1.367788 

1.387982 

R29 

1.07367 

1.0768 

1.073511 

1.073147 

1.073661 

R30 

1.07377 

1.0768 

1.070176 

1.070342 

1.071485 

R31 

1.38479 

1.3855 

1.402132 

1.396969 

1.401893 

R32 

1.07259 

1.0757 

1.073199 

1.069944 

1.072030 

R33 

1.072 

1.0758 

1.073939 

1.069775 

1.072197 

R34 

1.0713 

1.0754 

1.072277 

1.070921 

1.072131 
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TABLE VIII_ 

Bond angles for the different dimeric species of ST, calculated with the UHF/6-31G(d) and UHF/3-21G* 
methods. 


Initial Optimized 

parameters parameters Anion 


Cation Free radical 


A3 

108.33837 

107.7667 

108.787 

108.644 

110.504 

A4 

111.21367 

111.4655 

111.976 

111.722 

108.427 

A5 

108.10089 

107.6871 

108.791 

107.786 

111.214 

A6 

112.17282 

122.9059 

109.928 

111.643 

108.076 

A7 

108.11570 

107.2820 

107.699 

108.915 

111.382 

A8 

110.49859 

111.2918 

112.138 

112.551 

108.608 

A9 

108.76368 

108.5197 

107.480 

108.076 

108.428 

A10 

109.62103 

109.8119 

107.938 

107.689 

112.622 

All 

113.64087 

114.3750 

114.024 

109.519 

121.499 

A12 

121.19954 

121.5492 

120.837 

121.427 

120.421 

A13 

120.48590 

120.4949 

121.337 

119.796 

117.560 

A14 

110.12286 

110.1620 




A15 

109.14581 

109.1545 

117.966 

116.240 

124.834 

A16 

110.65706 

112.3793 

125.044 

127.225 

121.026 

A17 

121.04501 

121.2240 

123.675 

120.617 

120.847 

A18 

120.83009 

121.0439 

118.444 

120.390 

119.901 

A19 

119.93723 

119.9504 

120.941 

120.520 

119.400 

A20 

119.46750 

119.4909 

118.015 

119.900 

120.091 

A21 

120.13621 

120.1610 

118.221 

120.0460 

119.780 

A22 

119.76853 

119.7094 

120.924 

119.755 

119.777 

A23 

119.76843 

119.6806 

118.159 

119.723 

119.896 

A24 

120.63336 

120.8650 

120.366 

118.895 

123.000 

A25 

120.70228 

120.8867 

122.670 

122.381 

120.277 

A26 

120.32113 

120.3923 

120.420 

120.212 

120.995 

A27 

120.81487 

120.9931 

120.957 

120.179 

121.364 

A28 

120.75128 

121.0002 

121.503 

120.585 

118.672 

A29 

119.53357 

119.5145 

118.575 

119.108 

119.625 

A30 

119.45174 

119.5118 

119.215 

119.852 

120.672 

A31 

120.12089 

120.1800 

120.856 

119.721 

119.577 

A32 

119.86709 

119.7680 

119.449 

120.517 

119.801 

A33 

119.82719 

119.7669 

119.860 

120.665 

120.237 

A34 

120.18628 

120.2854 

120.313 

119.193 



778 


VOL 65, NO. 5 




DIMERIZATION PROCESS OF VINYL MONOMERS 


TABLE IX_ 

Dihedral angles for the different dimeric species of ST, calculated with the UHF/6-31G(cf) and UHF/3-21G* 
methods. 



Initial 

parameters 

Optimized 

parameters 

Anion 

Cation 

Free radical 

D4 

-121.79377 

-122.5815 

121.771 

-123.139 

-121.644 

D5 

117.06081 

115.9720 

117.265 

116.908 

117.271 

D6 

62.09327 

62.6393 

56.961 

65.152 

60.288 

D7 

-118.90094 

-117.9309 

-116.470 

-118.285 

-117.901 

D8 

123.61427 

125.5284 

123.991 

121.429 

123.633 

D9 

172.42559 

174.3642 

174.229 

175.667 

173.099 

DIO 

-116.94578 

-115.6387 

-114.803 

117.712 

-115.854 

Dll 

120.83665 

121.3542 

123.273 

122.755 

123.011 

D12 

-62.04618 

-61.8644 

-60.204 

-54.350 

-60.001 

D13 

-179.74842 

-179.8569 

-179.021 

-179.694 

-179.184 

D14 

61.72438 

62.4989 




D15 

-117.86062 

-116.7008 

94.894 

88.511 

94.818 

D16 

121.17856 

121.7599 

-177.612 

-177.834 

-177.681 

D17 

179.79928 

179.8637 

179.136 

179.804 

179.304 

D18 

-179.77884 

-179.8371 

-179.140 

-179.670 

-179.275 

D19 

179.91523 

179.9692 

-179.789 

-179.533 

-179.949 

D20 

-179.99439 

-180.0385 

179.836 

179.180 

179.839 

D21 

-0.05688 

-0.016 

-0.092 

-0.140 

-0.113 

D22 

179.95525 

180.0192 

179.779 

179.726 

179.984 

D23 

179.93643 

179.9625 

-179.989 

-179.749 

179.972 

D24 

-90.34242 

-91.1110 

179.493 ( 

-182.503 

178.103 

D25 

177.99809 

178.6660 

179.775 

179.912 

179.842 

D26 

179.98703 

180.0002 

-179.949 

179.912 

179.978 

D27 

-178.02109 

-178.556 

-179.508 

179.595 

-179.866 

D28 

178.02595 

178.5297 

179.477 

179.552 

179.862 

D29 

179.68947 

179.7631 

-179.970 ( 

179.579 

-179.918 

D30 

-179.85802 

-179.7743 

-179.371 

-179.715 

179.758 

D31 

-0.00875 

-0.0313 

-0.058 

0.928 

-0.020 

D32 

-179.84312 

-179.7941 

-179.720 

179.527 

179.913 

D33 

179.84029 

179.8574 

-179.915 

-179.395 

179.961 

D34 

-179.89313 

-179.8424 

179.985 

179.380 

-179.936 



FIGURE 6. Dimer of ST; optimized structure with the HF/6-31G* method, showing the numbering used throughout 
the text. 
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FIGURE 9. Change of the Mulliken net atomic charges for some relevant carbon atoms of the ST cation of the dimer 
during the dissociation. 
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FIGURE 12. Bond distance vs. half of the total overlap population calculated for the ST cation of the dimer. 
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ABSTRACT: The reaction mechanism and relative stabilities of the intermediates and 
transition states in the reaction of 0 2 plus ethylene molecule using ab initio molecular 
orbital (MO) theories at several levels of theory with the correction of the nondynamic 
and dynamic electron correlation effects were systematically investigated. Full geometry 
optimizations of the corresponding biradical (BR) intermediates, perepoxide (PE) and 
1,2-dioxetane (DO) were performed by complete-active-space CASSCF{2,2}/6-31G* 
method with nondynamic electron correlation effect and Moller-Plesset MP2/6-31G* 
method with dynamic electron correlation effect. For the 1,4-biradical intermediates, new 
gauche-type 1 BR state was found at both CASSCF and MP2 levels, corresponding to the 
transition state of the rotation motion of the 0 2 moiety. It was found from the intrinsic- 
reaction-coordinate (IRC) study that another gauche-type a BR transition state connects 
smoothly to the reactant system 1 0 2 + H 2 C = CH 2 and the gauche minimum X BR state, 
showing that the reaction through the 1,4-biradical intermediates initially proceeds 
through the gauche transition state to form the gauche minimum : BR state, following 
that the free rotation of 0 2 moiety occurs due to the energy barrier less than 4.0 
kcal/mol. The stability of the perepoxide is surprisingly sensitive to the levels of the 
theory and the basis sets employed. The coupled-cluster methods, CCSD and CCSD(T), 
gave the reasonable stabilities of 1,4-biradical intermediates, perepoxide, and dioxetane 
as a reaction product. From the results of the CCSD and CCSD(T) methods, the reaction 
of l 0 2 + H 2 C = CH 2 proceeds by a two-step mechanism through the 1,4-biradical 
intermediates rather than through the perepoxide. © 1997 John Wiley & Sons, Inc. Int J 
Quant Chem 65: 787-801, 1997 
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YOSHIOKA ET AL. 


T he additions of the singlet molecular oxygen 
to various types of the olefin derivatives have 
been extensively studied over 30 years IX 2]. A 
large number of accumulated data suggested three 
types of the reaction modes: (I) 1,2-addition to give 

1,2-dioxetane; (II) ene reaction to give carbonyl 
fragments and allylic hydroperoxides; (III) 1,4-ad¬ 
dition to give endperoxides. For these reaction 
modes, the following five mechanisms are sug¬ 
gested as shown schematically in Figure 1: (i) 
concerted mechanism which proceeds without 
well-defined intermediates; (ii) mechanism through 
a homopolar biradical (BR) intermediates; (iii) 
mechanism through a zwitterionic (ZW) interme¬ 
diates; (iv) mechanism through a one-electron 
transfer (ET) biradical intermediates; (v) mecha¬ 
nism through a perepoxide (PE) intermediates [3, 
4]. The experimental studies on the reaction mech¬ 
anisms [3, 4] have clarified that energy differences 
between these reaction mechanisms are not so large 
even in a gas phase and highly dependent on the 
chemical conditions such as solvent polarity em¬ 
ployed. Several reaction mechanism are, then, op¬ 
erative in each reaction mode. 

On the other hand, theoretical studies [5, 6] 
showed that the reacting complexes between 
molecular oxygen and various olefins are regarded 
as typical quasi-degenerate systems in energy. This 
indicates that the reaction complexes are very sen- 


+ 






FIGURE 1. Typical five reaction mechanisms of singlet 
oxygen molecule and ethylene derivatives. 


sitive for introducing substituents and/or chang¬ 
ing reaction conditions such as solvents, and inter¬ 
conversion of reaction mechanisms easily occurs. 
This also suggests the possibility that the relative 
stabilities of electronic states on the reaction path¬ 
way are highly sensitive to the level of the theory 
with and without correlation effects and basis sets 
employed. 

liarding and Goddard supported the biradical 
mechanism such as CH 2 CH 2 00 with extensive 
calculations at the GVB-CI level with a double-^ 
plus polarization (DZP) basis set [7], The perepox¬ 
ide were located as a minimum with higher en¬ 
ergy than biradical minimum by about 8 kcal/mol. 
On the basis of the MINDO/3 level of theory, 
Dewar et al. studied the reaction of 0 2 with vari¬ 
ous substituted alkenes [8]. They found, in the case 
of ethene itself, that the reaction proceeds through 
a polar perepoxide, which in turn rearranges to 

1,2-dioxetane in a rate-determining step. 

Hotokka et al. have performed the calculations 
of reaction intermediates and transition states 
optimized with constraints of the symmetry and 
geometry at the level of CASSCF and CCI, 
concluding that the formation of the perepoxide 
intermediate is easier than passing through a C s 
syn-biradical pathway [9]. Robb and co-workers 
suggested from the calculations at the level of 
CASSCF{6,6}/4-31G that the 1,2-dioxetane-forming 
reaction is described as a two-step reaction which 
proceeds through a gauche biradical intermediate 
[10]. It is, however, noteworthy that these theoreti¬ 
cal results may not be conclusive because the 
CASSCF do not involve the correction of the dy¬ 
namic electron correlation. In fact, the heat of the 

1,2-dioxetane formation is too small (12.8 [9] and 
4.0 [10b] kcal/mol) compared with the value from 
thermochemical estimation (30 kcal/mol). These 
theoretical results, therefore, show that the reac¬ 
tion mechanism is still not conclusive for the reac¬ 
tion mode as well as the experimental data. 

In this article, we systematically investigate the 
reaction mechanism and relative stabilities of the 
intermediates and transition states in the reaction 
of ethylene molecule plus 0 2 using ab initio 
molecular orbital (MO) theories at several calcula- 
tional levels with the correction of the nondynamic 
and dynamic electron correlation effects [11, 12]. 
Full geometry optimizations of the corresponding 
BR intermediates, PE, and 1,2-dioxetane (DO) are 
performed by CASSCF{2,2}/6-31G* level with non¬ 
dynamic electron correlation effect and Hartree- 
Fock [HF; restricted (R) or unrestricted (U)] 
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MP2/6-31G* level with dynamic electron correla¬ 
tion effect. The relative stabilities of the reaction 
intermediates and transition states are estimated 
again by the approximately spin-projected but 
size-consistent UHF scheme (APUHF) [13], the ap¬ 
proximate spin projection scheme (APUMP2 and 
APUMP4) [6], and the RHF or UHF coupled-clus¬ 
ter [(R or U)CCSD and (R or U)CCSD(T)] [6d, 14] 
methods, CASMP2 [15] method, and CASPT2 [12] 
method at the optimized geometries using the 
same 6-31G* basis set. The reaction mechanisms 
through 1,4-biradical intermediates and perepox- 
ide will be discussed and suggested. The program 
package Gaussian 94 [15] was employed to evalu¬ 
ate the above. 


Calculational Procedures 

Calculations for the singlet biradical intermedi¬ 
ates were carried out at UHF, UHF natural orbital 
(UNO) CASSCF [16, 17] and UMP2 [18] levels of 
theory with 6-31G* basis set. As described in the 
previous works [19] the RHF is not suitable for 
estimating the electronic structures of the species 
in the singlet biradical states due to the triplet 
instability [13]. The singly occupied MOs (SOMO) 
with radical characteristics are, then, induced by 
mixing highest occupied (HO) and lowest unoccu¬ 
pied (LU) MOs of the RHF solution, resulting in 
the different orbitals for different spins (DODS) 
MO picture by the singlet UHF solution. The sin¬ 
glet DODS solution constructed is not a pure 
eigenfunction of the spin angular momentum due 
to the contamination from triplet excited wave 
function. The removement of the contamination 
result in the approximately spin-projected UHF 
(APUHF) [13]. The correlation correction by the 
APUHF method for the case of tricentric biradical 
such as 0 3 , CH 2 NHCH 2 , CH 2 00, 0CH 2 0, and 
CH 2 CH 2 0 gives the simple relation as follows 
[19a]: 


tUNO CASCI{2 / 2} — *rAPUHF — 9uNO CASSCF{2,2} • 


( 1 ) 


Similarly, UMP2 is not also a pure spin state with 
the contamination from higher excited states. The 
approximate spin projection to the singlet state 
gives the APUMP2 wave function. As described in 
a previous work [19b], the APUMP2 method is 
simply related by the CASMP2 method from the 
behavior of potential energy curve for the rotation 


of the ethylene molecule, as follows: 


V> 


UNO CASMP2{2,2} 


= \ 


APUMP2 


+ A, 


( 2 ) 


where A is term corresponding to the orbital relax¬ 
ation energy. 

The geometry optimizations for the singlet bi¬ 
radical species were carried out by two different 
procedures: (i) the natural orbitals are constructed 
by DODS MOs obtained, following that the UNO 
CASSCF{2,2} calculations were performed in an 
active space of two SOMOs and two biradical 
electrons to include the nondynamic electron cor¬ 
relation effect and (ii) UMP2 calculations were 
performed by using DODS MOs of the UHF solu¬ 
tion to include the dynamic electron correlation 
effect [11]. The usual RHF picture was used for 
CASSCF{2,2) and MP2 (RMP2) to calculate the 
singlet closed-shell species such as perepoxide, 
1,2-dioxetane, and so on. 

All geometrical parameters of the reaction inter¬ 
mediates and transition states were fully opti¬ 
mized by the above two methods without con¬ 
straints of the symmetry and geometry. For the 
stationary point, the vibrational frequency analy¬ 
ses were carried out to clarify the intermediate, 
transition state (TS) or higher-order saddle point 
on the potential energy surface. The relative stabil¬ 
ities of the reaction intermediates and transition 
states are estimated again by the APUHF, 
APUMP2APUMP4, UCCSD, UCCSD(T), CASMP2, 
and CASPT2 methods at the optimized geometries. 


Results and Discussion 

REACTANTS 

The geometry optimizations of 0 2 (A ) at the 
CASSCF{2,2}/6-31G* level gives the O—O bond 
distances of 1.210 A, being in quite good agree¬ 
ment with the experimental value of 1.216 A [20a]. 
The O—O bond distance of CASSCF{2,2}/6-31G* 
is smaller by 0.048 A than 1.258 A of CASSCF/4- 
31G by Robb et al. [10b]. For the triplet 0 2 ( 3 S g ), 
the O—O bond distances were estimated to be 
1.168 and 1.247 A by CASSCF{2,2} q and UMP2, 
respectively, being smaller by 0.040 A and longer 
by 0.039 A than the experimental value of 1.208 A. 
The ] A ? state at CASSCF{2,2} level is higher in 
energy by 0.62 eV (14.2 kcal/mol) than the 3 
state, being quite smaller than the experimental 
value of 1.07 eV. 
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In the ethylene molecule, the RMP2/6-31G* 
method gives the optimized geometrical parame¬ 
ters of 1.336 A for C—C bond distance, 1.085 A for 
C—H, and 121.7° for C—C—H bond angle, being 
in quite reasonable agreement with the experimen¬ 
tal values [20b] of 1.330 A, 1.076 A, and 121.7°, 
respectively. 

REACTION PATHWAY THROUGH 

1,4-BIRADICAL INTERMEDIATES 

Optimized 1,4-Biradical Intermediates 

The (2 s + 2s)-type attacks of singlet molecular 
oxygen Co 2 ) to ethylene molecule have the possi¬ 
bility to provide intermediates with 1,4-biradical 
orbitals on terminal carbon and oxygen atoms in 
the s-cis type of the coplanar geometry. Four types 
of the singlet BR intermediates are expected by the 
rotation of the terminal methylene group and/or 
oxygen atom, that is, era'-, Vtt-, l ir<r- / and Vtt-BR 
intermediates. The C—O bond cleavage of 1,2-di- 
oxetane (DO) leads the Va-BR intermediate with 
a-type radical orbital on terminal carbon atom and 
cr-type radical orbital on another terminal oxygen 
atom. The Vcr-BR intermediate has higher energy 
by 12.9 keal/mol than Vtt-BR at APUHF/4-31G 
level, and the Vcr-BR state is also more unstable 
by 13.3 keal/mol [21]. Though the V77-BR state is 
comparable in energy with the Vtt-BR state [21], 
the terminal methylene group is rotated by 90° in 
the Vtt-BR state. This means that the reaction to 
1,2-dioxetane proceeds through the energy barrier 
due to the rotation of the terminal methylene 
group. In this study, we investigated the Vtt-BR 
states as the reaction intermediates to the 1,2-di¬ 
oxetane. Although only s-cis conformation is noted, 
the s-trans and gauche conformations are equally 
possible for 1,4-peroxy biradicals. Full geometry 
optimizations of the corresponding s-cis conforma¬ 
tion, the s-trans and gauche conformations were 
carried out at the levels of CASSCF{2,2}/6-31G* 
and UMP2/6-31G*. 

Four optimized structures with the geometries 
of s-cis, two gauche's, and s-trans types in the Vtt 
BR state were found by UMP2/6-31G* and 
CASSCF{2,2}/6-31G* methods. Their conforma¬ 
tions with optimized geometrical parameters are 
shown in Figure 2. Their relative stabilities to the 
gauche-type 1 BR(G1) state are summarized in Table 
I. The corresponding values by the C ASSCF/4-31G 
method by Robb et al. [10b] are also given for 
comparison. 


As can be seen from Table I, the most stable 
structures of the singlet BR state is the gauche-type 
BR(G1), which was optimized by both UMP2/6- 
31G* and CASSCF{2,2}/6-31G* methods. This 
state is a real local minimum (stationary point) on 
the potential energy surface with all positive 
eigenvalues of Hessian matrix ( v min = 65 cm -1 by 
UMP2 method), being in good agreement with the 
result of Robb et al. [10b]. The bond distances of 
C—O and O—O given by UMP2 are slightly larger 
than those by CASSCF{2,2}, while the C—C bond 
distance of UMP2 is slightly shorter than that of 
CASSCF{2,2). These tendencies are observed in an¬ 
other three optimized structures. For the bond 
distances of O—O and C—0,1.310 and 1.472 A of 
UMP2 and 1.301 and 1.432 A of CASSCF{2,2} using 
the 6-31G* basis set are quite shorter than 1.376 
and 1.530 A of CASSCF{6,6}/4-31G by Robb et al. 
[10b]. These differences may be due to the inclu¬ 
sion of the polarization functions and the use of 
unrestricted natural orbital (UNO) for the CASSCF 
method. The twisted angles (82.8° and 87.1°) of 
O—O—C —C by both UMP2 and CASSCF{2,2} 
are slightly larger than 79.6° of Robb et al. [10b]. 

Next, the stationary point is the s-trans Vtt-BR 
state with higher energy by only 0.6 and 0.3 
keal/mol than the most stable gauche-type BR(G1) 
state by UMP2/6-31G* and CASSCF{2,2}/6-31G* 
methods, respectively. This state possesses the 
lowest vibrational frequencies of 21 cm" 1 at 
UMP2/6-31G* level, indicating that the Vtt-BR 
stationary point is really a shallow minimum on 
the potential energy surface. These results are also 
in quite good agreement with the results of Robb 
et al. [10b]. The bond distances of O—O, C—O, 
and C—C are quite similar to the values of the 
most stable gauche-type BR(G1) state at both 
UMP2/6-31G* and CASSCF{2,2}/6-31G* levels. 
The O—O and C—O bond distances are also 
shorter by 0.071-0.095 A compared with those of 
Robb et al. [10b]. 

Each of UMP2/6-31G* and CASSCF{2,2}/6~ 
31G* methods gives two transition states, the sad¬ 
dle point on the potential energy surface, with one 
negative eigenvalue of Hessian matrix; one is a 
s-cis Vtt-BR state and another is a gauche ! BR(G2) 
state. The s-cis Vtt-BR state has higher energy by 
2.5 and 3.5 keal/mol than those of the most stable 
gauche-type BR(G1) state at UMP2/6-31G* and 
CASSCF{2,2}/6-31G* levels, respectively, com¬ 
pared with 2.0 keal/mol of Robb et al. [10b]. Robb 
et al. found three optimized geometries with the 
structures of s-cis-type 1,4-biradical conforma- 
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FIGURE 2. Optimized geometrical parameters of the singlet biradical states obtained by UMP2/6-31G* method and 
UNO CASSCF{2,2}/6-31G* method in parenthesis. 


tions; one is obtained by both 4-31G and STO-3G 
basis sets using CASSCF method; others are given 
by only STO-3G basis set. Two structures, which 
are the second-order saddle points on the 113 and 
114 energy surface (Fig. 5 in Ref. [10b]), have the 
C—O bond distances of 1.842 A (1.874 A by 4-31G) 
and 1.759 A given by STO-3G basis set employed, 
being quite different from our results of 1.478 A 
(UMP2) and 1.491 A (CASSCF). Robb et al. [10b] 
also showed that the third structure with the C—O 
bond distance of 1.538 A by STO-3G is a local 
minimum on the 113 energy surface. This structure 
corresponds to our structure. Their conclusion is, 
however, in conflict with our result that the s-cis 


cr 7 r~BR state is the saddle point on the potential 
energy surface, namely the transition state. The 

s-cis (J7T-BR and gauche-type BR(G2) states have 
really imaginary frequencies of 125/ and 44/ cm -1 
at UMP2/6-31G* level, respectively. 

It should be also noted that the gauche-type 
BR(G2) state obtained here is far from the C x 
gauche first-order saddle point given by Robb et 
al. [10b] in the molecular geometry and relative 
stability to the gauche-type 1 BR(G1) state. The 
C—O distances are 1.472 and 1.432 A and the 
relative energies to most stable gauche-type BR(G1) 
state are estimated to be 0.7 kcal/mol by both 
UMP2/6-31G* and CASSCF{2,2}/6-31G*, while 
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TABLE I 


Relative stabilities (kcal / mol) of optimized structures of reactants, reaction intermediates, 
transition states, and products. 


States \ Methods 

CASSCF{2,2} 

MP2 

CASSCF a 

0 2 ( 1 A g ) + H 2 C = CH 2 

o 2 (%) + h 2 c = ch 2 

-18.1 


-22.8 (-17.7) 

-32.3 

-30.1 


V^BR (s-Cis) 

3.5 

2.5 

(2.0) 

(gauchel) 

o.o b 

0.0° 

0.0 (0.0) 

(gauche2) 

0.7 

0.7 

0.6 (0.4) 

WBR (s-trans) 

0.3 

0.6 

^R (gauche TS) 


19.9 

6.7 (11.1) 

WBR (s-trans SP2) d 

18.7 e 

21.0 

9.6 (12.1) 

Perepoxide (PE) 

3.2 

-0.6 

-39.0 (32.3) 

PE (SP2) d 

44.4 

12.9 

-25.4 (45.4) 

Dioxetane COO) 

-51.8* 

-51.5 

-26.8 (-41.8) h 

’do (TS) 

— 51.8 s 

-50.8 


2H 2 C = O 

— 84.5' 

-105.0 



a CASSCF/4-31G and (CASSCF/ STO-3G) in Ref. [10b] of Robb et al. 
b Total energy is -227.598231 a.u. 
c Total energy is -228.168207 a.u. 
d SP2 means a second-order saddle point. 

e CASSCF{2,2} value at the geometry optimized by CASSCF{4,4}. 
f Total energy is -227.680865 a.u. 

9 Total energy is -227.680805 a.u. 
h C 2v symmetry not C 2 . 

'The formaldehyde dimer with C-C distance of 10 A was calculated. 


Robb et al. [10b] gave the C—O distance of 1.850 
A and the relative energy of 6.7 kcal/mol by 4-31G 
basis set. This shows that the present gauche-type 
BR(G2) state is apparently different from the tran¬ 
sition state leading to 1,2-dioxetane suggested by 
Robb et al. [10b], even if the gauche-type BR(G2) 
state has single negative eigenvalue of the Hessian 
matrix. 

The potential energy curves of the singlet BR 
states for C—O rotation at UMP2 and CASSCF 
levels are depicted in Figure 3. Four singlet BR 
states obtained here are smoothly connected, indi¬ 
cating that the present gauche-type BR(G2) state is 
apparently the transition state of the rotation mo¬ 
tion of 1,4-biradical states. The rotational energy 
barriers are within 3.5 kcal/mol, showing that the 
free rotation easily occurs in the singlet state. The 
carbon-carbon bond distances of four structures 

o 

are fallen into small regions from 1.478 to 1.482 A, 
being larger than 1.336 A of ethylene molecule at 
RMP/6-31G* level. This indicates that the radical 
characteristics on a terminal carbon atom with 
cleavage of it bonding orbital is kept throughout 
the rotation motion. Similarly, the oxygen-oxygen 
bond distances of 1.306, 1.310, 1.311, and JL311 A 
are longer than 1.208 A (X 3 2“) and 1.216 A (a 1 A ? ) 


of oxygen molecule estimated by the experiment, 
reflecting the biradical characteristics on terminal 
oxygen atom. The C—O distances are between 
1.472 and 1.495 A for the rotation motion. The 
UMP2 method gives longer distances than the 
CASSCF{2,2} method due to inclusion of dynamic 
electron correlation effects except for C—C dis- 
tance of s-trans o-7t-BR state. 

Relative Stabilities of 1,4-Biradical 

Intermediates at Various Calculational Levels 

Table II summarizes the relative stabilities of 
four 1,4-biradical states estimated by various lev¬ 
els of theory based on the geometries optimized by 
UNO CASSCF{2,2} and UMP2 method using 6- 
31G* basis set. The UMP4 with higher-order corre¬ 
lation correction indicates that the most stable 1,4- 
biradical state is the gauche-type BR(G1) state at 
the geometries optimized by both CASSCF{2,2} 
and UMP2 methods. The relative stabilities of four 
states gives the same ordering to those of UMP2 
levels, indicating that BR(G1) and cr7r-BR (s-trans) 
states are the local minimum and BR(G2) and 
\ 777 -BR (s-cis) are the first-order saddle points on 
the potential energy surface. 
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FIGURE 3. Potential energy curves of the singlet biradical states for the rotation ofC — C — 0 — 0 estimated by 
UMP2/6-31G* and UNO CASSCF{2,2} /6-31G* methods. 


The approximately spin-projected (AP)UHF and 
(AP)UMPn levels are constructed by removing the 
contamination of the triplet exited wave function. 
The APUHF gives the same ordering of the rela¬ 
tive stabilities at four biradical states with the 


geometries optimized by CASSCF{2,2} and UMP2 
methods. As is shown in Eq. (1), and presented in 
the previous work [19], the APUHF gave the corre¬ 
lation correction nearly equal to those of the 
CASCI{2,2} and CASSCF{2,2} in the case of tri- 


TABLE II_ 

Relative stabilities (kcal / mol) of four 1,4-biradical intermediates estimated by various levels of theory 
based on the geometries optimized by CASSCF{2,2} and UMP2 using 6-31G* basis set. 


States Geom \ 
Methods 

WBR (s-cis) 

(G1) a 

(G2) a 

o-7T BR (s-trans) 

CASSCF 

UMP2 

CASSCF 

UMP2 b 

CASSCF 

UMP2 

CASSCF 

UMP2 

UHF 

2.11 

3.08 

-0.81 

0.0 

-0.18 

0.36 

-0.97 

-0.08 

APUHF 

2.64 

3.73 

-0.56 

0.0 

0.12 

0.85 

-0.30 

0.68 

CASSCF 

2.50 

3.62 

-1.00 

0.0 

-0.29 

0.74 

-0.73 

0.33 

UMP2 

3.53 

2.53 

1.07 

0.0 

1.67 

0.67 

-1.61 

0.64 

APUMP2 

4.38 

3.52 

1.26 

0.0 

1.90 

1.31 

2.53 

1.68 

UMP4 

3.99 

2.59 

1.44 

0.0 

2.18 

0.72 

2.00 

0.65 

APUMP4 

4.88 

3.61 

1.58 

0.0 

2.41 

1.40 

2.98 

1.76 

CCSD 

4.07 

2.86 

1.27 

0.0 

2.08 

0.70 

1.65 

0.50 

CCSD(T) 

4.31 

2.78 

1.53 

0.0 

2.36 

0.75 

2.02 

0.58 

CASMP2 

3.97 

3.16 

1.08 

0.0 

1.75 

1.06 

2.01 

1.20 

CASPT2 

10.34 

9.48 

-0.33 

0.0 

1.17 

4.93 

8.48 

7.59 


a Gauche geometry, G1 is a local minimum and G2 is a transition state. 

b Total energies in units of a.u. are -227.606549 for UHF, -227.606588 for APUHF, -227.596633 for CASSCF, -228.168207 for 
UMP2, -228.168429 for APUMP2, -228.224839 for UMP4, -228.225177 for APUMP4, -228.212413 for CCSD, -228.229669 
for CCSD(T), -228.175792 for CASMP2, and -228.193027 for CASPT2. 
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centric biradical intermediates such as 0 3 , 
CH 2 NHCH 2/ CH 2 00, 0CH 2 0, and CH 2 CH 2 0. It 
seems that these relationships are also supported 
in the case of tetracentric biradical species. 

However, the APUMPn (n = 2,4) gives the re¬ 
versed stabilities that the gauche-type BR(G2) state 
is more stable than the s-trans 0 - 77 -BR state, con- 

i 

flicting with above results that the s-trans o-7t-BR 
state is the local minimum and the gauche-type 
BR(G2) state is the first-order saddle point on the 
potential energy surface. These results show that 
the approximately spin-projected (AP) procedure 
is, in some cases, not effective for describing the 
energy differences of the energetically quasi-de¬ 
generate chemical system. This may be due to fact 
that the APUMPn methods include the small con¬ 
tribution of the higher excited states than triplet 
excited state. The APUMPn methods should be 
carefully used to describe the chemical systems 
and intermediates with strong biradical character 
as well as APUHF. 

Two coupled-cluster CCSD and CCSD(T) meth¬ 
ods with dynamic electron correlation show that 
the relative stabilities of four 1,4-biradical state 
give the same ordering to those optimized by 
CASSCF{2,2} and UMP2 methods. The energy gaps 
between four BR states fall into within a few 
kcal/mol, giving the reasonable chemical pictures. 

However, in the CASMP2 method, the charac¬ 
teristics of four states such as minimum and tran¬ 
sition state on the potential energy surface are lost 
as well as the APUMP2 and APUMP4 methods, 
indicating that the inclusion treatment of the dy¬ 
namic electron correlation based on the CASSCF 
wave function is not effective for 1,4-biradical 
species. The behaviors of the CASMP2 and 
APUMP2 methods support the relationship given 
in Eq. (2). Though the CASPT2 method gives larger 
energy gaps between four 1,4-biradical states than 
the CASMP2, the CASPT2 has similar tendencies 
on the characteristics of the potential energy sur¬ 
face to the CASMP2. 

From Table II, following conclusions are de¬ 
scribed for the chemical systems which are quasi¬ 
degenerate in energy and have the strong biradical 
character. The dynamic electron correlation is es¬ 
sentially necessary to elucidate the electronic 
structures. The inclusion of the dynamic electron 
correlation should be done carefully and ade¬ 
quately. The geometry optimization should be 
done to represent the energetics in quasi-degener¬ 
ate chemical system and the electronic structures 


with the biradical character even if the high-qual¬ 
ity method is employed. 

Transition States with 

1,4-Biradical Characteristics 

Figure 4 shows optimized geometry of the C 1 
gauche transition state with just one imaginary 
frequency of 1191/ cm-1 at UMP2/6-31G* level. 
These structures are corresponding to the transi¬ 
tion state given by the CASSCF calculation of 
Robb et al. [10b]. As is indicated in Table I, the 
relative energies of 19.9 kcal/mol at UMP2 level is 
larger than 6.7 kcal/mol of Robb et al. [10b]. The C 

—C and C—O bond distances of 1.373 and 1.873 A 

0 

are shorter by only 0.028 A and longer by only 
0.023 A of Robb et al. [10b], being similar tenden¬ 
cies in the case of the 1,4-biradical states. 

Robb et al. [10b] indicated the qualitative de¬ 
scription of the potential energy surfaces (Fig. 5 in 
Ref. [10b]). They described that the gauche TS is 
connected to the gauche 1,4-biradical minimum 
(Gl). They also suggested that the avoided cross¬ 
ing corresponding to the closure of the gauche 
1,4-biradical minimum to 1,2-dioxetane exists and 
the transition state is located far from the gauche 
TS obtained. However, they did not discuss the 
path from the gauche TS to the 1,2-dioxetane 
molecule concretely. 

The intrinsic-reaction-coordinate (IRC) calcula¬ 
tion was performed to elucidate the characteristics 
of the gauche transition state. Figure 5 shows the 



^FKGTS) 

FIGURE 4. Optimized geometrical parameters of the 
C 1 gauche transition state obtained by UMP2/6-31G* 
method. 
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-1.5 -1 -0.5 0 0.5 1 1.5 2 

Reaction Coordinate 


FIGURE 5. Energy variation from the gauche transition state along the intrinsic reaction coordinate using UMP2/ 
6-31G* method. 


energy variation from the gauche transition state 
along the intrinsic reaction coordinate. The energy 
profile smoothly connects to both directions of two 
different states. The geometries on the terminal 
points of the reverse and forward directions of 


intrinsic reaction coordinate are depicted in Figure 
6. It is found from Fig. 6(a) that the geometry 
obtained from the reverse direction is almost equal 
to that of the gauche 1 BR minimum shown in 
Figure 2, indicating that the gauche transition state 




(A) Reverse direction (B) Forward direction 

FIGURE 6. Geometrical parameters of the states obtained by the intrinsic reaction coordinate calculation from the C. 
gauche transition state using the UMP2/6-31G* method: (a) reverse direction, (b) forward direction. 
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1 

is smoothly connected to the gauche BR state. 
Figure 6(b) represents the geometry obtained from 
the forward direction. The C—O distance is 2.710 

o 

A, being quite longer as a chemical bonding. The 
C—C bond distance in the ethylene moiety is 1.310 
A, being nearly equal to 1.336 A of the ethylene 
molecule optimized by RMP2/6-31G* method. 
Apparently, the C 1 gauche transition state has a 
reaction path to connect with the dissociation limit 

of ’o 2 + h 2 c = ch 2 . 

The reaction of 1 O z + H 2 C — CH 2 through the 
1,4-biradical intermediates initially proceeds 
through the C 1 gauche transition state to form the 
gauche X BR minimum state, following that the free 
rotation of 0 2 moiety along the C—O bond axis 
occurs due to the energy barrier less than 4.0 
kcal/mol. The free rotation brings about the possi¬ 
bility to form the 1,2-dioxetane molecule. 

The transition state corresponding to the gauche 
X BR(G TS) state was not found at the UNO 
CASSCF{2,2} level. The occupation numbers of the 
UNOs are 1.998, 1.874, 1.052, 0.948, 0.126, and 
0.002 for HOMO - 2, HOMO - 1, HOMO, LUMO, 
LUMO + 1, LUMO + 2, respectively, showing that 
the active space in CASSCF should be composed 
of four orbitals and four electrons at least. How¬ 
ever, at the UNO CASSCF{4,4} level, the transition 
state corresponding to the 1 BR(G TS) state was not 
also found. This indicates that the wide active 
space with more orbitals and more electrons may 
be necessary [19]. On the contrary, the occupation 
numbers of the gauche minimum X BR state are 
given by 1.998, 1.996, 1.063, 0.937, 0.004, and 0.002 
at above same ordering, showing that two radical 
orbitals are concentrated to HOMO and LUMO. 
Similarly, for the s-cis cr7r-BR state, the occupation 
numbers of HOMO - 1, HOMO, LUMO, and 
LUMO -F 1 are 1.997, 1.0, 1.0, and 0.003, even the 
transition state of the rotation motion. In the previ¬ 
ous work of tricentric bisdesmiphiles such as 0 3 , 
CH 2 OCH 2 , CH 2 ONH, and HNOO [19a], it was 
shown that the CASSCF{2,2} method, in which the 
occupation numbers of only HOMO and LUMO 
suggest the radical orbitals, gives the qualitatively 
and quantitatively correct description of the sin¬ 
glet 7777 -radical states. Therefore, the active space 
of two orbitals and two electrons are reasonable to 
represent the 1,4-biradical states except for the 
case of the gauche ! BR transition state. 

Another optimum state with the s-trans geome¬ 
try was found as shown in Figure 7. This structure 
corresponds to the C s anti-first-order saddle point 
of Robb et al. [10b]. Our structures obtained by 


1.231 



H a -C-C-0: 86.5(85.6) 

Hp-C-C-0:104.6(104.9) 

FIGURE 7. Optimized geometrical parameters of the 
s-trans-type second-order saddle point obtained by 
UMP2 / 6-31G* method and UNO CASSCF{4,4} / 6-31G* 
method in parenthesis. 

UMP2/6-31G* are, however, a second-order sad¬ 
dle point on the potential energy surface with two 
imaginary frequencies of 1267/ and 37/ cm" 1 . The 
total energy is decreased with rotation of the oxy¬ 
gen molecule along the C—O bond axis, approach¬ 
ing to the C x gauche transition state of the first- 
order saddle point. 

The geometrical parameters of CASSCF shown 
in Figure 7 is obtained by the active space of four 
orbitals and four electrons, since the optimized 
geometry was not found at the CASSCF{2,2) level. 
The relative energy in Table I was estimated by 
the CASSCF{2,2} method at the geometry opti¬ 
mized by the CASSCF{4,4} method. The occupa¬ 
tion numbers of the s-trans 1 BR(SP2) state are 
given by 1.997, 1.832, 1.0, 1.0, 0.168, and 0.003 at 
same ordering to the above, while the occupation 
numbers of the s-trans minimum X BR state are 
1.997, 1.0, 1.0, and 0.003 for HOMO - 1, HOMO, 
LUMO, and LUMO + 1, respectively. 

REACTION PATHWAY THROUGH 

PEREPOXIDE INTERMEDIATES 

As shown in Figure 8, two optimized structures 
of the perepoxide type were found at both 
RMP2/6-31G* and CASSCF{2,2}/6-31G* levels. 
One (a) is a true minimum on the potential energy 
surface, another (b) is a second-order saddle point. 
These behaviors are very similar to those of 
CASSCF/4-31G and STO-3G by Robb et al. [10b]. 
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O 



o 



(C-C)-O-O : 111.7 
(112.7) 


(C-C)-O-O: 118.7 
(114.9) 


(A) (B) 

FIGURE 8. Optimized geometrical parameters of (a) perepoxide and (b) perepoxide-like (second-order saddle point) 
obtained by UMP2/6-31G* method and CASSCF{2,2} /6-31G* method in parenthesis. 


However, the geometrical parameters obtained are 
quite different from each other even at the CASSCF 
levels. For the minimum C s perepoxide, the 
CASSCF{2,2} gives the shorter bond distances; for 
example, 1.433 A of C—O and 1.532 A of O—O 
for 1.559 and 1.634 A of Robb et al. [10b], respec¬ 
tively. The RMP2 method, which gives a longer 
C—O bond distance and shorter O—O bond dis¬ 
tance than those of the CASSCF method, has also 
similar tendencies. On the other hand, for the 
second-order saddle point, the MP2 method gives 
the bond distances of 1.748 and 1.288 A of C—O 
and O—O, CASSCF/4-31G gives 1.821 and 1.467 
A, respectively. 

As can be seen from Table I, the relative stabili¬ 
ties of the perepoxide based on the gauche mini¬ 
mum l BR state are surprisingly conflicting each 
other at the calculational levels and basis sets 
employed. The minimum basis set by CASSCF 
gives positive relative energy, the double-zeta ba¬ 
sis set gives negative one, and the inclusion of the 
d-polarization switches the sign back to positive, 
giving the relative energy nearly equal to the 
gauche minimum 2 BR state. By the correction of 
the dynamic electron correlation, the stability of 
the perepoxide falls into the compatibility with the 
gauche minimum X BR state. 


The perepoxidic intermediates have been stud¬ 
ied by semiempirical and ab initio molecular or¬ 
bital works. The CNDO/2 study [22] suggested 
that the perepoxide was a quasi-intermediate with 
the shallow minimum on the potential energy sur¬ 
face. It is also suggested by the MINDO/3 study 
[8] that the reaction of 1 0 2 + H 2 C = CH 2 pro¬ 
ceeds through the perepoxide rather than the 1,4- 
biradical intermediates. The MNDO study [10b] 
indicated that the perepoxide is a local minimum 
on the potential energy surface with the higher 
energy by 32.6 kcal/mol than the dissociation limit 
of ! 0 2 + H 2 C — CH 2 . On the other hand, the 
extensive calculations at the GVB-CI level with the 
DZP basis set by Harding and Goddard [7] sup¬ 
ported the biradical mechanism from the result 
that the perepoxide was located as a minimum 
with higher energy than biradical minimum by 
about 8 kcal/mol. The ab initio MO MCSCF and 
CCI study of Roos et al. [9] concluded that the 
perepoxide is a preferable reaction pathway rather 
the 1,4-biradical intermediates. At last, we can see 
the results by Robb [10b] shown in Table I. 

It is apparent from above discussion that the 
stability of the perepoxide is very sensitive to the 
basis sets and calculational levels employed in 
describing the electronic structure. This means that 
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the relative stability to the 1,4-biradical states and 
the reasonable reaction pathway are not conclusive 
within the previous investigations and our calcula- 
tional levels, indicating that more high-quality cal- 
culational levels are necessary to elucidate the 
clear reaction mechanism of 1 0 2 + H 2 C = CH 2 . 
These will be discussed later in this text. 

1,2-DIOXETANE AS THE 

REACTION PRODUCT 

Figure 9 shows the most stable geometries of 

1,2-dioxetane optimized at both RMP2/6-31G* and 
CASSCF{2,2}/6-31G* levels with all positive vi¬ 
brational frequencies. This structure has the C 2 
symmetry with twisted angles of 19.2° and 10.1° at 
RMP2 and CASSCF levels, respectively, being con¬ 
sistent with that of Robb et al. [23]. On the other 
hand, the RHF/6-31G* gives the structure with 
C lv symmetry as a local minimum on the potential 
energy surface with the lowest positive frequency 
of 54 cm -1 , as shown in Figure 10. The RMP2 and 
CASSCF methods have also the similar structures 
shown in Figure 10. These states are, however, the 
first-order saddle point with single imaginary fre¬ 
quency and have higher energy by 0.68 and 0.01 
kcal/mol than the state of the C 2 symmetry at 
RMP2 and CASSCF levels, respectively, indicating 
that the C—O bonds in the ground state of the 

1,2-dioxetane swing alternately. 

As shown in Table I, the relative stability be¬ 
tween the ! BR(G1) and the 1,2-dioxetane at the 
CASSCF{2,2}/6~31G* level gives comparable value 

1.521 
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to that at the MP2/6-31G* level, being quite dif¬ 
ferent from 27 kcal/mol of CASSCF/4-31G by 
Robb et al. [10b]. The heat of 1,2-dioxetane forma¬ 
tion at CASSCF{2,2} level is estimated to be 33.7 
kcal/mol compared with 12.8 [9] and 4.0 [10b] 
kcal/mol, being reasonable value with 30 kcal/mol 
of thermochemical estimation. This indicates that 
the polarization function and correlation effect play 
important roles for describing the electronic struc¬ 
ture of the 1,2-dioxetane. 

Robb et al. [23] studied the reaction mechanism 
of the chemiluminescent decomposition of 1,2-di¬ 
oxetane by using CASSCF/4-31G method. They 
gave the relative energies between 1,2-dioxetane 
and two formaldehydes by 57.1 kcal/mol at 
CASSCF level and 61.4 kcal/mol at MP2 level 
using the optimized geometries of CASSCF. As is 
shown in Table I, our relative energy is 32.7 
kcal/mol by CASSCF{2,2}/6-31G* method, being 
smaller than their CASSCF results. Flowever, the 
relative energy of 53.5 kcal/mol at the MP2/6- 
31G* level is comparable with their result. Since 
discussion of the decomposition of 1,2-dioxetane is 
not part of this study, the investigation of this 
problem should be left in future. 

From Figures 9 and 10, the carbon-carbon bond 
distance of 1,2-dioxetane at the MP2 level is given 
by 1.509 A, which is shorter than the standard 
single-bond length of C—C single bond. The oxy¬ 
gen-oxygen bond distance is 1.521 A, being quite 
longer than 1.421 A given by geometry optimiza- 



FIGURE 9. Optimized geometrical parameters of FIGURE 10. Optimized geometrical parameters of 

1,2-dioxetane with C 2 symmetry obtained by 1,2-dioxetane with C 2v symmetry obtained by 

RMP2 / 6-31G* method and CASSCF{2,2} /6-31G* RMP2/6-31G*, CASSCF{2,2} /6-31G* in parenthesis, 

method in parenthesis. and RHF/6-31G* in square brackets. 
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tion of RHF/6-31G*, indicating that the corre¬ 
lation effect plays an important role to describe the 
electronic structure of 1,2-dioxetane. The 
carbon-oxygen bond distance also changes from 
1.423 to 1.458 A by inclusion of correlation effects. 

RELATIVE STABILITIES OF REACTION 

INTERMEDIATES AT VARIOUS 

CALCULATIONAL LEVELS 

Table III summarizes the relative energies of 
reaction intermediates, transition state, and prod¬ 
ucts estimated by various levels of theories using 
the geometries optimized at the MP2/6-31G* level. 
The relative energies of the perepoxide are surpris¬ 
ingly sensitive to the calculational levels of theo¬ 
ries. 

Compared with the energy barrier height of 18.0 
kcal/mol from ^RfGl) to 1 BR(G TS) by the UHF 
method, the APUHF method increases to 28.2 
kcal/mol, while the CASSCF method with the 
nondynamic electron correlation does not almost 
increase, that is, 19.9 kcal/mol. Though the dy¬ 
namic electron correlation is important to describe 
more precisely the singlet biradical states, the rela¬ 
tive energies of UMP2 and UMP4 are not almost 
changed. The relative energy by the APUHF, 
APUMP2, and APUMP4 methods are largely in¬ 
creased as well as the CASMP2. The coupled-clus¬ 
ter methods decrease the barrier height to 9.9 and 
6.7 kcal/mol by CCSD and CCSD(T), respectively. 

On the other hand, as is shown in Table III, the 
perepoxide is largely fluctuated by the calcula¬ 
tional levels. Compared with results of the RHF 


method, the nondynamic and dynamic electron 
correlation decreases the relative energies between 
^RfGl) state and the perepoxide. At the MP2 and 
MP4 levels, the stabilities of the perepoxide are 
compatible with the ^RCGl) state. The coupled- 
cluster method unstabilized the perepoxide to 6-8 
kcal/mol. 

The relative stabilities of the 1,2-dioxetane are 
not so sensitive to the calculational levels, falling 
into between 40 and 52 kcal/mol. However, the 
CASMP2 method decreases it to about 33 kcal/mol. 
The energy differences between DO and two 
formaldehyde are nearly equal to 50-60 kcal/mol 
except for the CASSCF and CASMP2 methods. 

It is apparent from Table III that the reaction 
pathways from the singlet oxygen plus ethylene to 
1,2-dioxetane will be differently described by the 
calculational levels. In the UHF and CASSCF 
methods, the reaction pathway through 1,4-biradi- 
cal states is preferable rather than through the 
perepoxide intermediate due to the greater stabil¬ 
ity of the BR state than the perepoxide. At the MP2 
level, though the transition state from ! BR(G1) 
state or perepoxide to the 1,2-dioxetane is not 
found, two reaction pathways might be competi¬ 
tive with each other. At the coupled-cluster 
method, the ^RCTS) state and the perepoxide, 
which is a local minimum on the potential energy 
surface, are iso-energetics, indicating that the pos¬ 
sibility of reaching the singlet BR(G1) state is higher 
than the perepoxide intermediate. 

As was examined previously [6d, 13, 19a] a 
potential energy curve for the dissociation process 
of covalent bond in a bicentric system is divided 


TABLE III_ 

Relative stabilities (kcal / mol) of reaction intermediates estimated by various levels of theory based on the 
geometries optimized by RMP2 or UMP2 using 6-31G* basis set. 


Methods \ States 

’BR (G1) a 

’BR (G TS) 

Perepoxide 

DO 

2H 2 C = O 

R(U)HF 

0.0 

18.0 

30.4 

-11.5 

-75.7 

APUHF 

0.0 

28.2 




CASSCF 

0.0 

19.9 

4.9 

-51.0 

— 82.1 b 

R(U)MP2 

0.0 

19.9 

-0.6 

-51.5 

-105.0 

APUMP2 

0.0 

34.6 




R(U)MP4 

0.0 

16.4 

2.2 

-46.9 

-99.9 

APUMP4 

0.0 

29.3 




CCSD 

0.0 

9.9 

8.1 

-40.9 

-93.7 

CCSD(T) 

0.0 

6.7 

5.7 

-43.5 

-95.1 

CASMP2 

0.0 

35.5 

-4.6 

-33.4 

— 109.3 b 

CASPT2 

0.0 

17.0 

5.0 

-42.9 

— 100.6 b 

a Total energies are given in footnote in Table II. 

b The formaldehyde dimer with C — C distance of 10 A was calculated. 
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into three characteristic regions: (I) stable region, 
(II) intermediate (transition) region, and (III) un¬ 
stable (magnetic) region. As is well known, the full 
configuration interaction (FCI) provides a reason¬ 
able potential curve over the whole region. The 
RHF and RMP methods can be applied to species 
in region I, but they usually give an unreasonable 
result in region III. The RHF coupled-cluster (RCC) 
method is suitable to species in region I, usually 
reliable in region II, and unreasonable in region III. 
On the other hand, the UHF method provides a 
proper dissociation curve, but it suffers the so- 
called spin contamination error, giving the higher 
energy than FCI over the whole region. The UMPn 
method provides the potential curve close to FCI, 
but usually fails for the description of species in 
region II. However, the potential curve and wave 
function presented by the UHF coupled-cluster 
(UCC) should be very close to that by FCI if the 
exponential part is taken to involve sufficient 
higher excitation operators. In fact, the spin con¬ 
tamination errors on the potential curve were not 
serious even at the truncated UCCSD(T) level in 
the bicentric systems examined before [14]. 

It seems, therefore, that the coupled-cluster 
method is reliable and necessary to describe the 
reactions including the unstable or radical inter¬ 
mediate. The perepoxide and 1,2-dioxetane con¬ 
structed of the covalent bond should be located in 
region I, while the singlet biradical intermediates 
and transition state with strong biradical character¬ 
istics are in region III. Therefore, the coupled-clus¬ 
ter method is reliable and necessary to describe the 
reactions, including the unstable or radical inter¬ 
mediate. It is concluded from Table III that the 
singlet oxygen reaction to ethylene molecule will 
proceed by a two-step mechanism through 1,4-bi¬ 
radical intermediates, though the transition state 
should be located by coupled-cluster treatment or 
more sophisticated method. 


Conclusion 

The reaction mechanism and relative stabilities 
of the intermediates and transition states in the 
reaction of ethylene molecule plus 0 2 using ab 
initio MO theories at several levels of theory with 
the correction of the nondynamic and dynamic 
electron correlation effects were systematically in¬ 
vestigated. Full geometry optimizations of the cor¬ 
responding biradical (BR) intermediates, perepox¬ 


ide (PE), and 1,2-dioxetane (DO) were performed 
by CASSCF{2,2}/6-31G* method with nondy¬ 
namic electron correlation effect and RHF or UHF 
(R or U)MP2/6-31G* method with dynamic elec¬ 
tron correlation effect. 

Four optimized geometries of s-c is, two 
gauche's, and s-trans types in the l BR states were 
found by both UMP2/6-31G* and CASSCF 
{2,2}/6-31G* methods, possessing biradical char¬ 
acteristics with terminal methylene moiety and 
terminal oxygen atom. One of two gauche's and 
s-trans types are local minimums on the potential 
energy surface. For the rotation around the C—O 
bond axis, four ! BR states are smoothly connected 
with the rotation barrier less than 4 kcal/mol, 
indicating that the free rotation easily occurs in the 
singlet state. One of two gauche types, which was 
found as a new geometry of 1,4-biradical singlet 
state, and s-cis-type geometry are the transition 
states of the rotation motion. 

The C a gauche geometry was also found as a 
transition state on the potential surface. The IRC 
study showed that it connects smoothly to the 
gauche 1 BR minimum state for the reverse direc¬ 
tion and the dissociation limit of 1 0 2 + H 2 C = 
CH ? for the forward direction. It is, therefore, 
concluded that the reaction of 0 2 + H 2 C = CH 2 
through the 1,4-biradical intermediates initially 
proceeds through the C x gauche transition state to 
form the gauche a BR minimum state, following 
that the free rotation of 0 2 moiety along the C—O 
bond axis occurs due to the energy barrier less 
than 4 kcal/mol. 

The relative stabilities of four 1,4-biradical 
states, the transition states, perepoxide, and 1,2-di¬ 
oxetane were estimated again by the APUHF [13], 
APUMP ft (n = 2,4) [6], CCSD, CCSD(T) [6d, 14], 
CASMP2 [15], and CASPT2 [12] methods at the 
geometries optimized at both CASSCF{2,2}/6-31G 
and MP2/6-31G* levels. The APUHF gives the 
same ordering of the relative stabilities for four 
1,4-biradical states at the geometries optimized by 
CASSCF{2,2} and UMP2 methods, while the 
APUMP (n = 2,4) provides the reverse stabilities 
for 1,4-biradical states. It may be, therefore, noted 
that both APUHF and APUMP methods are suit¬ 
able and unsuitable to describe the qualitative 
pictures of the species and quantitative pictures, 
respectively. The description of the perepoxide is 
very sensitive for basis sets employed and levels of 
theory, causing the difficulties determining the 
reaction mechanism of 1 0 2 + H 2 C = CH 2 . For the 
description of the 1,2-dioxetane, the correction of 
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the dynamic electron correlation such as MP2 
treatment is necessary at least. 

As is mentioned in the previous section, the 
RHF coupled-cluster (RCC) method is suitable for 
species in the stable region I and usually reliable 
in intermediate region II. The potential curve and 
wave function presented by the UHF coupled- 
cluster (UCC) should be very close to that by FCI 
over the whole regions [6d, 12]. It seems, therefore, 
that the CCSD and CCSD(T) methods are neces¬ 
sary to elucidate the reaction mechanism including 
the unstable and radical intermediates. From the 
data of the CCSD and CCSD(T) methods, the reac- 
tion of 0 2 + H 2 C = CH 2 proceeds by a two-step 
mechanism through the 1,4-biradical intermediates 
rather than through the perepoxide. 
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ABSTRACT: Molecular dynamics simulations of clusters of Li + and Be 2+ cations 
with up to 12 water molecules were performed calculating the particle trajectories using 
Hartree-Fock-derived forces. It was found that independent of the starting configuration 
tetrahedral clusters are obtained in less than 1 ps. Only for Li + clusters, transition states 
with five and three water molecules are found. We discuss the structures of the clusters 
and the vibrational dynamics of the water molecules and compare them with other ab 
initio simulations, with simulations using analytical potential functions, and with static 
calculations. Water-dissociation (hydrogen transfer) reactions which take place at elevated 
temperatures were investigated. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 803-816, 
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Introduction 

T he hydration of ions plays a key role in many 
processes occurring in solution chemistry and 
is of great importance in many areas of physics. 
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biochemistry, and biology. The molecular dynam¬ 
ics (MD) method has established itself as a valu¬ 
able tool to obtain insight into the properties of 
aqueous solutions and hydrated ions at a micro¬ 
scopic level [1-3]. 

In this article, we report structural and dynami¬ 
cal results from MD simulations of clusters of Li + 
and Be 2+ cations with water molecules. Many- 
body effects were incorporated into the simulation 
by using energies and forces from Hartree-Fock 
wave functions to propagate the trajectories. Due 
to the strong interactions caused by the small and 
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highly charged cation, aqueous Be 2+ clusters are 
ideal systems for studying strong nonadditive in¬ 
teractions. Comparisons are made with clusters of 
the singly charged Li + cation for which nonaddi¬ 
tive effects are less important but for which only a 
few theoretical studies on clusters exist. 

Experimental [4-6], quantum chemical [7-9], 
and simulation [6, 10, 11] studies on aqueous salt 
solutions containing Be 2+ all report or predict 
a coordination number of four for not-too- 
concentrated solutions. For example, O-NMR [5] 
experiments on a Be(C10 4 ) 2 solution at 295 K gave 
a coordination number of 4.1 ± 0.2. 

It has been shown by ab initio calculations of 
clusters [7-9] that the high charge density on the 
Be 2+ ion leads to strong nonadditive interactions. 
A comparison of classical molecular dynamics 
simulations with pair potentials only and with 
three-body effects incorporated [10,11] showed 
that in correctly determining the coordination 
number nonadditive interactions play a crucial 
role. 

Recently, calculations on Be 2+ (H 2 0) n systems 
were also carried out using the so-called 
Car-Parinello (CP) molecular dynamics method 
by Marx et al. [12-14]. Their results are compared 
with our findings in more detail in the subsequent 
sections. 

Li + ions in water were among the first molecu¬ 
lar liquids studied in computer simulations [15] 
and there exist many MD simulation studies of Li 
ions in various concentrations [16-18]. While the 
coordination number of Li + in dilute aqueous 
solutions is still inconclusive, most of the newer 
studies assume a sixfold coordination to be most 
likely. 

In the following sections we discuss at first the 
details of our calculations. Then, structural proper¬ 
ties of the clusters, simulation experiments per¬ 
formed in order to study the dissociation of water 
molecules, and vibrational properties are dis¬ 
cussed. Finally, we compare the method used in 
the present work with alternative ones and sum¬ 
marize our conclusions. 


Details of Calculations 

In most MD simulations, analytical potentials 
are employed to derive the interparticle forces and 
most of the analytical potentials assume a pairwise 
additivity of the interactions, at least between un¬ 


bonded atoms. While this is often a good approxi¬ 
mation, it becomes inaccurate or even fails in cases 
where effects like polarization, charge transfer, or 
bond-breaking/bond formation are relevant. In 
principle, analytical potentials can be constructed 
to account also for such effects and many efforts 
have been made along these lines [19-22]. An 
alternative that became viable in recent time is to 
employ quantum chemical methods to treat the 
system as a whole when calculating the forces 
acting on the particles. 

For the MD simulations reported here, the parti¬ 
cle trajectories were calculated via forces from 
Hartree-Fock Hamiltonians with small basis sets. 
The wave function was optimized at every time 
step. The equations of motion were solved using a 
velocity-Verlet algorithm [23] and all degrees of 
freedom were kept active. For the constant- 
temperature simulations, the Berendsen thermo¬ 
stat [24] was used. A time step of 0.3 fs was 
sufficient to resolve also the hydrogen motions 
and a typical MD run was performed for 6000 time 
steps, corresponding to 1.8 ps elapsed time. One 
time step of a complex with 12 water molecules 
took about 180 s on a single MIPS 8000 processor 
of an SGI power challenge computer. The time is 
almost exclusively spent in the force calculation 
while the time for obtaining the new positions and 
velocities is negligible. The forces were obtained 
from a slightly modified Gaussian 94 computer 
code [25] which we interfaced to our MD routines. 

THEORETICAL LEVEL 

In principle, any suitable wave function—or 
density functional-based method for which energy 
gradients are available—can be used in an ab 
initio MD approach. We decided to work at the 
Hartree-Fock (HF) level since for the systems 
studied here electron correlation plays a minor 
role and a comparison of the computer codes ac¬ 
cessible to us showed the calculations of forces 
to be about twice as fast for HF as for density 
functional-based methods like B3LYP [26, 27] or 
for MP2. Density functionals without gradient cor¬ 
rection are considerably faster but they fail to 
reproduce hydrogen bonding correctly. 

To be able to handle systems with two hydra¬ 
tion shells, we are restricted to use rather small 
basis sets. For a comparison of basis-set quality, 
we carried out geometry optimizations on the sys¬ 
tems Be 2+ (H 2 0) n (n = 1-4) and list the results in 
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Table L (The binding energy per water molecule is 
given by AE = [E([Be(OH 2 )J 2 + ) - E(Be 2+ ) - 
nE(H 2 0)]/n). 

The CEP [29] basis set in single-zeta contraction 
compares energetically well with larger ones. On 
the HF level, it gives A E values of -136.6, —129.6, 
—116.3, and —103.1 kcal/mol for the mono- to 
tetrahydrates, compared with, e.g., -135.6, -128.0, 
-107.9, and —96.3 kcal/mol for the "DZP" basis 
set and it is (accidentally) even closer to the 6- 
311G** basis at the MP2 level (-147.3, -136.7, 
—119.0, and —103.2 kcal/mol). We found that small 
double-zeta basis sets like 3-21G give binding 
energies somewhat too high. 

Table I also shows that the CEP basis set in a 
single-zeta contraction leads to Be—O distances 
about 0.07 A too long. Also, the O—H distances 
are slightly too long. These errors stay constant 
during the series of hydrates. The use of CEP in a 
double-zeta contraction would have improved 


these distances at the expense of a fivefold in¬ 
crease of computing time. The H—O—H angle is 
in good agreement with more expensive basis sets 
and methods. We finally decided to use the CEP 
basis set in a single-zeta contraction for the MD 
simulations. Table II lists the results for the 
Li + (H 2 0)„ (ft = 1-4) clusters with the CEP basis 
set. Table III shows the harmonic vibrational fre¬ 
quencies for Be 2+ (H 2 0)„ (ft = 1-4) compared to 
two other basis sets and methods. The shifts in the 
O—H stretching vibrations are the largest for n = 
1 and decrease then with increasing charge trans¬ 
fer to the cation. Most relevant are the differences 
between the O—H stretching vibrations of the 
isolated H 2 0 molecule and the fourfold coordi¬ 
nated clusters. For the CEP basis set, the slight 
average upshift of the symmetric (~ 10 cm" 1 ) and 
the large downshift of the asymmetric stretching 
mode (~—290 cm -1 ) are in reasonable agreement 
with the 3-21G values and also with shifts found 


TABLE I_ 

Binding energies and optimized geometries of Be 2+ (H 2 0) n with n = 1-4. 



Method and 

basis set 

n 

X 

1 

o ^ 

^HOH 

(degree) 

r Be—O (A) 

A E 

(kcal / mol) 

Ref. 

HF 

CEP-4G 

1 

1.033 

107.5 

1.635 

-136.6 

This 



2 

1.027 

108.3 

1.644 

-129.6 

work 



3 

1.020 

108.9 

1.682 

-116.2 




4 

1.014 

108.8 

1.736 

-103.1 


HF 

DZP 

1 

0.976 

108.0 

1.536 

-135.6 

[7] 



2 

0.968 

108.3 

1.537 

-128.0 




3 

0.961 

107.9 

1.613 

-107.9 




4 

0.956 

108.0 

1.662 

-96.3 


HF 

3-21G 

1 

0.999 

108.7 

1.517 

-169.2 

[8] 



2 

0.991 

109.6 

1.533 

-158.0 




3 

0.981 

110.2 

1.585 

-140.0 




4 

0.975 

110.6 

1.645 

-123.1 


HF 

3-21G* 

1 

0.999 

108.0 

1.503 

-161.3 

[8] 



2 

0.990 

108.4 

1.528 

-149.0 




3 

0.980 

108.4 

1.580 

-130.8 




4 

0.974 

108.2 

1.648 

-114.1 


DFT 

Plane wave 

1 

0.991 

108.0 

1.490 

-149.6 

[12] 


basis set 

2 

0.982 

108.7 

1.525 

-140.2 



+ ECP 

3 

0.974 

108.6 

1.585 

-127.9 




4 

0.970 

108.6 

1.668 

-117.7 


MP2 

6-311G** 

1 

0.991 

108.0 

1.507 

-147.3 

[28] 



2 

0.983 

108.1 

1.526 

-136.7 




3 

0.974 

107.8 

1.579 

-119.0 




4 

0.970 

107.0 

1.668 

-103.2 
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TABLE II ___ 

Hartree-Fock energies and optimized geometries of Li + (H 2 0) n with n = 1-4 (CEP basis set). 


n 

r O— H 

(A) 

ZHOH 

(degree) 

r li —O 

(A) 

A E 

(kcal / mo!) 

1 (C 2l/ ) 

1.0090 

107.2 

1.945 

-41.7 

2 (D zd ) 

1.0081 

107.5 

1.961 

-39.9 

3 (Dg) 

1.0074 

107.9 

1.994 

-36.6 

4 

1.0067 

108.0 

2.043 

-32.9 


for other cations [31]. Interestingly, the reported 
shifts from the DFT calculations with a plane-wave 
basis set [12] are zero. 


Structural Results 

In contrast to Be 2+ where tetrahedral coordina¬ 
tion is expected if nonadditive effects are incorpo¬ 
rated in the simulations, this is not so clear for Li + . 
For example, a neutron diffraction study of a 3.57 
m LiCl solution resulted in a coordination number 
of 5.5 ± 0.3 [32] and MD simulations of aqueous 
solutions at different concentrations but with pair 
potentials also seem to indicate that in infinitely 
diluted systems an octahedral tetrahedral coordi¬ 
nation is preferred [16-18]. On the other hand, 
simulations on the system Li + /H 2 0 with nonad¬ 


ditive potentials [33, 34] resulted in tetrahedral 
coordination of the cation. 

We performed simulations with different initial 
configurations. Independent of the initial state, te- 
trahedrally coordinated complexes are obtained. 
The same starting configurations for Be 2+ (H 2 0)„ 
and Li + (H 2 0)„ (n = 6,8,10,12) were used. The 
simulations were performed at 300 K using a 
Berendsen thermostat [24] for equilibration. 

Figure 1 shows the different starting configura¬ 
tions of the Be 2+ clusters. In the center of the 
figure, a representative configuration resulting 
from the simulations is shown. The tetrahedral 
coordination of the oxygen atoms of the inner-shell 
water molecules (plotted in black) can be seen 
clearly. This particular configuration originates 
from the initial structure Be 2+ (H 2 0) 2 (H 2 0) 8 (c) 
but the other three starting configurations would 


TABLE III___ 

Harmonic frequencies of the most important modes in the Be 2+ (H 2 0) n clusters with n = 0-4. 




H 2 0 



n = 1 



n = 2 



n = 3 



n = 4 


Mode 

CEP 

(this 

work) 

3-21G 
[8] 

Plane 

wave 

[12] 

CEP 

(this 

work) 

3-21G 
[8] 

Plane 

wave 

[12] 

CEP 

(this 

work) 

3-21G 
[8] 

Plane 

wave 

[12] 

CEP 

(this 

work) 

3-21G 
[8] 

Plane 

wave 

[12] 

CEP 

(this 

work) 

3-21G 
[8] 

Plane 

wave 

[12] 

Be—(OH 2 ) 
bending 




888- 

913 

944- 

955 

710- 

816 

831- 

905 

866- 

950 

640- 

840 

718- 

912 

751- 

938 

580- 

740 

636- 

783 

657- 

785 

490- 

600 

Be—O 
stretch 




963 

1126 

1060 

1213 

1376 

1210 

1028 

1129 

980 

877- 

982 

1007- 

1018 

830 

H—O—H 
bending 

1727 

1799 

1610 

1752 

1820 

1700 

1754 

1824- 

1827 

1730 

1761- 

1764 

1834- 

1840 

1730 

1767- 

1783 

1830- 

1840 

1730 

O—H 
sym. str. 

3900 

3812 

3710 

3676 

3500 

3490 

3742 

3588- 

3610 

3600 

3835- 

3853 

3703- 

3726 

3680 

3903- 

3919 

3775- 

3796 

3710 

O—H 4246 

asym. str. 

3945 

3830 

3709 

3511 

3520 

3803 

3630 

3620 

3933- 

3935 

3764- 

3966 

3750 

4033- 

3936 

3853- 

3857 

3830 


The experimental values for H 2 0 are 1595, 3657, and 3756 cm" 1 for the anharmonic and 1648, 3832, and 3943 cm 1 for the 
harmonic frequencies [30]. For the larger complexes, the lowest and highest frequency of a set of modes are given, if available. 
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FIGURE 1. Different starting configurations of the Be 2+ (respectively, Li + )-water clusters: (a) Be 2+ (H 2 0) 6 ; 

(b) Be 2+ (H 2 0) 8 ; (c) Be 2+ (H 2 0) 2 (H 2 0) 8 ; (d) Be 2+ (H 2 0) 4 (H 2 0) 8 ; (center) atypical Be 2+ (H 2 0) 12 configuration resulting 
from (c). Black spheres represent oxygen atoms of water molecules in the first hydration shell. 


lead to similar final configurations (as is symbol¬ 
ized by the four arrows in Fig. 1), qualitatively 
only differing in the number of second-shell 
water molecules. Configurations with Li + as a 
central cation, on average, show the same fourfold 
coordination. 

First, we discuss the simulations involving 
beryllium, then the ones with lithium cation: 

Be 

(a) The initial configuration denoted (a) 
in Figure 1 is a Be 2 + (H 2 0) 6 cluster which was 
obtained from the T h symmetrically optimized 
geometry [8] and which was also used in [12]. 
There, it was found that, starting from an octa¬ 
hedral cluster at 20 K, a fourfold hydrated struc¬ 
ture is obtained after 0.2 ps. We found that a 
similar tetrahedral structure is reached after 0.1 ps 


(by slightly displacing the cation from the origin, 
we had removed the symmetry from the system). 
Then, we carried out equivalent simulations with 
Be 2+ (H 2 0) 8 (b), a hexahedral initial configuration 
(eight water molecules cubically arranged in the 
first shell), with Be 2+ (H 2 O) 10 (c), two water 
molecules in the first shell and eight in the second 
and with Be 2+ (H 2 0) 12 (d), four water molecules in 
the first and eight water molecules in the second 
shell. All these simulations resulted in fourfold 
coordinated first hydration shells after similar 
times (~ 0.1 ps). No exchange of water molecules 
between the first and the second hydration shell 
was observed after this time. 

A quantitative picture of the solution structure 
can be obtained from the radial distribution func¬ 
tions. In the following discussion, we refer to 
Be 2+ (H 2 0) 12 unless mentioned otherwise. Because 
finite clusters are investigated, the radial distribu- 
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tion functions are not normalized by r 2 , as is 
usually done for periodic systems. The first maxi¬ 
mum of g Be _ 0 (r) at 1.72 A (Fig. 2, solid line) can 
be compared to 1.65 A [13] (CP simulation) and 
1.75 A (MD study with a threebody potential [11]). 
The value obtained from the X-ray diffraction study 
[6] of a 5.3 molal aqueous BeCl 2 solution is 1.67 A. 
The Be—O distances in crystals with fourfold co¬ 
ordinated Be 2+ ions are shorter and lie between 
1.60 and 1.63 A [35]. Considering that the CEP 
basis set leads to an overestimation of the Be—O 

o 

distance of about 0.07 A (Table I), the values from 
X-ray [6] and the CP simulation [13] are probably 
more accurate than in the MD study [11] and are 
consistent with our data. The peak maximum of 
1.72 A can further be compared with the values of 
the optimized cluster with (1.714 A, Table IV) and 
without (1.736 A) the eight second-shell water 
molecules, showing that the dynamic motion has a 
smaller influence on the average Be—O distance 
than on the second hydration shell. 

The second maximum of g Be _ 0 at 4.07 A devi¬ 
ates more from the potential minimum at 3.957 A. 
For the three systems with fewer water molecules, 
values between 3.6 and 4.4 A are obtained. The 
first and second peaks of g Be __ G are completely 
separated and are well defined, similar to what 
was found in the MD study [11] with periodic 


boundaries. These data cannot be compared fur¬ 
ther to the CP simulation study [13] because the 
small periodic box of about 10 A containing 31 
water molecules in [13] already disturbs the sec¬ 
ond hydration shell, leading to an unstructured 
g Be _o( r ) curve beyond 3.5 A. 

The Be—H distribution function has its first 
maximum at 2.42 A (Fig. 2, dotted lines). This is 
again about 0.07 A more than the value obtained 
in [13]. It is again close to the values of the poten¬ 
tial minimum (2.457 A), indicating that the water 
molecules in our simulation are, on average, only 
slightly tilted away from a planar Be—H 2 0 sub¬ 
configuration. 

The integral values are also plotted in figure 2 
for O and H and show the expected plateaus for 4 
and 12 oxygen atoms or 8 and 24 hydrogens, 
respectively. The distribution of the H—O—H an¬ 
gles for the first and second hydration shell have 
maxima at 111.5° and 108.3°, respectively (opti¬ 
mized values: 109.9° and 108.1°) and thus show 
that the angle is little influenced by the cation. The 
distribution of the intramolecular O—H distances 
in the first shell (Fig. 3, solid line) gives a peak 
with a maximum between 1.00 and 1.04 A, nearly 
symmetrically around the potential minimum of 
1.023 A. Compared to an isolated H 2 0 molecule, 
the bonds are elongated by about 0.01 A. The 



FIGURE 2. Be—O radial distribution function g Be ___ 0 {r) with running coordination number n Be — 0 (r) (solid curves) 
and Be—H radial distribution function g Be _ H (r) with running coordation number n Be _ H (r) (dotted curves) of 
Be 2+ (H 2 0) 12 . 
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TABLE IV 


Optimized geometries of the Be 2 + 

(H 2 0) 4 (H 2 0) 8 and the Li + (H 2 0) 4 (H 2 0) 8 

clusters (HF with CEP basis set). 



r O — H 

Z.HOH 

r Be —O 


(A) 

(degree) 

(A) 

Be 2+ (H 2 0) 4 (H 2 0) 8 




First shell 

1.023 

109.9 

1.714 

Second shell 

1.012 

108.1 

3.957 

Li 2+ (H 2 0) 4 (H 2 0) 8 




First shell 

1.008 

108.3 

2.033 

Second shell 

1.011 

108.0 

4.476 


average distance between the oxygens in the first 
and second hydration shell in our the simulation 
(2.35 A) is about 0.1 A longer than the optimized 
value (2.243 A), indicating a large dynamic 
contribution. 

Li 

Simulations with the Li + ion were performed 
with the same set of starting configurations 
as described above. A tetrahedrally hydrated 
Li + ion emerges as the most stable subsystem 


but its formation is much slower than for 
Be 2+ and we observed “transition states" with 
different coordination numbers in the initial states 
of the simulations. It took between 0.45 ps [from 
Li + (H 2 0) 2 (H 2 0) 8 ] and 0.70 ps [from Li + (H 2 0) 8 ] 
until the tetrahedral complex was formed. 

Figure 4 shows two examples of intermediate 
structures: (a) The complex with a threefold coor¬ 
dinated Li + was formed from the Li + (H 2 0) 6 clus¬ 
ter after 0.16 ps. (b) A complex with five water 
molecules was formed from Li + (H 2 0) 2 (H 2 0) 8 
after 0.25 ps. Similar complexes with three- and 



FIGURE 3. Distribution of the intramolecular O—H distances in the first hydration shells of Be 2+ (H 2 0) 12 (solid curve) 
and Li + (H 2 0) 12 (dotted curve). 


INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY 


809 






BISCHOF ET AL. 


(a) 



FIGURE 4. Snapshots of (a) an intermediate state with threefold coordinated Li + after simulating for 0.16 ps from the 
Li + (H 2 0) 6 configuration and (b) an intermediate state with fivefold coordinated Li + after simulating for 0.25 ps starting 
from Li + (H 2 0) 2 (H 2 0) 8 . 



fivefold coordination were also found for the other 
starting configurations. Since we used no periodic 
boundary conditions or any potentials confining 
the particles in the simulations, water molecules 
can—and will finally—evaporate from the second 
hydration shell. In the Be 2+ /H 2 0 system, the 
strong attraction of the water molecules makes this 
a process too slow to be observed, but for 
Li + /H 2 0, such evaporation is observed in all 
simulations. 

Figure 5 shows the radial distribution functions 
g Li _ 0 (r) and g u _ H (r ) for Li + (H 2 0) 4 (H 2 0) 8 . The 
first peak of the Li—O distance distribution has its 
maximum at 2.08 A. The corresponding value of 
the potential minimum is 2.03 A (Table IV). In 
simulations studies with nonadditive potentials 
[33, 34], values of 1.98 and 2.0 A were found. The 

o 

Li—O distances are about 0.35 A longer than the 
Be—O distances, which directly reflects the differ¬ 
ent ionic radii of the ions (Li + : 0.59 A vs. Be 2+ : 

o 

0.27 A) and shows that the ion-water attraction 
and not the water-water repulsion are dominat¬ 
ing. The g(r) curves decrease nearly to zero be¬ 
tween their first- and second-shell maxima. The 
first peak again integrates to four. 

The second hydration shell of Li + leads to a 
broad peak between 4.0 and 5.0 A with a maxi- 
mum at 4.57 A (potential minimum: 4.48 A). The 


tail of the function is caused by the slowly evapo¬ 
rating water molecules. The maximum of g Li — n * s 
located at 2.72 A. This is about 0.15 A longer than 
that found in a simulation carried out with pair 
potentials and periodic boundaries [36]. From our 
geometry optimization, a value of 2.75 A is ob¬ 
tained. The maxima of the distribution of the 
H—O—H angle are found at 108.5° and 107.5° for 
first and second shells, respectively. 

The intramolecular O—H distance distribution 
(Fig. 3, dashed line) has its maximum between 
0.993 and 1.020 A for the first-shell water 
molecules. They are thus slightly less elongated 
than in the complexes with Be 2+ . 

The distance between the oxygens in the first 
and second hydration shell from the simulation is 
about 0.05 A longer than between the correspond¬ 
ing potential minima. While this is only about half 
of the value found for Be 2+ , it is still a large 
deviation. 


Dissociation of Water Molecules 

Be 2+ can cause abstraction of hydrogen from 
water molecules, giving BeOH + . This is the reason 
why solutions of Be 2+ salts are acidic (a 0.5 molal 
solution of BeCl 2 has an approximate pH value of 
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FIGURE 5. Li—O radial distribution function g u _ 0 (r) with running coordination number n u _ 0 (r) (solid curves) and 
Li—H radial distribution function g Li „ H ( r ) with running coordination number n Li _ H (r) (dotted curves) of Li + (H 2 0) 12 . 


3.5). Ab initio MD allows the simulation of such 
dissociation reactions. At room temperature, this 
reaction is rather infrequent in the liquid phase 
and will be even more so in small clusters. In the 
case of elevated temperatures, however, the reac¬ 
tion takes place more rapidly. We performed the 
following experiments: 

(a) Starting from Be 2+ (H 2 0) 8 with velocities cor¬ 
responding to 20 K, we found water dissocia¬ 
tion according to 

Be 2+ (H 2 0) 8 -> Be + (H 2 0) 2 0H 

+ H 3 0 + + 4H 2 0 

after about 0.17 ps. The potential energy at the 
beginning of the simulation led to a tempera¬ 
ture of about 2000 K during this time. A typi¬ 
cal configuration of the products is shown in 
Figure 6. 

(b) The acidity of Be 2+ can be expected to be 
much higher if it is not saturated with four (or 
more) water molecules. Indeed, a small model 
system consisting of one molecule of water in 
the first hydration shell and one hydrogen 
bonded to the first one reacts after 0.3 ps at 
300 K according to 

Be 2+ (OH 2 )OH 2 -> BeOH + + H 3 0 + . 

(c) MD simulations starting with the equilibrated 
Be 2+ (H 2 0) 4 (H 2 0) 8 configuration were per- 



FIGURE 6. Typical configuration after dissociation of a 
water molecule from the Be 2+ (H 2 0) 8 cluster after 0.45 
ps simulation time. 


formed at different temperatures with intervals 
of about 1000 K. At 5000 K, we observed water 
dissociation after 0.45 ps. 

Figure 7 shows the distance between the hydro¬ 
gen atom which is being transferred and both of 
the oxygen atoms involved in the dissociation re¬ 
action. At start of the simulation, this hydrogen 
and the oxygen atom of the water molecule, which 
will be protonated, are about 6 A apart (light 
curve). The hydrogen exchange at 0.45 ps happens 
within about two cycles of the O—H vibrations 
and no protonated water dimer with a shared 
hydrogen or a similar species is formed. The vibra- 
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FIGURE 7. Hydrogen transfer in Be 2+ (H 2 0) 12 at high temperatures. The distance between a hydrogen atom and the 
oxygen atom in the same water molecule (black curve) and the oxygen atom to which the hydrogen is transferred (light 
curve) are shown as a function of time. 


tional amplitudes of the old and the new O—H 
bonds are similar. After the transfer, the two frag¬ 
ments, OH” (as part of BeOH + ) and H 3 0 + , imme¬ 
diately move away from each other. While these 
nonequilibrium simulation experiments on water 
dissociation are clearly not statistically representa¬ 
tive for water dissociation, in general, in clusters 
or in the liquid phase, there are only a few cases 
where such events have been observed in simula¬ 
tion studies. 


Vibrational Dynamics of the 
Water Molecules 

From an experimental point of view, the shifts 
of the O—H vibrations of water molecules which 
become members of ionic clusters or hydration 
shells are highly interesting since they allow one to 
probe the molecular environment [37]. 

From equilibrated Be 2 + (H 2 0) 4 (H 2 0) 8 and 
Li + (H 2 0) 4 (H 2 0) 8 clusters, simulations of about 
5000 time steps each were performed. The veloci¬ 
ties at every time step were used to calculate the 
vibrational modes. The results are listed in Table 
V. The vibrations of an isolated water molecule 
calculated from a room-temperature MD trajectory 
include anharmonic contributions but differ only 
by a few wave numbers from harmonic values 


obtained from the matrix of second derivatives at 
equilibrium geometry. It can be seen from Table V 
that even for Be 2+ —H 2 0 the difference is 
small compared to the frequency shift upon 
complexation. 

The MD frequency shift of the asymmetric O—H 
stretching mode for the Be 2 + (H 2 0) 4 system is -543 
cm” 1 and the one of the symmetric O—H stretch¬ 
ing mode is -244 cm -1 . The corresponding har¬ 
monic values are —537 and —224 cm” 1 . For com¬ 
parison, a Hartree-Fock study of Mg 2+ —H 2 0 and 
Al 3+ —H 2 0 [31] gave O—H (decoupled) fre¬ 
quency shifts of 220 cm” 1 for Mg 2+ and 670 cm” 1 
for Al 3 \ It seems reasonable that the OH shift 
resulting from the Be 2+ ion is located between the 
values of Mg 2 + and Al 3+ . 

In contrast to the Be 2+ ion, Li + causes no such 
strong frequency shift of the OH stretching modes 
compared to the single water OH stretching modes 
(symmetric stretch: +54 cm” 1 ; asymmetric stretch: 
-54 cm” 1 ) The corresponding value in [31] was 
-70 cm” 1 (O—H uncoupled). 

These test calculations serve as a basis for inter¬ 
preting the results for larger clusters. In Figure 8, 
the power spectra I(v) as calculated from the 
velocity vectors v(t) of the atoms are plotted (in¬ 
tensities are arbitrary and do not reflect IR 
intensities; denotes the Fourier transform; this 
formula is equivalent to calculating the Fourier 
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TABLE V__ 

Molecular dynamics-derived and harmonic frequencies (in cm _1 ) of the isolated water molecule and water 
in the Be 24 / H 2 0 and Li"V H 2 0 systems with 1 and 12 water molecules (the given frequencies refer to the 
molecules in bold typeface). 




H—0—H 
bending 

0—H 

sym. stretch 

O—H 

asym. stretch 

h 2 o 

Harmonic 

1727 

3900 

4246 

h 2 o 

MD 

1736 

3906 

4232 

Be 2+ OH 2 

Harmonic 

1752 

3676 

3709 

Be 2+ OH 2 

MD 

1736 

3662 

3689 

Li + OH 2 

Harmonic 

1782 

3955 

4152 

Li + OH 2 

MD 

1790 

3960 

4178 

Be 2 + (H 2 O) 4 (H 2 0) 8 

Harmonic 

1845 

3696 

3725 

Be 2 + (H 2 O) 4 (H 2 0) 8 

MD 

1845 

3689 

3710 

Be 2 + (H 2 0) 4 (H 2 0) 8 

Harmonic 

1743 

3912 

4211 

Be 2 + (H 2 0) 4 (H 2 0) 8 

MD 

1736 

3900 

4198 

Li + (H 2 0) 4 (H 2 0) s 

Harmonic 

1792 

3952 

4141 

Li + (H 2 0) 4 (H 2 0) 8 

MD 

1790 

3981 

4239 

Li + (H 2 0) 4 (H 2 0) 8 

Harmonic 

1729 

3913 

4242 

Li + (H 2 0) 4 (H 2 0) 8 

MD 

1763 

3927 

4252 


transform of the velocity autocorrelation function): 

I(v) ~ \&{v(t)) •'9tv(—t))\. 

Since no splitting into symmetric and asymmetric 
modes is visible, the following procedure [38] was 
employed to separate them: If the hydrogen veloci¬ 
ties relative to the oxygen velocities are projected 


onto the O—H bonds, their sums and differences 
should correspond to the symmetric and asymmet¬ 
ric modes. For Figure 8 and Table V, this scheme is 
used to separate the modes. It can be seen that 
their maxima nearly coincide. 

The first-shell bending mode appears at 1845 
cm -1 . For the second shell, the bending mode is 



FIGURE 8. Power spectra of the symmetric and asymmetric 0—H stretching modes of the water molecules in the first 
(solid curves) and second (dashed curves) hydration shells of Be 2 + (H 2 0) 12 . 
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located at 1736 cm -1 ; the symmetric O—H stretch¬ 
ing mode, at 3900 cm” 1 , and the asymmetric OH 
stretching mode, at 4198 cm” 1 (Table V). The fre¬ 
quency shift between the two shells is thus 211 
cm" 1 for the symmetric O—H stretching mode 
and 488 cm" 1 for the asymmetric stretching mode. 
The O—H modes of the second shell agree within 
a few wavenumbers with the OH stretching modes 
of an isolated water molecule. 

Figure 9 shows the power spectrum for 
Li + (H 2 0) 4 (H 2 0) 8 . The symmetric O—H stretch¬ 
ing mode of the first shell is located at 3981 cm" 1 , 
and the asymmetric stretching mode, at 4239 cm" 1 
(Table V). The second-shell frequencies are just 
slightly different (3927 and 4259 cm" 1 , respec¬ 
tively). There is no strong frequency shift of the 
OH stretching modes between the two shells in 
contrast to the Be 2+ system. It could perhaps have 
been expected that the agreement between har¬ 
monic frequencies and simulation-derived values 
is less close for the larger complexes than, i.e., for 
the more rigid monohydrates. However, as can be 
seen from Table V, at least for the O—H modes, 
this is not the case. 


Summary and Conclusions 

The data presented show that many but not all 
qualitative aspects of hydrated Be 2+ clusters can 


approximately be derived from optimized geome¬ 
tries. However, without the corresponding 
simulations, this fact would not have been 
easy to conclude. Besides providing the statis¬ 
tical distributions, the MD simulations show the 
occurrence of intermediate coordination states in 
Li + clusters and the dynamical effect on the dis¬ 
tance of the two hydration shells. The simulations 
involving hydrogen transfer indicate that it would 
be possible to study statistical features of these 
reactions by performing simulations with an ex¬ 
tended set of systematically varying initial condi¬ 
tions. The evolution of the Li + clusters from their 
initial configurations shows that in aqueous 
clusters of this cation the most stable hydration 
number of four can easily be disturbed. 

We found MD simulations with forces derived 
from ab initio wave functions to be a practical 
method for studying these systems. While this 
method is traditionally used for geometry opti¬ 
mization, it is not applied as often as the so-called 
Car-Parrinello (CP) scheme for molecular dynam¬ 
ics. In the latter, the electron density is tradition¬ 
ally expanded in a plane-wave basis set. More 
characteristically, however, the orbital coefficients 
are treated as part of the coordinates and the 
combined gradient of the orbital coeofficients and 
of the nuclear coordinates with respect to the en¬ 
ergy is used for propagating the system. 



FIGURE 9. Power spectra of the symmetric and asymmetric O—H stretching modes of the water molecules in the first 
(solid curves) and second (dashed curves) hydration shells of Li + (H 2 0) 12 . 
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At first it might seem that using the optimized 
wave function must be slower since iterations are 
needed at every time step, but this disadvantage is 
partially alleviated by the possibility of using a 
larger time step, since the electronic degrees of 
freedom must be combined with rather low ficti¬ 
cious masses which also complicate frequency cal¬ 
culations. The time step used in the present work 
(0.3 ps) is at least twice as large as the one typi- 
cally employed in CP simulations. A second argu¬ 
ment why the CP method could be advantageous 
is its reportedly better energy conservation [39]. 
Indeed, for perfect energy conservation, extremely 
accurate nuclear forces are necessary and they 
might be difficult to obtain from a wave function 
optimized to some threshold criterion than from 
the set of generalized coordinates. We did not, 
however, observe any problems related to this 
circumstance. 

From a pragmatic point of view, it seems that 
for the simulation of periodic systems there is 
presently no alternative to using plane waves as a 
basis set. For nonperiodic systems, simple 
“hierarchical" MD simulation schemes like the 
one we used here, where the wave function is 
optimized at every step and its gradient calcu¬ 
lated, might be equally economic. Finally, it shall 
be mentioned that there exist also successful at¬ 
tempts to combine Hartree-Fock-type calculations 
with the generalized coordinate treatment [40] as 
well as the “opposite" combination, using 
plane-wave basis sets within a traditional DFT 
scheme [41]. 
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ABSTRACT: The geometry and vibrational spectrum of TeCl 4 was calculated with 
various quantum chemical methods [Hartree-Fock, second-order Moller-Plesset (MP2) 
and generalized gradient approximation density functional theory (GGA-DFT)]. Five 
different basis-set combinations were tested: the relativistic effective core potentials with 
double-zeta split valence basis (RECP) of Hay & Wadt (HW) and Stevens et al. (CEP); the 
above RECP basis sets extended with polarization functions for Te and using a 6-31G* 
basis for Cl (HW* and CEP*); a medium-size all-electron basis set (ALL). The quality of 
the calculated data was assessed by comparison with recent experimental results. The 
Hartree-Fock method combined with the HW and CEP basis sets provided a very good 
approximation of the experimental vibrational spectra. The quality of the results is 
comparable to those of the best methods (MP2, B3-P, B3-PW with HW* and CEP* and 
B3-LYP, B3-P, B3-PW with the ALL basis set). However, the HW and CEP basis sets 
provided very poor geometry and vibrational frequencies when they were used in 
combination with any correlated method in this work. Similarly, the DFT methods using 
Becke's 1988 exchange functional (B-LYP, B-P, B-PW) without the inclusion of the exact 
exchange let to very poor results with the basis sets used in this study. © 1997 John Wiley 
& Sons, Inc. Int J Quant Chem 65: 817-826, 1997 
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Introduction 

V ibrational data are of great importance at 
various fields of chemistry and physics such 
as molecular mechanics and molecular dynamics, 
statistical thermodynamics, gas electron diffraction 
analysis, etc. However, their experimental deter¬ 
mination for inorganic molecules is difficult. High 
temperatures, long path lengths, and sensitive de¬ 
tectors are needed to detect the (often low-intense) 
vibrational bands of the low-volatile compounds. 
Additionally, the interpretation of the high-tem¬ 
perature spectra is hampered by the broad rota¬ 
tional contours, hot bands, and possible presence 
of other components in the gas phase (decomposi¬ 
tion and association products). Hence, results from 
other techniques such as matrix isolation spec¬ 
troscopy and theoretical methods are acknowl¬ 
edged to extend the high-temperature experimen¬ 
tal data. 

The quantum chemical calculations have been 
successfully applied for the interpretation and pre¬ 
diction of vibrational spectra of organic com¬ 
pounds [1]. However, the application for inorganic 
molecules is more difficult. First of all, these latter 
molecules are usually built up out of heavier ele¬ 
ments, rendering the computations with all-elec¬ 
tron basis sets very expensive. Furthermore, the 
errors arising from the neglect of relativistic effects 
are important from the fourth row of the periodic 
table. These two problems can be solved by using 
relativistic effective core potentials (RECP) [2-8], 
which are extensively used in Hartree-Fock (HF) 
and post-HF calculations. 

The density functional (DFT) methods are 
known to provide good estimates of the vibra¬ 
tional spectra of organic compounds. Recent DFT 
studies of transition-metal complexes also have 
shown promising agreement with the experimen¬ 
tal infrared (IR) frequencies and intensities [9, 10]. 
However less information is available about the 
performance of the DFT methods with effective 
core potentials. The use of DFT functionals with 
ECPs generated previously from Hartree-Fock 
atomic calculations is not free of problems. Since 
the exchange and correlational functionals are non¬ 
linear in the density, the core and valence densities 
are not strictly separable. Hence, the use of ECPs 
generated from HF calculations should be care¬ 
fully tested for their applicability in DFT work. 
Promising results have been published recently by 


Russo et al. [11]. Investigating a small but repre¬ 
sentative set of transition-metal compounds 
[ScF 3 , TiF 4 , VF 5 , TiO, Ni(CO) 4 , CuF], they found 
that the RECPs of Hay and Wadt [2-4] can be used 
in DFT methods, as well. 

In the present study we test the performance of 
currently most successful generalized gradient ap¬ 
proximation (GGA) and hybrid DFT methods us¬ 
ing effective core potentials as compared to 
Hartree-Fock and second-order Moller-Plesset 
(MP2) methods as well as to all-electron basis sets. 

The tellurium tetrachloride molecule was cho¬ 
sen as a model for this study. Its molecular geome¬ 
try (determined by gas electron diffraction) and 
experimental vibrational spectra are known from a 
recent study [12]. The TeCl 4 molecule has a trigo¬ 
nal bipyramid structure with a vacant equatorial 
position ( C 2v symmetry). The molecular geometry 
is in accordance with the valence-shell electron-pair 
repulsion (VSEPR) theory [13]. Characteristics are 
the axial Te—Cl bonds being longer than the equa¬ 
torial bonds and the axial chlorines being slightly 
bent toward the equatorial ones. Concerning the 
vibrational data, eight of the nine fundamentals of 
TeCl 4 were found in the experimental specta and a 
good estimate [based on a scaled quantum me¬ 
chanical (SQM) [14] force field] is available for the 
unobserved A 2 (v 4 ) mode [12]. 


Computational Details 

Geometries, Cartesian harmonic force constants, 
and IR intensities of TeCl 4 were calculated at the 
HF, MP2 [15], and DFT levels of theory using the 
Gaussian 94 program [16]. Six exchange-correla¬ 
tion density functionals were employed in the pre¬ 
sent study: Becke's 1988 (B) [17] and Becke's three 
parameter (B3) [18] gradient-corrected exchange 
functionals were combined with the Lee-Yang- 
Parr (LYP) [19], Perdew (P) [20], and Perdew- 
Wang (PW) [21] correlation functionals. The de¬ 
fault grid (75,302) implemented in the Gaussian 94 
program was used in the DFT calculations. We 
used five different basis set combinations: 

1. HW: the Hay & Wadt RECPs with double- 
zeta valence basis sets for both Cl and Te [3] 

2. CEP: the RECPs of Stevens et al. and the 
corresponding double-zeta valence basis sets 
for both Cl and Te [5, 6] 

3. HW*: the 6-31G(d) basis set for chlorine and 
the Hay & Wadt RECP with double-zeta va- 
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lence basis set [3] extended with a single set 
of d-type polarization functions [22] for tel¬ 
lurium 

4. CEP*: the 6-31G(d) basis set for chlorine and 
the RECP of Stevens et al. with double-zeta 
valence basis set [6] extended with a single 
set of d-type polarization functions [22] for 
tellurium 

5. ALL (all-electron basis set): the standard 6- 
31G(d) basis set for chlorine and a 3-21G(d) 
basis [23] for tellurium 

Harmonic vibrational frequencies and IR inten¬ 
sities were computed for the fully optimized ge¬ 
ometries. The first derivatives of the potential en¬ 
ergy with respect to the nuclear coordinates were 
calculated analytically at all quantum chemical 
levels. The second derivatives were obtained ana¬ 
lytically for the all-electron basis set and numeri¬ 
cally for the RECP basis sets. 


Results and Discussion 

GEOMETRY 

The computed geometrical parameters of TeCl 4 
obtained at various levels of theory are compared 
to the experimental geometry in Table I. 

The DFT methods containing Becke's 1988 ex¬ 
change functional (B-LYP, B-P, and B-PW) with 
the most economic HW and CEP basis sets found 
the T d geometry as a single minimum on the 
potential energy surface. All the other calculations 
converged to the right C 2v structure resulting 
longer axial Te—Cl bonds with respect to the 
equatorial ones in accordance with the experimen¬ 
tal data. The methods resulting in false geometries 
were omitted from the further studies. 

Figures 1(a) and 1(b) show the deviations be¬ 
tween the calculated and experimental geometrical 
parameters. The most significant difference be¬ 
tween the computed results is the very poor per¬ 
formance of the HW and CEP basis sets for the 
bond distances [cf. Fig. 1(a)]. 

The calculated bond lengths are longer than the 
experimental ones, with one exception of the 
HF/HW* result for the Te—Cl eq bond. In gen¬ 
eral, the computations underestimate the Cl eq —Te 
—Cl eq bond angle, however, differ in the character 
of the Cl ax —Te—Cl eq angle (cf. Table I). The ex¬ 
perimental tilt of the axial chlorines toward the 
equatorial ones is reproduced by the HF and MP2 
methods (except the MP2/CEP level) and by the 
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B3-type DFT/ALL calculations. In all the other 
cases no tilt, or a small opposite tilt, was obtained. 
The latter bond angle is generally overestimated 
by the various methods, except the HF with all the 
basis sets and the MP2/ALL [cf. Fig. 1(b)]. The 
deviations for all the geometrical parameters are 
generally larger than the experimental errors. 

In general, the computed bond lengths and the 
Cl ax —Te—Cl eq angle increase in the order HF < 
MP2 < B3-P - B3-PW < B3-LYP < B-P - B-PW < 
B-LYP. [The only significant deviation from this 
trend was found for the bond lengths obtained at 
the MP2/HW and CEP levels; cf. Fig. 1(a)]. We 
note that similar behavior has been found earlier 
for C—C and C—O bond lengths of sugars 
and 1,2-ethanediol [24, 25]. The computed 
Cl eq —Te—Cl eq bond angle varies in a narrower 
range. The above trend, but with negative sign 
(because of the underestimating character of the 
computations), can be recognized, except for the 
results obtained by using the ALL basis set. 

Summing up, the experimental geometry of 
TeCl 4 is best reproduced by the HF and MP2 
methods. The DFT methods provided poorer re¬ 
sults, especially those that contain the B-type ex¬ 
change functional. The HW*, CEP*, and ALL ba¬ 
sis sets are of similar quality, HW* being slightly 
superior; the HW and CEP basis sets are much 
poorer. 

FREQUENCIES 

The observed and computed fundamentals of 
TeCl 4 are compiled in Table II. The deviations 
between selected computed and the experimental 
frequencies are shown in Figure 2. We present here 
the results obtained at various levels of theory 
using the HW and HW* basis sets. They resemble 
the results obtained with the other related basis 
sets (HW to CEP, HW* to CEP* and ALL). The 
following trend can be observed: The B3~type 
DFT/HW and B-type DFT/HW* frequencies are 
considerably underestimated, while the HF and 
MP2/HW* ones are overestimated. The HF/HW 
and most of the B3-type DFT/HW* frequencies 
vary around the experimental values. We note the 
significant underestimation of the v s and v 9 
stretching fundamentals by the DFT calculations. 
The computed frequencies increase in the order 
B-LYP < B-P - B-PW < B3-LYP < B3-PW - B3-P 
< MP2 < HF. This trend is just the opposite to 
that seen for most of the geometrical parameters 
(cf. Fig. 1). 
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TABLE I_ 

Molecular geometry of TeCI 4 . 




Bond length (A) 


Bond angle (deg) 


Basis 3 

Method 

T® Cl ax 

Te-Cl eq 

ci ax "*" e ci ax 

Cleq Te Cl eq 

Q| —Je — Cl 

°'ax °'eq 


Exp. b 

2.430(6) 

2.298(6) 

175.9(6) 

103.0(7) 

88.7° 

HW 

HF 

2.537 

2.409 

172.8 

101.4 

87.7 


MP2 

2.588 

2.468 

177.8 

100.5 

89.3 


B-LYP d 

2.585 


109.5 




B-P d 

2.568 


109.5 




B-PW d 

2.566 


109.5 




B3-LYP 

2.578 

2.481 

174.3 

99.3 

91.9 


B3-P 

2.559 

2.463 

176.6 

99.6 

90.9 


B3-PW 

2.560 

2.466 

177.6 

99.9 

90.8 

CEP 

HF 

2.568 

2.431 

174.4 

100.6 

88.2 


MP2 

2.626 

2.501 

177.9 

98.6 

90.7 


B-LYP d 

2.607 


109.5 




B-P d 

2.587 


109.5 




B-PW d 

2.586 


109.5 




B3-LYP 

2.600 

2.505 

171.0 

98.7 

92.9 


B3-P 

2.579 

2.486 

174.8 

99.2 

91.7 


B3-PW 

2.583 

2.491 

173.9 

99.1 

92.0 

HW* 

HF 

2.440 

2.290 

175.2 

100.7 

88.1 


MP2 

2.441 

2.313 

176.6 

100.2 

88.9 


B-LYP 

2.507 

2.389 

172.5 

99.4 

92.4 


B-P 

2.485 

2.369 

174.6 

99.5 

91.7 


B-PW 

2.479 

2.367 

175.9 

100.0 

91.3 


B3-LYP 

2.472 

2.346 

179.0 

100.1 

90.4 


B3-P 

2.453 

2.330 

179.6 

100.2 

89.9 


B3-PW 

2.455 

2.333 

179.8 

100.1 

89.9 

CEP* 

HF 

2.461 

2.313 

175.5 

100.6 

88.5 


MP2 

2.463 

2.339 

176.8 

100.4 

89.0 


B-LYP 

2.522 

2.415 

172.2 

99.9 

92.5 


B-P 

2.500 

2.395 

174.9 

100.1 

91.6 


B-PW 

2.499 

2.393 

174.9 

100.1 

91.7 


B3-LYP 

2.491 

2.371 

178.8 

100.2 

90.4 


B3-P 

2.471 

2.354 

179.7 

100.3 

89.9 


B3-PW 

2.475 

2.357 

179.8 

100.3 

90.1 

ALL 

HF 

2.449 

2.317 

173.5 

100.7 

87.9 


MP2 

2.448 

2.338 

174.7 

100.5 

88.3 


B-LYP 

2.509 

2.410 

178.5 

102.3 

90.4 


B-P 

2.481 

2.383 

179.6 

102.2 

89.9 


B-PW 

2.480 

2.382 

179.8 

102.1 

89.9 


B3-LYP 

2.477 

2.368 

177.8 

101.7 

89.3 


B3-P 

2.452 

2.345 

176.8 

101.5 

89.0 


B3-PW 

2.456 

2.348 

176.9 

101.5 

89.0 

a For details of basis sets HW, CEP, HW* 

, CEP*, and ALL see text. 



b From a gas electron diffraction study, Ref. [12], r a values. 
c Calculated from the data given in Ref. [12]. 
d The computed geometry has tetrahedral symmetry. 
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HW CEP HW* CEP* ALL HW CEP HW* CEP* ALL 


Te-CIgx Te-Cl e q 

FIGURE la. Deviations between computed and experimental geometrical parameters: (a) bond lengths; (b) bond 
angles. 

The performance* of the computational levels experimental and theoretical frequencies (Fig. 3). f 
can be assessed and compared by the root-mean- The rms deviations are scattered within the 8-53 

square (rms) and maximal deviations between the cm" 1 range, while the maximal deviations (ab- 

r u , uU ,. C£ , solute values) vary from 15 to 80 cm" 1 . 

By performance we mean here how the different theoreti- J 

cal levels are able to estimate the experimental (anharmonic) 

vibrational spectra. We do not intend to investigate the quality 1 The v± mode was not involved in the evaluation of these 

of the calculated harmonic frequencies. deviations. 
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HW CEP HW* CEP* ALL HW CEP HW* CEP* ALL 


Cl ec j-Te-Cleq 

FIGURE 1b. (continued) 


On Figure 3 a significant trend can be observed 
for both the theoretical levels and the basis sets, a 
trend which is reflected at both the rms and maxi¬ 
mal deviations. 

The performance of the HW and CEP basis sets 
used with the correlated methods is very poor, 
however, it is surprisingly good with the Hartree- 
Fock theory. The HF/FIW level belongs even to 
the best methods. The HW*, CEP*, and ALL basis 


sets performed well with the MP2 and B3-type 
DFT levels, while about two-three times larger 
deviations can be observed for the results obtained 
at the B-type DFT calculations. It is noteworthy 
that the MP2/HW* and CEP* frequencies are in 
very good agreement with the experimental data, 
however, the agreement worsened using the all¬ 
electron (ALL) basis set. Here the B3-type DFT 
levels are definitely superior. We note also the 
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TABLE II_ 

Experimental and calculated fundamentals (cm -1 ) and calculated IR intensities (km mol -1 ) of TeCI 4 . 


rms max. 


Basis 3 

Method 

v i 

^2 

^3 

^4 

^5 

v 6 

V 7 

^8 

^9 

dev. 

dev. 


Exp. b 

72 

104 

158 

(150 c + 10) 

165 

290 

312 

378 

382 



HW 

HF 

73 (3) 

107 (13) 

150 (19) 

160 (0) 

174 (8) 

285 (1) 

307 (224) 

366 (21) 

359 (9) 

10.5 

21 


MP2 

57(3) 

82 (9) 

125 (11) 

133 (0) 

147 (4) 

254 (1) 

299 (145) 

319 (24) 

315 (10) 

38.0 

67 


B3-LYP 

50 (3) 

61 (6) 

109 (6) 

113 (0) 

128 (2) 

258 (4) 

300 (129) 

308 (33) 

309 (14) 

46.8 

73 


B3-P 

52 (2) 

67 (6) 

133 (7) 

118 (0) 

133 (2) 

265 (4) 

309 (134) 

320 (34) 

321 (15) 

37.3 

61 


B3-PW 

53 (2) 

68 (6) 

111 (7) 

121 (0) 

135 (3) 

266 (4) 

310 (133) 

316 (35) 

318 (13) 

40.8 

64 

CEP 

HF 

73 (4) 

102 (13) 

147 (18) 

153 (0) 

168 (6) 

271 (1) 

300 (279) 

365 (22) 

354 (11) 

15.4 

28 


MP2 

54 (4) 

69 (8) 

117(8) 

121 (0) 

135 (2) 

238 (2) 

294 (162) 

303 (29) 

306 (14) 

48.2 

76 


B3-LYP 

40 (3) 

54 (5) 

103 (4) 

108 (0) 

123 (1) 

250 (6) 

300 (154) 

298 (38) 

306 (16) 

52.7 

80 


B3-P 

48 (3) 

63 (6) 

108 (6) 

115(0) 

130 (2) 

258 (6) 

310 (161) 

309 (41) 

317 (16) 

44.7 

69 


B3-PW 

47 (3) 

61 (6) 

107 (6) 

115 (0) 

130 (2) 

257 (6) 

308 (160) 

307 (40) 

315 (16) 

46.0 

71 

HW* 

HF 

96 (3) 

128 (10) 

181 (14) 

184 (0) 

201 (10)301 (2) 

306 (341) 

408 (60) 

415 (23) 

25.3 

36 


MP2 b 

85 (2) 

113 (7) 

163 (9) 

166 (0) 

183 (5) 

291 (3) 

322 (256) 

376 (59) 

385 (21) 

9.4 

18 


B-LYP 

65 (2) 

72 (4) 

126 (4) 

127 (0) 

146 (1) 

258 (8) 

290 (186) 

315 (62) 

326 (22) 

37.2 

63 


B-P 

67 (2) 

79 (4) 

131 (5) 

134 (0) 

153 (1) 

270 (7) 

303 (194) 

331 (64) 

341 (23) 

27.1 

47 


B-PW 

70 (1) 

86 (4) 

132 (5) 

137 (0) 

155 (2) 

271 (7) 

305 (196) 

330 (64) 

342 (22) 

26.0 

48 


B3-LYP 

72 (2) 

95 (5) 

145 (7) 

149 (0) 

167 (2) 

275 (5) 

305 (229) 

345 (62) 

357 (22) 

16.7 

33 


B3-P 

73 (2) 

100 (6) 

149 (7) 

154 (0) 

172 (3) 

286 (5) 

317 (234) 

358 (65) 

370 (22) 

9.6 

20 


B3-PW 

78(2) 

100 (6) 

150 (7) 

153 (0) 

171 (3) 

284 (5) 

315 (234) 

357 (64) 

369 (22) 

10.0 

21 

CEP* 

HF 

92 (3) 

122 (9) 

177 (15) 

176 (0) 

195 (8) 

304 (2) 

314 (353) 

400 (61) 

407 (25) 

20.3 

30 


MP2 

80 (2) 

107 (7) 

158 (10) 

158 (0) 

175 (5) 

291 (3) 

328 (256) 

368 (59) 

376 (22) 

8.4 

16 


B-LYP 

59 (2) 

73 (3) 

118 (4) 

122 (0) 

139 (1) 

262 (8) 

299 (183) 

309 (60) 

320 (22) 

40.3 

69 


B-P 

64 (2) 

79 (4) 

124 (5) 

128 (0) 

146 (2) 

273 (8) 

312 (194) 

325 (64) 

335 (22) 

30.6 

53 


B-PW 

64 (2) 

80 (4) 

126 (5) 

130 (0) 

147 (2) 

273 (8) 

312 (196) 

326 (64) 

336 (22) 

29.8 

52 


B3-LYP 

72 (2) 

92 (5) 

141 (7) 

142 (0) 

160 (2) 

277 (6) 

313 (229) 

340 (62) 

352 (22) 

19.3 

38 


B3-P 

74 (2) 

96 (5) 

145 (8) 

147 (0) 

164 (3) 

288 (5) 

324 (237) 

354 (65) 

365 (23) 

12.5 

24 


B3-PW 

74 (2) 

95 (5) 

145 (8) 

146 (0) 

164 (3) 

286 (6) 

322 (237) 

352 (65) 

363 (23) 

13.3 

26 

ALL 

HF 

96 (2) 

131 (8) 

181 (13) 

183 (0) 

200 (11)307 (2) 

327 (326) 

411 (64) 

416 (26) 

27.0 

35 


MP2 

86 (1) 

119 (6) 

166 (9) 

167 (0) 

184 (6) 

299 (3) 

341 (252) 

384 (62) 

389 (23) 

15.2 

29 


B-LYP 

52 (2) 

80 (4) 

125 (5) 

138 (0) 

155 (1) 

238 (3) 

267 (197) 

335 (65) 

338 (23) 

36.5 

52 


B-P 

64 (1) 

95 (4) 

135 (5) 

147 (0) 

164 (2) 

266 (4) 

301 (205) 

353 (67) 

355 (23) 

18.5 

27 


B-PW 

73 (1) 

99 (4) 

139 (5) 

142 (0) 

157 (3) 

279 (6) 

321 (207) 

341 (68) 

348 (23) 

19.9 

37 


B3-LYP 

70 (2) 

104 (5) 

146 (7) 

155 (0) 

172 (3) 

275 (4) 

309 (233) 

365 (66) 

369 (23) 

9.8 

15 


B3-P 

74 (1) 

111 (5) 

152 (7) 

161 (0) 

177 (4) 

294 (4) 

332 (239) 

380 (68) 

384 (24) 

9.1 

20 


B3-PW 

79 (1) 

112 (5) 

154 (7) 

156 (0) 

171 (5) 

293 (4) 

333 (238) 

364 (68) 

375 (24) 

10.4 

21 


a For details of basis sets HW, CEP, HW*, CEP*, and ALL see text. 
b From Ref. [12]. 

C SQM value using the MP2/HW* harmonic force field. 


significant improvement of the results obtained at 
the B3-LYP level used with the ALL basis set 
compared to those with HW* and CEP*. The 
results of the B3-P and B3-PW levels used with 
CEP* worsened somewhat compared to those us¬ 
ing the HW* basis and to MP2/CEP*. The perfor¬ 
mance of the Hartree-Fock theory with these three 
larger basis sets is similar to that of the B-type DFT 
methods. 


The best agreement between the experimental 
and computed frequencies was obtained at the 
MP2/CEP* level, however, only slightly worse 
are the HF/HW, the MP2, B3-P and B3-PW/HW*, 
and the B3-P, B3-LYP and B3-PW/ALL results 
(rms deviation less than 11 cm -1 , maximal devia¬ 
tion not exceeding 21 cm -1 ). 

The computed vibrational data of TeCl 4 are 
suitable to assist the assignment of the bands in 
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FIGURE 2. Deviations between calculated and experimental vibrational frequencies obtained at various levels of theory using basis sets 
(a) HW and (b) HW*. 
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FIGURE 3. (a) Rms and (b) maximal deviations (absolute values) between the calculated and experimental vibrational 
frequencies. 


the experimental spectra (cf. Table II). The stretch¬ 
ing fundamentals can be assigned unambiguously 
by the computed IR intensities. In most cases the 
relative sequence of the frequencies is in accord 
with the experimental observations, and only the 


computed v 8 and v 9 stretching fundamentals are 
interchanged at the HF/HW, HF/CEP, and 
MP2/FIW levels as well as the calculated v 7 and 
v 8 modes at the B3-P and B3-PW/CEP levels. The 
deformation modes have similar (small) computed 
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IR intensities, and the relative order of their fre¬ 
quencies agrees with that of the experimental ones. 
The unobserved v 4 mode was positioned between 
v 3 and v 5 by all the quantum chemical calcula¬ 
tions except for B3-P/HW and HF/CEP*, how¬ 
ever the experimental values of these fundamen¬ 
tals are also very close [12]. 


Conclusions 

1. The molecular geometry of TeCl 4 is best re¬ 
produced at the HF and MP2 levels using the 
HW*, CEP*, and ALL basis sets. The density 
functional methods gave rise to poorer re¬ 
sults, especially those using Becke's 1988 ex¬ 
change functional The RECP basis sets (HW 
and CEP) provided poor results, especially 
for the bond distances. 

2. Generally, the vibrational frequencies calcu¬ 
lated by the MP2 and B3-type DFT methods 
with basis sets HW*, CEP*, and ALL (except 
MP2/ALL and B3-LYP/HW* and CEP*) as 
well as at the HF/HW level are in good 
accord with the experimental data. The 
agreement is good enough to assist the as¬ 
signment of the measured vibrational spec¬ 
tra. 

3. The HW and CEP basis sets compensate ef- 
fectively the errors of the HF method for 
TeCl 4 . The HW* and CEP* basis sets are 
best combined with MP2, B3-P and B3-PW. 
Regarding the all-electron basis set a combi¬ 
nation with the B3-type DFT methods is su¬ 
perior. 

4. Based on their performance and cost effec¬ 
tiveness the methods using relativistic effec¬ 
tive core potentials are advantageous for 
large-scale computations. 
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ABSTRACT: A general strategy for computing the vibrational frequencies of 
adsorbates on surfaces is presented. Rather than use numerical fits to the surface energy, 
we advocate diagonalizing a mass weighted energy second-derivative matrix. The 
computation of energy derivatives has several advantages: It facilitates a more accurate 
treatment of the adsorbate/substrate mass ratios. The normal modes of the surface 
vibrations are given automatically in the diagonalization step. Finally, a major advantage 
of using energy derivatives is that they provide estimates of the range of the 
adsorbate/substrate interactions which can be used to ensure that the frequency values 
are converged. We illustrate the strategy for cluster calculations which simulate the 
H/Al(100), O/AKlll), H 2 0/A1(111), and Al/Si(lll) surfaces. © 1997 John Wiley & Sons, 
Inc. Int J Quant Chem 65: 827-838, 1997 


Introduction 

H igh-resolution electron energy loss spec¬ 
troscopy (HREELS) [1, 2] and reflection ad¬ 
sorption infrared spectroscopy (RAIRS) [1, 3] are 
vibrational spectroscopic techniques which now 
play a valuable role in identifying and characteriz¬ 
ing structural information on adsorbates, such as 
reactive intermediates, on surfaces. Generally, 
symmetry arguments coupled with experimental 
experience from related better-understood systems 
enables vibrational assignments to be correlated 
with a specific atomic or molecular adsorbate at a 
particular surface site. Unfortunately, this empiri¬ 
cal approach does not always facilitate an unam¬ 
biguous assignment of the adsorbate identity and 

International Journal of Quantum Chemistry, Vol. 65, 827-838 (1997) 
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structure to the data. For example, we have re¬ 
cently argued that water molecularly adsorbs on 
the Al(lll) surface at low temperatures [4, 5], 
while the experimental HREELS data was origi¬ 
nally interpreted with water dissociation taking 
place [6]. Our argument was based on ab initio 
cluster calculations where we computed the geom¬ 
etry and vibrational frequencies for the chemi¬ 
sorbed water. We were able to show that the two 
high-frequency vibrational modes observed at 3450 
and 3720 cm -1 by HREELS could be assigned, 
respectively, to H-bonded and non-H-bonded OH 
stretches in water [5] and did not require the 
presence of a hydroxyl group on the Al(lll) sur¬ 
face as proposed earlier [6]. 

There have been a large number of ab initio 
calculations used to predict vibrational frequencies 
for molecules. Such calculations are powerful be- 
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cause they can readily provide useful vibrational 
information about both well and poorly character¬ 
ized molecular structures. A useful overview of 
the accuracy of the vibrational frequencies com¬ 
puted by Hartree-Fock (HF), Moller-Plesset 
(MP2), and density functional theory (DFT) was 
given recently by Scott and Radom [7] for a series 
of small polyatomic molecules containing up to 
four heavy atoms. Ab initio vibrational frequency 
calculations for quite large molecules have also 
been reported. For example, Pulay and co-workers 
recently published DFT calculations which com¬ 
puted vibrational frequencies for free-base por¬ 
phyrin [8]. However, the computational require¬ 
ments of vibrational calculations must obviously 
scale rapidly with system size, making their use in 
analyzing adsorbates on surfaces challenging. 
Cluster calculations are one of the common tech¬ 
niques for theoretically investigating adsorption 
on surfaces. The cluster approach assumes that the 
dominant chemical interactions are local in nature 
and tend to involve only a few of the substrate 
atoms close to the adsorbate [9]. It is the local 
nature of the adsorbate interactions that enables a 
reduction in the computational effort since only a 
subset of vibrational modes of the cluster need to 
be calculated. Apart from some recent calculations 
involving relatively small clusters which have ob¬ 
tained adsorbate vibrational frequencies by explic¬ 
itly computing a second derivative matrix [10, 11], 
the usual approach to computing the adsorbate 
vibrational frequencies has been by making nu¬ 
merical fits to the variation in the total energy of 
the cluster as the adsorbate atoms are displaced 
along expected normal mode directions [12]. This 
energy-fitting procedure has also been used for 
slab calculations [13]. The calculations treat the 
substrate mass as being infinitely larger than that 
of the adsorbate, but, as we show below, this can 
lead to large errors in the adsorbate frequencies. In 
addition to the substrate mass approximation, the 
energy-fitting procedure requires some approxi¬ 
mation to the form of the normal modes when 
considering a polyatomic adsorbate. For example, 
both Rosch and Bagus et al. obtained a stretching 
frequency for CO adsorbed on a surface by fixing 
the CO center of mass at the equilibrium distance 
above the surface and fitting to the energy varia¬ 
tion with CO distance [12]. To avoid making as¬ 
sumptions about the form of the normal modes, 
we have started computing vibrational frequencies 
for an adsorbate on a cluster by diagonalizing the 
mass-weighted energy second-derivative matrix 


computed for a subset of atoms composing the 
cluster. In the case of a single adatom, the energy-fit 
procedure and the energy second derivatives for 
the adatom alone result in identical adsorbate vi¬ 
brational frequencies. A disadvantage of our strat¬ 
egy over numerical fits is that the more computa¬ 
tionally demanding energy derivatives now need 
to be calculated. However, by computing esti¬ 
mates of the energy second derivatives between 
the adsorbate and substrate and using a frequency 
correction formula, originally proposed by Black 
[14], one can more realistically incorporate the 
adsorbate/substrate mass ratio into the frequency 
calculation. The frequency correction formula also 
provides a guide to the spatial extent of the adsor¬ 
bate interactions with the substrate and provides a 
convergence check on the necessary cluster size. 

In this article, we illustrate strategies for com¬ 
puting more physically realistic adsorbate vibra¬ 
tional frequencies by considering H, O, and water 
adsorbates on Al surfaces and for an Al adatom on 
the Si(lll) surface. In the next section, we describe 
how the cluster calculations were performed and 
how the vibrational frequencies were computed 
from the mass-weighted energy second-derivative 
matrix and the frequency correction formula. In 
the third section of the article, we present results 
and discuss the adsorbate vibrations for each of 
the four systems. We first show that, as expected, 
due to the H adatoms having the smallest adsor- 
bate-to-substrate mass ratio, H adsorption on the 
Al(l00) surface is the most straightforward system 
to theoretically treat. We find substantial fre¬ 
quency correction terms, owing to the larger 0.59 
O-to-Al mass ratio, to occur when using the en¬ 
ergy second-derivative matrix to compute the vi¬ 
brational frequencies for a single O adatom on the 
Al(l 11) surface. The size of the O frequency correc¬ 
tion terms becomes much smaller if a larger sec¬ 
ond derivative matrix is formed using additional 
second derivatives from the nearby substrate 
atoms. We find an Al s O subcluster to produce O 
vibrations consistent with the corrected O-only fre¬ 
quency calculation. We also consider different¬ 
sized subclusters in the frequency calculations for 
water adsorbed on the Aid 11) surface. An impor¬ 
tant feature about the adsorbed water frequency 
calculations is that we do not need to make any 
assumptions about the form of the adsorbate nor¬ 
mal modes since they are given automatically by 
the diagonalization procedure. However, vibra¬ 
tional calculations at the SCF level on molecules 
are well known to overestimate the frequencies 
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and we go on to show for the H 2 0/A1(111) sur¬ 
face, using energy-fitting procedures, how the nor¬ 
mal modes obtained at an initial level of theory 
can be used to obtain estimates of the vibrational 
frequencies at a higher level of theory. In the 
Al/Si(lll) system, we show that the frequency 
correction formula is still a useful tool for identify¬ 
ing adsorbate and substrate localized vibrations 
even though the Al-to-Si mass ratio is 0.96 and the 
adsorbate frequency values are overlapped by vi¬ 
brations from the Si substrate. Concluding remarks 
are given in the final section of the article. 


Method 

In our previous work, we used cluster calcula¬ 
tions to simulate the H/A1(100) [15], 0/Al(lll) 
[16, 17], water/Al(lll) [4, 5], and Al/Si(lll) [18] 
surfaces. We found that the general features of the 
adsorption can often be reliably modeled by taking 
the set of central substrate atoms which interact 
most strongly with the adsorbate atoms and sur¬ 
rounding them by a complete shell of neighboring 
substrate atoms having the same geometry as in 
the extended surface. For example, in our study of 
the H/A1(100) surface, we used the A1 9 (5,4)H and 
A1 14 (8,6)H clusters, with one and two central A1 
atoms, to simulate H adsorption at the four-fold 
on-top and two-fold bridging sites, respectively, 
where the (5,4) and (8,6) notation refers to the 
number of substrate atoms in each cluster layer. 
We ignore edge effects for the A1 clusters, while in 
the Si calculations we follow the usual practice of 
terminating the Si dangling bonds at the cluster 
edges with H atoms. The A1 cluster atoms are 
constrained to appropriate fee lattice positions us- 

o 

ing the bulk lattice parameter of 4.05 A [19] except 
in the water/Al(l 11) calculations where we allow 
surface relaxation effects for the central A1 atoms. 
In our simulation of the Al/Si(lll) surface, we 
found the inclusion of substrate relaxation effects 
to be essential in order to obtain a stable adsorp¬ 
tion site which agrees with the experimentally 
observed surface structure and we constrained the 
outer Si atoms at their experimental Si bulk lattice 
positions [20]. 

All the cluster calculations were performed us¬ 
ing the GAMESS [21] computer program. The 
equilibrium adsorbate geometry is found by par¬ 
tially optimizing the clusters' total energy while 
constraining most of the substrate atoms to their 


extended surface positions [22]. The adsorbate par¬ 
tial geometry optimizations and vibrational fre¬ 
quency calculations utilize energy gradients and 
these computations are performed at the restricted 
closed and open Hartree-Fock level. We also 
discuss some single geometry cluster energies 
computed using restricted second-order Moller- 
Plesset theory [23]. The Hay and Wadt effective 
core potential (ECP) and its double zeta valence 
basis set [3s3p]/(2s2p) was used for A1 and Si 
[24]. In the Al/Si(lll) clusters, d -polarization 
functions were also added to the basis sets for the 
A1 adatom and its neighboring Si atoms. The Dun¬ 
ning and Hay split valence basis [4s]/(2s) was 
used for H [25]. The Dunning and Hay split va¬ 
lence basis set augmented with a d-polarization 
and a diffuse p-function [9s6pld]/(3s3pld) was 
used for O [25]. More complete details on the 
calculations are given in our earlier work [4, 5, 
15-18]. 

Computing the complete set of vibrational fre¬ 
quencies for larger clusters rapidly becomes a de¬ 
manding task and would produce a number of 
vibrational modes associated with cluster edge ef¬ 
fects which are not strictly relevant to the experi¬ 
mental characterization of the adsorbate. For this 
reason, in both cluster and slab calculations, a 
more economical approach is to fit the cluster 
energy to some potential energy surface for a se¬ 
lect number of vibrational modes involving the 
adsorbate [12]. A typical application of this proce¬ 
dure was used by Jacobsen et al. [13] in their slab 
calculations to obtain O vibrational frequencies v 
on the Al(l 11) surface using the quadratic fitting 
formula 



where AE is the energy change when the O atom 
is displaced by a distance A b from the minimum. 
Jacobsen et al. took M eff to be the mass of the 
adatom and made the common approximation that 
the substrate has infinite mass. Another difficulty 
with this fitting approach is deciding which direc¬ 
tion to take the A b displacement when dealing 
with polyatomic adsorbates [12]. We encountered 
this problem in our water/Al(l 11) cluster calcula¬ 
tions and we started to compute the adsorbate 
vibrational frequencies and relevant normal modes 
by diagonalizing subblocks of the mass-weighted 
energy second-derivative matrix [4, 5]. 


INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY 


829 


HEAD 


Forming the mass-weighted energy second- 
derivative matrix either analytically or numeri¬ 
cally as finite differences of the energy gradients is 
the routine quantum chemical technique for com¬ 
puting vibrational frequencies of molecules [26]. 
However, to keep the calculations tractable and 
since we are only interested in a subset of the 
possible cluster frequencies, we only form the 
mass-weighted second-derivative matrix for a 
fragment of the atoms in the cluster. In block form, 
the full mass-weighted force constant matrix is 


K = 


K 

K 


AA 


SA 


K 

K 




( 2 ) 


where A labels the adsorbate set of atoms used in 
the second-derivative matrix formation and S are 
the remaining substrate atoms composing the clus¬ 
ter. The A subset may also include some substrate 
atoms, but these atoms should be chosen to pre¬ 
serve the symmetry of the adsorbate vibrational 
modes. The z-th row of the force constant matrix is 
initially calculated by taking the finite difference 
of the analytical gradient g vector by displacing 
the atom with the /-th coordinate in the A set to 
give 


normal modes. However, restricting the A set to 
just adsorbate atoms means that the frequency 
calculations are still treating the substrate as hav¬ 
ing infinite mass. Furthermore, these frequency 
calculations do not give any indication as to 
whether any significant interactions between the 
adsorbate and the substrate are being neglected. 

However, by calculating energy gradients and 
forming the off-diagonal K AS block enables an 
estimate to the convergence of the frequency calcu¬ 
lations to be made. Transforming the mass- 
weighted second-derivative matrix K so that the 
separate adsorbate and surface blocks are diago¬ 
nalized 





K 


AS 

ft 2 


(4) 


where ideally the li 2 eigenvalues would corre¬ 
spond to surface phonons, the off-diagonal force 
constant matrix block K AS should be small when 
the adsorbate vibrational frequencies are strongly 
localized and decoupled from the substrate. Fol¬ 
lowing an approach originally suggested by Black 
[14], an estimate of the corrected adsorbate vibra¬ 
tional frequencies can be obtained from 


1 d 2 E 1 A#, 

K n = -7—T-T---/—ir-' (3) 

; ■yjm i m j dtfi dcjj ^m i m j Aqr y 

where Ag = g(Afy) — g(0) and m i is the mass of 
the nucleus associated with coordinate z. Equation 
(3) can also be written in terms of taking ±A^ ; 
displacements so that a more symmetrical force 
constant matrix is produced. Similarly, when i and 
j are both elements of A, an improved force con¬ 
stant block should result from averaging • and 
Kji. Equation (3) also facilitates making an esti¬ 
mate of the K as force constant block provided that 
the gradient vector g is computed for all the atoms 
in the cluster. 

Once the K AA block of the force constant matrix 
is formed, the adsorbate vibrational frequencies co 
are obtained by diagonalizing K AA . As expected, 
we find for a single atom adsorbate the K AA diag- 
onalization and the curve-fitting procedure im¬ 
plied by Eq. (1) produce identical frequencies. For 
polyatomic adsorbates, such as water on the 
Al(lll) surface, the diagonalization of the adsor¬ 
bate force constant block, K AA , produces both the 
adsorbate vibrational frequencies and the relevant 



where the approximation in the last formula as¬ 
sumes that o) i » ft M , the maximum phonon fre¬ 
quency of the substrate. Equation (5) can be de¬ 
rived by applying a simple perturbation theory 
argument to Eq. (4) or by using the more elaborate 
continued fraction technique originally used by 
Black [14]. Bulk A1 has a maximum phonon fre¬ 
quency of approximately 300 cm -1 [27], suggest¬ 
ing that only small corrections should be found for 
high adsorbate frequency modes. Much higher- 
frequency phonon modes exist for the Al/Si(lll) 
surface but we still find Eq. (5) useful because it 
provides a measure of how small the off-diagonal 
K AS block is becoming. In the next section, we 
demonstrate how Eq. (5) generally serves as a 
useful guide to the convergence and reliability of 
the frequency values computed for surface vibra¬ 
tions. 
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Results and Discussion 

H/AL(100) RESULTS 

In Table I, we present the vibrational frequen¬ 
cies for H adsorption at an on-top and a bridging 
site on clusters which simulate the Al(100) surface. 
EELS experiments by Paul found bands at 1750 
and 1125 cm -1 which were assigned to H vibra¬ 
tions at the on-top and bridging sites [28]. The 
general EELS selection rule is that only adsorbate 
vibrations normal to the surface are observed. 
However, it should be noted the EELS band at 
1750 cm -1 may not be simply associated with H 
adsorbed at an on-top site since the intensity band 
grows simultaneously with a peak at 750 cm -1 . In 
our earlier cluster calculations [15], by assuming 
the form of the H atom normal modes and using 
energy fits, we argued the origin of the 1750 and 
750 cm” 1 peaks to be due to two H at neighboring 
on-top sites. However, by following the procedure 
outlined in this article and obtaining the adsorbate 
normal modes by diagonalizing the mass-weighted 
second-derivative matrix for the 2 H atoms, we 
found a strong coupling between the surface nor¬ 


mal and parallel H modes. This surface normal 
and parallel modes coupling contradicts our ear¬ 
lier assumption and makes our previous analysis 
no longer valid. At present, we do not fully under¬ 
stand the origin of the two peaks at 1750 and 750 
cm” 1 and more work is in progress. Nevertheless, 
as indicated in Table I, our computed surface nor¬ 
mal vibrations are in very good agreement with 
the two high-frequency bands observed in the EELS 
experiments. Hjelmberg, in substrate perturbed jel- 
lium calculations, also computed 1694 and 1048 
cm” 1 for the on-top and bridge sites, respectively 
[29]. As expected, due to the small H-to-Al mass 
ratio, we find the frequencies corrected using Eq. 
(5) to differ only slightly from their uncorrected 
values. The largest corrections occur for the H x 
and y surface parallel modes at the on-top site 
where Eq. (5) is not strictly valid since the uncor¬ 
rected frequency, 233 cm” 1 , is less than the A1 
maximum phonon frequency. Our H/A1(100) clus¬ 
ter calculations demonstrate that vibrational fre¬ 
quencies for adsorbates with low masses can be 
reliably obtained using total energy-fitting proce¬ 
dures provided, of course, the correct normal mode 
associated with the adsorbate is known. 


TABLE I_ 

Vibrational frequencies computed using clusters which simulate the H/AI(100) and O / Al(111) surfaces; the 
z direction is normal to the surface and all frequencies are in cm _1 . 


Adsorption site Al X distance Vibrational frequency 

and cluster (A) Uncorrected Corrected Assignment 


On-top 

1.630 

1737 


1767 

H z 

AI 9 (5,4)H 


233 


289 

H x 



233 


289 

H y 

Bridge 

1.832 

1095 


1106 

H z 

AI 14 (8,6)H 


778 


793 

H x in-plane 



526 

O only 

532 

H y out-of-plane 

Three-fold open 

1.788 

595 


708 

Ox 

Al 18 (12,6)0 


594 


703 

o y 



393 

ai 3 0 

577 

O z 



713 


717 

O x 



712 


716 

Oy 



607 


610 

O z 



213 


280 

Al 3 0 x frus. tran. 



211 


281 

Al 3 0 y frus. tran. 



205 


334 

Al 3 0 z frus. tran. 
Al — Al vibrations 
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0/Al(l 11) RESULTS 

The investigation of O adsorption on the Al(lll) 
surface has been a surprising challenge to surface 
science [30]. HREELS experiments at low O cover¬ 
age find peaks in two regions at 545-650 and 
800-850 cm" 1 [31-34]. In agreement with the re¬ 
cent work of others [35-37], we have performed 
cluster calculations which show the 545-650 cm" 1 
HREELS band to correlate with normal vibrations 
of O above the surface at threefold open or fee 
sites and the 800-850 cm" 1 band to correlate with 
symmetric combinations of horizontal vibrations 
from two or more O at neighboring threefold fee 
sites which become HREELS-allowed [17]. In Table 
I, we list results for the frequency calculations for 
a single O atom on the Al 18 (12,6) cluster. The 
uncorrected frequencies are obtained by diagonal¬ 
izing the mass-weighted energy second-derivative 
matrix formed for O only and for the Al s O sub¬ 
cluster consisting of O and its three nearest A1 
neighbors. The uncorrected frequency calculations 
ignore the O-to-Al mass ratio of 0.59 and treat the 
substrate mass as infinite to produce an O surface 
normal vibration in poor agreement with the ex¬ 
perimental values. Similar O vibrational results 
were obtained by Jacobsen et al. using the numeri¬ 
cal fitting procedure given by Eq. (1) in their slab 
calculations [13]. For a slab with a 0.25 O mono- 
layer coverage, Jacobsen et al. computed 411 and 
427 cm" 1 for the surface normal and parallel O 
vibrations. Their surface normal vibration 411 cm" 1 
value matches very closely our O only result. By 
performing the frequency calculations with the 
mass-weighted energy second-derivative matrix 
formed for Al s O subcluster, the O vibrations are 
shifted to much higher frequency and we obtain a 
surface normal O mode which is much more con¬ 
sistent with the HREELS values. The O-only fre¬ 
quencies after correction with Eq. (5) are in very 
close agreement with the three high frequencies 
computed for the A1 3 0 subcluster. Diagonalizing 
the Al s O second-derivative matrix results in 12 
frequencies altogether; we have listed only the six 
highest frequencies in Table I since the nine low- 
frequency modes are mostly associated with 
Al—Al vibrations and the frequency correction 
formula [Eq. (5)] is no longer valid. Further sup¬ 
port that the A1 3 0 subcluster results in more real¬ 
istic frequencies than the O-only calculations comes 
from Eq. (5), which corrects the three high- 
frequency modes by 4 cm" 1 or less. These small 
A1 3 0 subcluster frequency corrections suggest that 


the most important surface interactions responsi¬ 
ble for O vibrations are between the O and the 
three neighboring Al atoms and that the adsorbate 
frequency values are essentially converged when 
the second-derivative matrix formed for the A1 3 0 
subcluster is used. We have tested this frequency 
convergence idea further by performing frequency 
calculations for larger subcluster sizes for O ad¬ 
sorbed on a single layer Al 18 cluster having a 
central Al 6 subcluster surrounded by all the neigh¬ 
boring Al atoms in the Al(lll) surface layer. We 
verified that the O frequencies computed with an 
A1 3 0 subcluster are converged by obtaining the 
same O frequencies with the A1 6 0 subcluster [17]. 

To summarize, these cluster calculations which 
simulate the 0/Al(lll) surface demonstrate that 
caution is needed with energy-fitting procedures 
to obtain adsorbate vibrational frequencies when 
the adsorbate to substrate mass ratio is significant. 
The O-only frequency calculation shows that use¬ 
ful estimates of the adsorbate-to-substrate mass 
effect can be obtained from the correction formula 
given by Eq. (5). Equation (5) corrections to the 
adsorbate frequencies become small when the sub¬ 
cluster A block includes all the dominant adsor¬ 
bate-to-substrate interactions. 

H 2 0/A1(111) results 

Table II summarizes the results of frequency 
calculations for water adsorbed at an on-top site 
on the Al(lll) surface using the Al 15 (10,5)H 2 O 
cluster shown in Figure 1. The Al 15 (10,5) cluster 
consists of two central Al atoms surrounded by the 
complete set of 13 nearest neighbors from the first 
and second layers of the (111) surface of the Al fee 
lattice. We used the Al 15 cluster in our earlier 
work to explore water adsorption at the on-top 
and bridging sites and found the on-top site to be 
preferred. The frequency calculations are per¬ 
formed by diagonalizing the mass-weighted en¬ 
ergy second-derivative matrix for H 2 0 only and 
the A1 2 H 2 0 subcluster. Not surprisingly, because 
of their weak interaction with Al surface, the fre¬ 
quencies for OH stretches and H—O—H bending 
mode are the same for both the H 2 0 only and 
A1 2 H 2 0 subcluster. A Mulliken population analy¬ 
sis does show charge transfer from the water H 
atoms to the Al surface, which is probably respon¬ 
sible for the adsorbed OH stretches being red- 
shifted from the 4237 and 4102 cm" 1 stretch fre¬ 
quencies computed for free water using the same 
basis set [4, 5]. The free-water bending frequency 
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TABLE II_ 

Vibrational frequencies for H 2 0 adsorbed at an on-top site on the Al(111) surface using the 
AI 15 (10,5)H 2 O cluster. 


H 2 Oonly AI 2 H 2 0 MP2 


Mode 

Uncorrected 

Corrected 

Uncorrected 

Corrected 

Frequency 

5 (A) 

Assignments 

1 

4174 

4174 

4174 

4174 

4186 

0.001 

Antisym. OH str. 

2 

4053 

4053 

4053 

4053 

3749 

0.018 

Sym. OH stretch 

3 

1800 

1800 

1800 

1800 

1606 

0.001 

H —O —H bend 

4 

575 

583 

584 

584 

620 

0.099 

In-plane H 2 0 rock 

5 

243 

298 

317 

319 

232 

0.021 

Al — O stretch 

6 



295 

302 



In-plane Al — Al str. 

7 

200 

220 

236 

352 



H 2 0 trust, rot. 

8 



203 

247 



H 2 0 trust, rot. 

9 



185 

250 



H 2 0 trust, rot. 

10 



166 

266 



Al mode 

11 



138 

252 



Al — Al trans. 

12 



119 

146 



z trust, trans. 

13 

91 

156 

84 

191 



H 2 0 trust, rot. 

14 

84 

121 

76 

98 



x trust, trans. 

15 

61 

279 

57 

404 



y trust, trans. 


The uncorrected frequencies are computed by using H 2 0 only and the AI 2 H 2 0 subcluster at the Hartree-Fock level. Estimates of 
MP2 frequencies are obtained by performing numerical fits along the normal modes computed for the H 2 0-only calculations. The 
shift in the MP2 minimum from the original SCF geometry along each normal mode is given by 8 . All frequencies are in cm" 1 . 


treated with the A1 2 H 2 0 subcluster. Mode 5 corre¬ 
sponds to the A1—O stretching mode and Eq. (5) 
in the H 2 Oonly frequency calculation produces a 
fairly large correction term since, again, we are not 
treating the substrate mass correctly. The corrected 
frequency from the A1 2 H 2 0 subcluster calculation 
shows that mode 5 is now probably converged, 
even though Eq. (5) should not be expected to be 
very accurate for such a low-frequency value. In 
our earlier work [5], we assigned the water sym¬ 
metric stretch, the bending frequency, the in-plane 
rock, and the A1—O stretch to the HREELS peaks 
experimentally observed at 3770, 1645, 775, and 
325 cm -1 , respectively, for the submonolayer cov¬ 
erage with 5.4 X 10 14 water molecules/cm 2 [6]. 

The frequency correction formula [Eq. (5)] serves 
as a useful check on the convergence of the ad¬ 
sorbed water frequencies on the Al(lll) surface. 
The small frequency corrections found for the four 
highest-frequency modes could be used as an ar¬ 
gument that numerical fitting procedures would 
also give adequate frequencies for the adsorbed 
water. However, gradient information is needed 
before the argument for simply performing numer¬ 
ical fit calculations can be made. Furthermore, the 
numerical fitting procedure would require an as¬ 
sumed form for the adsorbed water normal modes 


at 1802 cm -1 is unaffected by the surface adsorp¬ 
tion. Relatively small differences between the un¬ 
corrected and corrected frequencies start to appear 
for modes 4 and 5. Mode 4 is a frustrated water 
rotation where the water does an in-plane rock 
above the A1 atom and the small frequency correc¬ 
tion from Eq. (5) suggests that it is more correctly 



FIGURE 1. The AI 15 (10,5)H 2 O cluster with the water 
preferentially adsorbed atop an Al atom. This cluster 
geometry was obtained by a partial geometry 
optimization which included the water and the two central 
Al cluster atoms. 
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and the advantage of our approach is that we 
avoid making any assumptions by diagonalizing 
the mass-weighted energy second-derivative ma¬ 
trix obtained at the SCF level. Unfortunately, SCF 
calculations are well known to overestimate vibra¬ 
tional frequencies of molecules by 10-13% [7]. The 
trends in the SCF frequencies computed for adsor¬ 
bates on surfaces has not been rigorously quanti¬ 
fied, but one way to obtain improved frequency 
values is to use the SCF normal modes with small 
Eq. (5) corrections as displacement directions for 
numerical fits to the cluster energies calculated at 
a higher level of theory. In Table II, we list the 
adsorbed water frequencies obtained by fitting to 
the MP2 energy variation along the normal modes 
calculated in the H 2 0 only SCF calculations. We 
do find the MP2 symmetric OH stretch and H—O 
—H bending frequencies to be in much closer 
agreement with the HREELS vibrations observed 
at 3770 and 1650 cm -1 . The parameter 8 in Table 
II gives the displacement along each normal mode 
between the SCF and MP2 minimum energies and 
this gives some indication as to whether the opti¬ 
mized MP2 adsorbate geometry is significantly 
different from the SCF minimum. We find that the 
largest 8 displacement occurs for the in-plane H 2 0 
rock mode. Hence, for the modes having small 8 
values, the relatively straightforward SCF vibra¬ 
tional calculation combined with Eq. (5) provides 
some justification for obtaining the water on 
Al(l 11) vibrational frequencies using numerical fits 
at the MP2 level without having to perform the 
more computationally demanding MP2 energy- 
gradient calculation. 

Al/Si(l 11) RESULTS 

The interaction of Al with the Si surface has 
been extensively investigated both experimentally 
and theoretically, and for a more extensive review 
of the literature, we refer the reader to our recent 
cluster calculations for this surface [18]. On sur¬ 
faces with up to l ML Al coverage, scanning 
tunneling microscopy experiments [38] and slab 
calculations [39] indicate that the adatom preferen¬ 
tially adsorbs at a T 4 site where the Al bonds to 
three first-layer Si atoms and is directly above a Si 
atom from the second layer rather than the H 3 site 
where the second layer Si atom is absent. Surpris¬ 
ingly, cluster calculations at the Hartree-Fock level 
found Al adsorption at the T 1 site, where the Al 
atom is adsorbed atop a single Si atom, to be more 
stable than is adsorption at either of the T 4 or H 3 


sites. The Al adsorption induces a Sid 11) surface 
relaxation and our work shows these relaxation 
effects can be simulated providing that the cluster 
is large enough and the energy is computed at 
beyond the Hartree-Fock level. We were able to 
show that the largest relaxation effects occur at the 
T 4 site, resulting in an Al adsorption energy 5 
kcal/mol more stable than adsorption at the H 3 
site and considerably more stable than adsorption 
at the T 4 site. However, we only obtained this 
result for the T 4 site after the partial geometry 
optimization of the AlSi 5 subcluster in the 
AlSi 26 H 24 cluster and computing MP2 energy cor¬ 
rections [18]. Some of the previous cluster calcula¬ 
tions have also attempted to calculate Al adsorp¬ 
tion frequencies even though the Al-to-Si mass 
ratio is 0.96 [40], Kelly et al. in HREELS experi¬ 
ments on the Al/Sidll) surface found peaks at 
525 and 450 cm -1 [41]. Kelly et al. noted that their 
interpretation of the two peaks is complicated by 
the closeness of a peak due to the clean Sid 11)2 X 1 
surface and by the similarity of the Al and Si 
masses. To clarify the origin of the 525 and 450 
cm" 1 peaks, Kelly et al. also investigated In ad¬ 
sorption on the Si surface and again found peaks 
at 525 and 450 cm -1 , suggesting that the Al ad¬ 
sorption results in two new peaks due to localized 
substrate phonons. 

To explore the limits of calculating adsorbate 
vibrational frequencies, we investigated comput¬ 
ing the Al—Si vibrations using the partially opti¬ 
mized Si 22 H 21 Al and Si 26 H 24 Al clusters described 
previously [18] which simulate Al adsorption 
atop the Tj and threefold T 4 sites, respectively. 
Since previous cluster calculations have suggested 
that the preferred Al adsorption is at either the T 1 
or the T 4 sites, we did not perform vibrational 
calculations on the H 3 site [18]. The partially opti¬ 
mized T 1 Si 4 Al and T 4 Si 5 Al subcluster fragments 
from the larger clusters are shown in Figure 2. In 
Tables III and IV, we present the Hartree-Fock 
vibrational frequencies obtained by diagonalizing 
the mass-weighted energy second-derivative ma¬ 
trix for differently sized subclusters at the two 
sites. Tables III and IV show that the Si phonon 
frequencies overlap the Al—Si vibrations and the 
approximate form of Eq. (5) is no longer valid, and 
for this reason, we list a A value equaling the 
amount of the correction which would be obtained 
if Eq. (5) was still correct. Our Al-only frequency 
calculations give essentially the same values as 
obtained by Illas et al. who obtained 237 cm -1 at 
the T l and 313 cm -1 at T 4 sites [40]. In the fre- 
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FIGURE 2. The Si 4 A! and Si 5 Al subclusters used in the frequency calculations for the 7, and 7 4 sites. The atom labels 
Si c , Si b , and Si s correspond to the central, bulklike, and surface Si atoms referenced in Tables ill and IV. 


quency calculation for the T 1 site using the Si 4 Al 
subcluster, the surface normal A1—Si stretching 
frequency shifts to 335 cm -1 , as predicted by Eq. 
(5). However, this shift agreement has to be fortu¬ 
itous since A is still 77 cm" 1 for the new stretch 


frequency value. Although we do not expect Eq. 
(5) to be very reliable for the Al/Si(lll) surface, 
the correction factors A do still serve as a useful 
guide to whether the frequencies are converged. 
The three highest frequencies have the smallest A 


TABLE III 


Vibrational frequencies for Al adsorption at the T, site on the Si(111) surface using the Si 22 H 21 AI cluster. 


Mode 

Al only 

Si 4 Al 


Assignments 

Frequency 

A 

Frequency 

A 

1 



509 

26 

x Si cen 


2 



508 

26 

y ^cen 


3 



466 

31 

7 

'“' , cen 


4 



425 

69 

S*bulk 


5 



425 

69 

Sibulk 


6 



404 

75 

S'bulk 


7 



400 

81 

S'bulk 


8 



368 

85 

Sibulk 


9 



368 

85 

S'bulk 


10 

239 

96 

335 

77 

Al —Si 

z str. 

11 



233 

174 

X S^cen 


12 



233 

174 

y Si cen 


13 



148 

100 

Al —Si 

z trans. 

14 

51 

49 

50 

59 

Al x 


15 

51 

49 

50 

59 

Aly 


The uncorrected frequencies are computed by using Al only and the Si 4 AI subcluster, 
frequency correction from Eq. (5). 

All frequencies are in cm 1 . 

and A is 
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TABLE IV_ 

Vibrational frequencies for Al adsorption at the T 4 site on the Si(111) surface using the Si 26 H 24 AI cluster. 


Mode 

Al only 


Si 4 Al 


Si 5 Al 


Assignments 

Frequency 

A 

Frequency 

a 

Frequency 

A 

1 



585 

39 

641 

7 

S'cen Si bu)k s t r - 

2 



484 

21 

485 

23 

Sicen—Sisurf * StT- 

3 



484 

21 

485 

23 

Sicen—Sisurf Y Stl\ 

4 





454 

99 

Sibulk X 

5 





453 

98 

sibuik y 

6 



426 

42 

444 

49 

Sisurf 

7 



416 

64 

416 

64 

Sisurf 

8 



416 

64 

416 

64 

Si S urf 

9 





371 

80 

Sibulk Z 

10 



369 

81 

369 

80 

S'surf 

11 



337 

23 

336 

24 

Si S urf 

12 



337 

23 

336 

24 

Sisurf 

13 

317 

85 

308 

26 

306 

21 

Al—Si cen z str. 

14 



212 

144 

210 

146 

Sicen 

15 



212 

144 

210 

146 

Sicen 

16 

260 

95 

196 

76 

196 

76 

x Al 

17 

259 

95 

196 

76 

196 

76 

y Al 

18 



231 

128 

184 

109 

Al—Si cen z trans. 


The uncorrected frequencies are computed by using Al only and the Si 4 Al and Si 5 Al subdusters. All frequencies are in cm 1 and 
A is frequency correction from Eq. (5). 


correction factors in Table III and correspond to 
vibrational modes for the central Si atom directly 
underneath the Al adatom which is also bonded to 
the three bulklike Si atoms in the Si 4 Al subcluster. 
The three bulk Si atoms in the Si 4 Al subcluster 
each have three nearest neighbors not included in 
the Si 4 Al subcluster, and as a result, their A values 
are larger. It is worth noting that the surface nor¬ 
mal vibration for the central Si atom at 466 cm -1 
agrees remarkably well with the 450 cm -1 peak 
observed in the HREELS experiments by Kelly 
et al. [41]. 

Similar trends in the frequency-correction term 
A are found in Table IV for the T 4 site. The Si 4 Al 
subcluster does not include the Si bulk atom from 
the third layer of the Si(lll) surface. From both the 
Si 4 Al and Si 5 Al subclusters, the Al adatom surface 
normal vibration appears to be converging to a 
frequency of 306 cm” 1 and is not that much differ¬ 
ent from the frequency calculated in the Al-only 
calculation. The convergence of the highest- 
frequency mode, which is associated with the cen¬ 
tral Si directly underneath Al and above the Si bulk 
atom, is more sensitive to the subcluster size used 
and improves tremendously on going to the Si 5 Al 
subcluster. We suspect that this surface normal 


highest-frequency mode could be responsible for 
the 525 cm -1 peak observed in HREELS, although 
our SCF frequency value is much too large. The 
small A value computed for this mode suggests 
that it might be valid to use the Si 5 Al subcluster 
normal modes to perform a numerical fit to the 
MP2 energy and this work is in progress. Overall, 
these Al/Si(lll) cluster calculations show that 
while Eq. (5) is not valid for computing adsorbate 
frequency correction terms, Eq. (5) can still serve 
as a useful guide to the convergence of the fre¬ 
quency of surface vibrations. 


Conclusions 

We have presented a general strategy for com¬ 
puting vibrational frequencies for adsorbates on 
surfaces. Rather than use numerical fitting meth¬ 
ods, we advocate using a scheme which involves 
energy-gradient calculations. The initial vibra¬ 
tional frequencies are obtained by diagonalizing a 
mass-weighted energy second-derivative matrix 
formed for a subcluster of atoms from the surface. 
The diagonalization avoids having to make as¬ 
sumptions on the form of the adsorbate normal 
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modes. Furthermore, the energy gradients facili¬ 
tate making estimates of the influence of the 
adsorbate/substrate mass ratio through the 
frequency-correction formula given by Eq. (5). 
Equation (5) produces only small frequency- 
correction terms for the H/A1(100) surface since 
the H-to-Al substrate mass ratio is small. For the 
O/Al(l 11) surface, we find larger frequency cor¬ 
rections, and these are accurate providing that the 
frequency value of interest is greater than the 
maximum phonon frequency of the substrate. 
However, as we show for H 2 0/A1(111) and 
Al/Sidll) surfaces, even when the surface vibra¬ 
tion frequency does not exceed the maximum sub¬ 
strate phonon frequency value, Eq. (5) still serves 
as a useful guide for checking if the frequency of a 
surface normal mode is converged. Once the fre¬ 
quency of surface normal mode has been shown to 
converge, one can then resort to numerical fitting 
methods and obtain the adsorbate vibrational fre¬ 
quency at a higher level of theory where the gradi¬ 
ent calculation is more prohibitively expensive. 
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ABSTRACT: Systematic theoretical studies for the mechanisms of the epoxidation and 
complete oxidation of ethylene and propylene over silver surface as well as the reactivity 
and the stability of oxygen species on Cu, Ag, and Au surfaces have been presented. The 
dipped adcluster model (DAM) combined with the ab initio Hartree-Fock (HF), second- 
order Moller-Plesset (MP2), and SAC/SAC-CI (symmetry-adapted cluster/configuration 
interaction) methods are used. These studies clarify the origin of silver as a unique 
effective catalyst for the epoxidation of ethylene and the different mechanisms for the 
oxidation of olefins over silver surface. For the epoxidation of ethylene, the superoxide 
O^", which is molecularly adsorbed in bent end-on geometry on the silver surface, is the 
active species. The origin of the unique catalytic activity of silver for the epoxidation of 
ethylene is due to its ability to adsorb oxygen as the superoxide species. Such an 
adsorbed species cannot be stable or exist on Cu and Au surfaces. For the oxidation of 
propylene, both reaction mechanisms initiated by the activation of olefinic carbon and by 
the activation of the allyl hydrogen exist. The activation of the allyl hydrogen is the 
origin of the complete oxidation of some olefins over silver surface. The present results 
not only let us have a better understanding of the reaction mechanisms of olefins over 
silver surface but also supply a basic idea for the new catalyst design of the epoxidation 
reaction. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 839-855, 1997 
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Introduction 


T he heterogeneous selective oxidation of 
olefins to epoxides on silver 


:c=c 


+ —CA 


olefin 


Ag 


^c— 

\ / 

o 

epoxide 


( 1 ) 


is an exceedingly important industrial catalytic 
reaction, and as such has received long and exten¬ 
sive study [1-7]. One particularly interesting as¬ 
pect of the epoxidation reaction is its uniqueness: 
Not only is silver a uniquely effective catalyst for 
heterogeneous epoxidation but also ethylene is the 
only hydrocarbon which may be epoxidized with 
high selectivity. Actually, silver catalysts are used 
in industry to produce several million tons of 
ethylene oxide yearly [1-4]. Recent experimental 
studies show that, except for ethylene, only a few 
olefins such as styrene [8, 9], 3,3-dimethylbutene 
[10], norbornene [11], and butadiene may be epoxi¬ 
dized over silver with high selectivity. In contrast, 
other olefins such as propylene, butenes, and 
pentenes are epoxidized over silver but with ex¬ 
tremely low selectivity [12-16]. 

During the past 20 years, many studies [12-24] 
have been devoted to getting a better understand¬ 
ing of the epoxidation reaction mechanism. How¬ 
ever, the origin of this unique catalytic activity of 
silver has not yet been clarified, and also the 
mechanistic details of the epoxidation and that of 
the competing combustion to carbon dioxygen and 
water are far from understood. One of the main 
questions about the mechanism of ethylene oxida¬ 
tion is the roles of the adsorbed molecular and 
atomic oxygen. Both mechanisms involving molec¬ 
ular oxygen [12-18] and atomic oxygen [19-23] as 
the active species have been suggested. It has been 
suggested that the uniqueness of silver is associ¬ 
ated with its ability to adsorb oxygen as the super¬ 


oxide species [24]. More direct evidence for molec¬ 
ular oxygen as the active species has been ob¬ 
tained by several studies: e.g., only complete oxi¬ 
dation occurs when N 2 0 pulse which gives atomi¬ 
cally adsorbed oxygen is used, but the epoxidation 
reaction occurs when 0 2 pulse which gives molec- 
ularly adsorbed oxygen, at least initially, is used 
[25], The mechanism assuming atomic oxygen as 
an active species was suggested from the experi¬ 
mental data that only atomically adsorbed oxygen 
exist at high temperature in an industrial condi¬ 
tion, since molecular oxygen desorbs or dissoci¬ 
ates. The assumption that molecular and atomic 
oxygens cause the partial and total oxidation reac¬ 
tions, respectively, gives an upper limit for the 
selectivity of 6/7. Recent experiments giving 
higher selectivity than 6/7 [26] imply that the 
atomic oxygen may at least pay an important role 
for the epoxidation of ethylene. 

Theoretical studies have done much less for the 
epoxidation reaction of olefins. Though a few 
works [27-30] studying the epoxidation mecha¬ 
nism of ethylene have ever been published, less 
discussions were presented [27, 30] for the oxida¬ 
tion of propylene, and the calculations did not 
explicitly include the surface contribution. The 
main difficult problem still remaining in the theo¬ 
retical studies is how to describe chemisorption 
species and calculate adsorption energy correctly 
based on a selected small cluster. This may be why 
no ab initio theoretical study for the full reaction 
pathway and the associated energy diagrams of 
the epoxidation reaction of olefins has been re¬ 
ported. 

In a series of our previous studies [31-37], a 
theoretical model, named the dipped adcluster 
model (DAM) [31-33], has been proposed to study 
chemisorptions and surface reactions by involving 
the interactions between bulk metal and ad- 
molecules with consideration of the electron trans¬ 
fer between them and the image force correction. 
Such an interaction is found to be very important 
for studying the chemisorptions and the reactions 
of the oxygen on a metal surface, and even a small 
DAM could be used to give correct electronic 
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structures and reliable adsorption energies in con¬ 
trast to the cluster model. Previous studies [34-36] 
for the chemisorption of oxygen on a silver surface 
showed that the molecularly adsorbed oxygen ex¬ 
ists in two stable geometries; i.e., an end-on (bent) 
structure in an on-top site and a side-on structure 
in a bridge site. The ground state in the former 
geometry is superoxide, which is expected to be 
more electronically favorable for the reaction with 
ethylene [36]. 

In this work, we present a brief account of our 
recent systematic studies on the catalytic activity 
of silver surface for the oxidation of olefins. We 
will try to address the following three questions: 
(1) What species of surface oxygen is responsible 
for the ethylene epoxidation? (2) Why is silver a 
unique catalyst for the epoxidation? (3) Why do 
higher olefins mainly process complete oxidation 
on the same silver catalyst? We performed the ab 
initio Hartree-Fock (HF), second-order Moller- 
Plesset (MP2), and the SAC (symmetry-adapted 
cluster)/SAC-CI (configuration interaction) calcu¬ 
lations with the use of the DAM. More details are 
seen in the literature [38-40]. 


Epoxidation Mechanism of Ethylene on 
Silver Surface 

In this section we give an account of our re¬ 
search on the mechanism of the epoxidation of 
ethylene on a silver surface [38]. The energy dia¬ 
grams of the reactions of ethylene with both 
molecularly adsorbed and atomically adsorbed 
oxygens on a Ag surface are studied. The Eley- 
Rideal mode, namely the reaction between the 
adsorbed oxygen species and a gaseous ethylene is 
assumed. The DAM was used to include the ef¬ 
fects of the bulk metal such as the electron transfer 
between adcluster and surface and the image force. 
Geometry optimizations for calculating the reac¬ 
tion path were performed using the ab initio unre¬ 
stricted Hartree-Fock (UHF) method and the ener¬ 
gies at the optimized geometries were calculated 
by the MP2 methods including the image force 
correction. The Gaussian basis set used is of dou- 
ble-zeta accuracy, and some additional functions 
were added for improvement. They are shown to 
reproduce well the known geometries and the en¬ 
ergetics of the reactions [38]. The model adclusters 
used in the present study are the molecularly 
adsorbed end-on (bent) form in an on-top site and 
the atomically adsorbed on-top structure. 


REACTIONS WITH MOLECULARLY 
ADSORBED OXYGEN 

Epoxide Formation 

We first study the epoxidation reaction of eth¬ 
ylene with the molecularly adsorbed superoxide 
on a silver surface. For the formation of ethylene 
oxide, the reaction takes place by an initial interac¬ 
tion of the oxygen with one of the carbon in 
ethylene [5, 6]. Figure 1 shows the obtained favor¬ 
able initial attack of ethylene to superoxide species 
adsorbed on the silver surface. Starting from this 
geometry, we optimized the geometries of the in¬ 
termediates and the transition states (TSs) along 
the reaction pathway. The optimized geometries 
and energy diagrams are shown in Figure 2. The 
reaction proceeds from the left to the right, and the 
energy is in kcal/mol relative to the free system, 
i.e., Ag 2 + 0 2 + C 2 H 4 . 

Figure 2 shows a two-step mechanism of the 
reactions. The first step of this reaction is the 
adsorption of molecular 0 2 on Ag surface in which 
one electron is transferred from the bulk metal to 
the adcluster, i.e., n — 1. The transferred electron 
mainly occupies the out-of-plane p* orbital of 0 2 , 
and the in-plane p* molecular orbital (MO) of 0 2 
remains singly occupied. The ground state for the 
end-on adsorption of 0 2 is the superoxide. The 
adsorption energy of superoxide is calculated to be 



FIGURE 1 . Model adcluster used. Geometries were 
optimized in the Cs symmetry except for the Ag — Ag 
distance and the O a — Ag — Ag angle fixed at 2.8894 A 
and the 90.0°, respectively. 
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FIGURE 2. Energy diagram for the reaction between ethylene and molecularly adsorbed superoxide on Ag surface. 
The route leading to ethylene oxide (EO) and acetaldehyde (AA) are shown by the solid and broken lines, respectively. 


7.4 kcal/mol at the UHF level and 26.9 kcal/mol 
at the MP2 level. The experimental molecular ad¬ 
sorption energies are 9.2, 9.3, and 24.1 kcal/mol 
for the Ag(lll) [18], Ag(110) [18], and electrolytic 
Ag [41] surfaces, respectively. The next step is the 
attack of ethylene onto the terminal oxygen atom 
O b which is more reactive than the inside one O a , 
leading to the TS and the intermediate with the 
energy barrier of 11.3 kcal/mol. The energy level 
of the intermediate is similar to that of initial step. 
In the TS the a bond between O b and C a is 
formed and the in-plane p bond of 0 2 and the p 
bond of C 2 are broken. The geometry parameters 
at every step clearly reflect the reaction processing 
along the reaction path [38]. 

When the intermediate is produced, the forma¬ 
tion of ethylene oxide is an easy path as shown by 
the real line, leaving an atomic oxygen on the Ag 
surface. The calculated heat of reaction is 37.1 
kcal/mol. The energy diagram given in Figure 2 
shows that the reaction leading to ethylene oxide 
proceeds very smoothly: The reaction is exother¬ 
mic, there are no very high barriers, and there are 
no too stable intermediates. We therefore conclude 
that the epoxide formation reaction from the su¬ 


peroxide species on a silver surface proceeds very 
smoothly. We note that the exothermicity of the 
overall reaction is partially due to the formation of 
the atomically adsorbed oxygen on the silver sur¬ 
face. 

Acetaldehyde Formation 

We next study the formation process of ac¬ 
etaldehyde from ethylene and molecularly ad¬ 
sorbed superoxide on the Ag surface. In Figure 2, 
the path is given by the broken line. When ac¬ 
etaldehyde is produced, it is further oxidized to 
CO 2 and H 2 0 by combustion in the presence of 
adsorbed oxygen, and therefore it is an intermedi¬ 
ate in the complete oxidation process [42]. 

The formation of acetaldehyde starts from the 
intermediate shown in the center of Figure 2: up to 
it the reaction path is common to the epoxidation 
reaction. H rt bound to C n in the intermediate mi¬ 
grates to C b/ giving a stable conformation denoted 
as acetaldehyde intermediate in Figure 2. How¬ 
ever, the energy barrier for this H fl migration is as 
high as 71.5 kcal/mol. This step is therefore ener¬ 
getically forbidden. This is a reason for the high 
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selectivity of the epoxide formation process start¬ 
ing from the superoxide species summarized in 
the preceding section. 

Effect of Silver Surface 

The effect of the silver surface may be twofold: 
One is to provide a reactive species adsorbed on 
the surface, and the other is to provide electrons to 
the reaction site as considered in the DAM. Figure 
3 shows the energy diagram for the reaction be¬ 
tween ethylene and gaseous oxygen without the 
silver surface. Two energy diagrams correspond to 
the calculations with n — 1 and n = 0. The former 
involves an excess electron but the latter is neutral. 
For n = 1, we used the optimized geometries 
shown in Figure 2, and, for n = 0, we performed 
the geometry optimization for the present purpose. 
The geometries illustrated in the upper and lower 
sides correspond to those for n = 0 and n = 1, 
respectively. For n — 0, the O—O distance is cal¬ 
culated to be 1.37 A for the intermediate, which is 

o 

smaller than the corresponding value 1.49 A for 
n — 1. The same is true for all the intermediates 
and TSs. 


Comparing Figure 3 with Figure 2, we clearly 
see the catalytic activity of the silver surface for 
the epoxidation reaction. Without the surface, the 
reaction is endothermic and the pathway has a 
large barrier. Electron transfer (n = 1) certainly 
reduces the barrier, but the direct interaction with 
the actual silver atoms on the surface is quite 
important. Clearly, the exothermicity of the reac¬ 
tion shown in Figure 2 is due to the formation of 
the atomically adsorbed oxygen on the surface. 

REACTIONS WITH ATOMICALLY ADSORBED 
OXYGEN 

When ethylene oxide is ejected out of the Ag 
surface, the atomically adsorbed oxygen O' is left 
on the surface. It is also produced by the dissocia¬ 
tive adsorption of 0 2 on the surface, and the 
dissociative state is more stable than the molecular 
adsorption state. In our calculation, the dissociated 
oxygen has its singly occupied orbital in the 2 p 
orbital parallel to the surface. 

Figure 4 shows the energy diagram for the reac¬ 
tion of ethylene with the atomically adsorbed oxy¬ 
gen on the Ag surface. When ethylene attacks 
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FIGURE 3. Energy diagram for the reactions between ethylene and gaseous oxygens [neutral 0 2 (n = 0) and 0 2 
anion (n = 1)]. The upper geometries are for n - 0, and the lower geometries for n = 1. 
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FIGURE 4. Energy diagram for the reaction between ethylene and atomically adsorbed oxygen on Ag surface. 


atomic oxygen on the surface, the most favorable 
approach is to form the C a —C\ bond as illustrated 
in Figure 4. It is similar to the approach of ethylene 
to the superoxide species shown in Figure 2. The 
calculated energy barrier from the adsorbed oxy¬ 
gen to the intermediate is, however, as small as 1.5 
kcal/mol, reflecting the reactivity of the atomic 
oxygen. 

The next step from this intermediate is very 
important, since it determines the selectivity of the 
atomically adsorbed oxygen. If C b attacks the oxy¬ 
gen to form the —O fl bond, ethylene oxide is 
formed. If C b attacks another C—H bond, causing 
the hydrogen migration, acetaldehyde is a prod¬ 
uct, which is an intermediate for the complete 
oxidation product. 

For the hydrogen migration reaction, the barrier 
was calculated to be 39.1 kcal/mol, which is high 
but lower than the barrier, 71.5 kcal/mol, of the 
reaction in which the superoxide species was in¬ 
volved (see Fig. 2). This process were calculated 
using the adcluster with n = 1. Unfortunately, the 
structure of the TS to form the C b — O n bond could 
not be determined by the geometry optimization 
procedure using both of the n = 1 and n = 0 ad- 
clusters. If the pathway from the intermediate to 


ethylene oxide is one step, it is a quite interesting 
step since it involves both geometrical changes 
and one electron back-transfer from the admolecule 
to the bulk metal. In the DAM picture, the inter¬ 
mediate in Figure 4 is neutrally described with 
n = 1, i.e., with one additional electron supplied 
from the bulk metal to the adcluster, but the prod¬ 
uct, ethylene oxide plus Ag 2 should be neutral, 
i.e., n = 0. Therefore, we calculated two potential 
curves, one starting from the intermediate and the 
other from ethylene oxide plus Ag 2 using n = 1 
and n = 0, respectively. All the geometrical pa¬ 
rameters were assumed to change linearly, except 
for the Ag—O distance, which was fixed to 
2.180 A. 

The potential curve for n = 1 increases mono- 
tonically from the intermediate to the product, 
while the curve for n — 0 also increases monotoni- 
cally from ethylene oxide and Ag 2 to the interme¬ 
diate, and the two energy curves cross. The energy 
difference between the crossing point and the in¬ 
termediate is about 36 kcal/mol, which may be 
considered as the energy barrier of this process. 
Namely at this TS, one electron is transferred back 
to the metal. 
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The selectivity giving either ethylene oxide or 
acetaldehyde would be dependent on the heights 
of the barriers of the two processes and on the 
stabilities of the two products. The calculated en¬ 
ergy barriers for the two processes are similar, 39.1 
and about 36 kcal/mol, respectively. On the other 
hand, the energy difference between the two prod¬ 
ucts and the intermediate are +15.6 and -13.7 
kcal/mol, respectively, which means endothermic- 
ity and exothermicity from the intermediate, re¬ 
spectively. From the above data alone, it is diffi¬ 
cult to decide which is a preferential process, 
though the acetaldehyde formation may be favor¬ 
able for the exothermicity. However, it may be 
said that both ethylene oxide and acetaldehyde are 
formed from ethylene and atomically adsorbed 
oxygen on the Ag surface. Furthermore, this fact is 
very important since it would be a reason for the 
experimental maximum selectivity more than 6/7 
[5-7]. 

OVERALL MECHANISM OF EPOXIDATION OF 

ETHYLENE 

We summarize here our results on the mecha¬ 
nism of the epoxidation of ethylene on a silver 
surface. The calculated energy barriers and the 
heats of the reactions are summarized in Table I. 
The primarily important species for the epoxida¬ 
tion of ethylene on a silver surface is the superox¬ 
ide 0 2 which is molecularly adsorbed on the 
surface in the bent end-on geometry. Ethylene at¬ 
tacks the terminal oxygen atom of the superoxide, 
as shown in Figure 1, with the barrier of about 11 
kcal/mol, and then the reaction proceeds quite 
smoothly leading to ethylene oxide as shown in 


Figure 2. The overall reaction is exothermic by 37 
kcal/mol from C 2 H 4 + 0 2 + Ag surface or by 10 
kcal/mol from C 2 H 4 + superoxide on Ag surface. 
The second exothermicity is due to the larger ad¬ 
sorption energy of the atomic oxygen than that of 
the superoxide. 

On the contrary, the complete oxidation of ethy¬ 
lene from the superoxide species is forbidden due 
to the existence of the large barrier (72 kcal/mol) 
in the hydrogen migration step, though this reac¬ 
tion is largely exothermic by 66 kcal/mol from the 
initial compounds or by 40 kcal/mol from the 
superoxide on Ag and C 2 H 4 . When silver surface 
does not exist, the epoxidation reaction is a very 
unfavorable reaction having a high-energy barrier 
and endothermicity. Thus, the epoxidation of eth¬ 
ylene by the superoxide on the Ag surface is highly 
efficient and selective. 

The atomically adsorbed oxygen, which is left 
on the surface after completion of the epoxidation 
reaction by the superoxide or which may exist by 
the dissociative adsorption of 0 2 on the surface, 
has two reaction channels leading to ethylene ox¬ 
ide and to complete oxidation. The selectivity here 
seems to be small and both products would be 
obtained. This would be the reason why some 
experiments reach the selectivity larger than 6/7. 

In the process leading to ethylene oxide, the 
electron back-transfer from the reaction adsorbate 
complex to the metal surface should be important 
and related to the barrier. We note that both elec¬ 
tron acceptor and donor may work to reduce the 
barrier of the electron transfer step: They work to 
stabilize the n = 1 and n = 0 curves, respectively. 
The effects of Ce and halogen as promoters of 
catalyst are well known experimentally [5-7]. We 


TABLE I 


Comparison of the energy barrier and the heat of reaction for the epoxidation and acetaldehyde formation by 
the molecularly and atomically adsorbed oxygens (kcal / mol). 


Reaction 

Energy barrier 

Heat of reaction 3 

Molecularly adsorbed oxygen 



0 2 (a) / Ag + C 2 H 4 —» C 2 H 4 0 + 0(a) / Ag 

11.3 

10.2 

0 2 (a) / Ag + C 2 H 4 CH 3 CHO + 0(a) / Ag 

71.5 

39.5 

Atomically adsorbed oxygen 



0(a) / Ag + C 2 H 4 —» C 2 H 4 0 + Ag 

~ 36 

21.5 (24.7) 

0(a) / Ag + C 2 H 4 -> CH 3 CHO + Ag 

39.1 

50.8 (52.3) 

Without Ag surface 



o 2 + c 2 h 4 —> 0 + c 2 h 4 o 

31.2 

-13.2 (-11.3) 

o 2 + c 2 h 4 —» 0 + c 2 h 4 o 

52.9 

-38.0 (-35.4) 

a Values in parentheses show the experimental values. 
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speculate that these promoters act in this electron- 
transfer step. Design of the promoter or co-cata- 
lyst, which is effective to this electron-transfer step, 
is of crucial importance since it would be a key for 
raising up the selectivity over 6/7. 


Activation of 0 2 on Cu, Ag, and Au 
Surfaces 

The experimental results show that silver is a 
unique catalyst for the epoxidation of ethylene. 
However, it is not yet well understood why only 
silver is so exceptional. In the former section, we 
showed that the superoxide species adsorbed in 
the end-on form on a Ag surface is very active and 
selective for the epoxidation reaction. Aiming to 
clarify the reason why only silver is an effective 
catalyst for the epoxidation of ethylene, in this 
section, we perform a comparable study for the 
chemisorption and activation of oxygen on Cu, Ag, 
and Au surfaces. 

We take M 2 0 2 (M = Cu, Ag, Au) as adclusters. 
Both electronic structures and the reactivity of the 
molecular end-on and side-on adsorptions are 
studied. The geometries of the molecular end-on 
and side-on species are optimized at the UHF 
level. The relative stabilities of the adsorbed species 
is studied by SAC/SAC-CI method [43-45], since 
the roles of lower ground and excited states are 
quite important in this kind of surface electronic 
processes [34-36]. The DAM [31-33] is again 
adopted to investigate the electron transferability 
of these surfaces. 

REACTIVITY AND STABILITY OF 

SUPEROXIDE SPECIES 

Figure 5 shows an illustration of the geometries 
of the end-on and side-on forms. The calculations 
are performed by assuming one-electron transfer 



FIGURE 5. Illustration of the geometrical conversion 
from end-on (#1) to side-on (#4) forms. The M — M 
(M = Cu, Ag, and Au) distances were fixed at the lattice 
constants. 

from the bulk to the adcluster occurs. The opti¬ 
mized O—O distances are 1.36-1.38 A in the end- 
on form, which are close to or slightly greater than 
that of a free 0 ? anion (1.35 A) [46], and that in the 
side-on form are all about 1.51 A. The calculated 
Cu—O distance are shorter than the Au—O and 
Ag—O distances, reflecting a large adsorption en¬ 
ergy and a short lattice constant of Cu. 

The calculated electronic structure of 0 2 in the 
end-on and side-on adsorption forms is superoxide 
0 2 and peroxide O , respectively. Table II shows 
the net charge and frontier density (or spin popu¬ 
lation), which are related to the reactivity of the 
superoxide species. In all three species, the net 
charges on the inside (0„) and outside (O fc ) oxygen 
atoms are calculated to be about —0.6 and —0.15, 
respectively, while the outside oxygen atoms have 
a frontier density of about 0.86, which is much 
larger than that of the inside oxygen atoms (about 
0.12). This shows that the outside oxygen is much 
more reactive than the inside one and that the 
former is not so negative as the latter one, the 
same result as that reported before [36] for a linear 
Ag—O—O system. Actually, in the epoxidation 
reaction with ethylene, the oxygen that reacts with 


TABLE II 


i nh/kh ■■ ______— -—— - 

Net charge and frontier density of the end-on superoxide species on Cu, Ag, and Au surfaces. 

System 3 


Net charge 



Frontier density 


M a 

M b 

0 a 

0 b 

M a 

M b 

0 a 

o b 

Cu^Oc 

+0.060 

-0.271 

-0.643 

-0.146 

0.007 

0.002 

0.133 

0.858 

Acj?0? 

+0.058 

-0.295 

-0.605 

-0.158 

0.014 

0.003 

0.119 

0.864 

Au 2 0 2 

+0.095 

-0.285 

— 0.644 

-0.166 

0.007 

0.002 

0.116 

0.876 

a The geometry of the end-on form is shown in Fig. 1. 
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ethylene is the outside one as shown in the above 
section. Based on the analyses of the net charge 
and the frontier density, similar reactivity is ex¬ 
pected for the end-on superoxide species on Cu, 
Ag, and Au surfaces. 

The reactivities of the superoxide species on Cu, 
Ag, and Au surface for ethylene are also very 
similar [39]. The differences among Cu, Ag, and 
Au surfaces are small, if the superoxide species 
exists on the metal surfaces. Large observed differ¬ 
ences in the catalytic activity of these metal sur¬ 
faces are therefore attributed to the different stabil¬ 
ities of the adsorbed oxygen species, particularly 
the stability of the superoxide species. 

Next, we examine a conversion from the end-on 
to the side-on geometries. This process accompa¬ 
nies a conversion from the superoxide to the per¬ 
oxide, since the ground state of the end-on and 
side-on species are the superoxide and peroxide, 
respectively, as shown in some details in this sec¬ 
tion. Thus, the method to be used here should be 
able to describe both ground and excited states in 
a good accuracy. For this purpose, the HF method 
is inadequate, we use the SAC/SAC-CI method 
[43-45]. The geometries optimized at the HF level 
are used for the end-on (#1) and side-on (#4) 
forms, respectively. The intermediate geometries 
#2 and #3, which are illustrated in Figure 5, are 
approximated as a linear function of the OMM 
angle. 

Figure 6 shows the potential energy curves 
(PECs) calculated for the conversion process be¬ 
tween the end-on and side-on forms on the Cu, 
Ag, and Au surfaces. In the end-on geometry, the 
superoxide ( 2 B 2 ) is always the ground state, but in 
the side-on geometry, the ground state is the per¬ 
oxide species ( 2 A T ). The superoxide species with 
the 2 A 2 symmetry is less important since it never 
becomes the ground state in this conversion pro¬ 
cess. In the 2 B 2 superoxide, the in-plane 7r* orbital 
of oxygen is singly occupied while the out-of-plane 
77* orbital is doubly occupied. These occupations 
are reversed in the 2 A 2 state. The crossing of the 
PECs of the superoxide and peroxide occurs be¬ 
tween the end-on and side-on geometries. 

Relative stabilities of the superoxide and perox¬ 
ide in the end-on and side-on geometries and the 
barrier of the conversion are quite important for 
understanding the catalytic activities of metals for 
the epoxidation of ethylene. Such information is 
given in Figure 6. In the end-on form, the superox¬ 
ide species is more stable than the peroxide species: 
the energy difference is about 20 kcal/mol for Ag 



End-on Side-on 

FIGURE 6. Potential energy curves calculated by the 
SAC/SAC-CI method for the conversion between the 
end-on (#1) and side-on (#4) adsorption states in the 
lowest three states of the Cu 2 0 2 , Ag 2 0 2 , and Au 2 0 2 
adclusters with n = 1. 


and Au and 5 kcal/mol for Cu. In the side-on 
form, the peroxide species is more stable than the 
superoxide species: The energy difference is 50.1, 
27.6, 21.2 kcal/mol for Cu, Ag, and Au, respec¬ 
tively. The energy difference between the end-on 
superoxide and the side-on peroxide is 50.8, 23.1, 
and —1.2 kcal/mol for Cu, Ag, and Au, respec¬ 
tively. On the Cu surface, the superoxide species 
in the end-on form easily converts into the perox¬ 
ide species in the side-on form because of a large 
stabilization energy and for a lack of the barrier: 
The lifetime of the end-on superoxide species 
should be very short on the Cu surface. On the 
other hand, an energy barrier of about 20 kcal/mol 
is calculated for the conversion on Au. On the Ag 
surface, the end-on superoxide should have a con¬ 
siderable lifetime as seen from the PECs shown in 
Figure 6. Based on the above results, we can con¬ 
clude that the reason for the nonselectivity of Cu 
for the epoxidation of ethylene is due to the ab¬ 
sence of the stable adsorbed superoxide species. 
However, for Au, the results obtained from Figure 
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6 does not give a clear explanation for its inability 
of this reaction. As far as one-electron transfer 
from the bulk metal into the adcluster occurs for 
Cu, Ag, and Au, the reactivity of the superoxide 
on these metals should be similar. 

ELECTRON TRANSFERABILITY OF Cu, Ag, 

AND Au 

The results listed in the preceding section show 
that the oxygen superoxide species adsorbed on 
Au surface has essentially the same reactivity as 
those on Cu and Ag surfaces. The stabilities of the 
molecularly adsorbed oxygen species are mainly 
due to the electron-transfer effect from the bulk 
metal as already shown previously for silver 
[34-36] and palladium [31, 32]. The chemical po¬ 
tentials (work functions) of the Cu, Ag, and Au 
surfaces are 4.48, 4.52, and 5.37 eV for the (110) 
surfaces, 4.94, 4.74, and 5.31 eV for the (111) sur¬ 
faces, and 4.59, 4.64, and 5.47 eV for the (100) 
surfaces [47]. The Cu surface is expected to have 
similar properties to the Ag surface. However, it is 
not clear whether the electron transfer also occurs 
for the Au surface. We therefore investigate here 
the E(n) curves of the M 2 —0 2 systems in the 
DAM in order to clarify why Au cannot be a good 
catalyst for the epoxidation of ethylene. 

Figure 7 is a display of the E(n) curves, namely 
the energy of the adcluster calculated as a function 
of n. It was calculated with the use of the highest 
spin coupling model [39]. The optimized end-on 
geometries of M 2 0 2 adclusters are adopted since 
they are the most stable geometries of the superox¬ 
ide adsorbed species. The E(n) curves are upper 
convex, and at n = 1.0, the systems are more sta¬ 
ble than at n = 0.0, and the order of stability is 
Cu 2 0 2 > Ag 2 0 2 > Au 2 0 2 . According to DAM 
[31], the electron transfer would occur between the 
adcluster and the bulk metal when the chemical 
potential of the adcluster becomes equal to the 
chemical potential of the solid surface, i.e., 
dE(n)/dn = -/x. For Cu 2 0 2 and Ag 2 0 2 systems, 
the tangents of the E(n) curves become equal with 
the experimental chemical potentials at about n = 
0.9. Therefore, from the concept of DAM [31], one 
electron should flow from the Cu and Ag bulk 
metals into the adcluster after some barrier. This is 
the case we used to study the activation of 0 2 on 
the surfaces shown in the preceding section. How¬ 
ever, for the Au 2 0 2 system, though the energy at 



FIGURE 7. E(n) curves for the Cu 2 0 2 , Ag 2 0 2 , and 
Au 2 0 2 adclusters in the highest spin coupling model at 
the optimized end-on geometries. 


n — 1.0 is lower than that at n = 0.0, the tangent of 
the E(n) curve is still smaller than the experimen¬ 
tal chemical potential. Therefore, different from Cu 
and Ag metals, one-electron flow does not occur 
from the Au metal into the adcluster. This point of 
the Au surface is essentially different from the 
other metals and should be the reason why the 
molecularly adsorbed oxygen species cannot exist 
on the clean gold surface. Thus, the inactivity of 
the gold surface for the oxygen is explained by the 
DAM as being due to its poor donating property. 

The chemical potential of Au is much larger 
than those of Cu and Ag. We note that this differ¬ 
ence is attributed actually to the relativistic effect 
in gold [48, 49]. The importance of the relativistic 
effects to the valence-electron properties such as 
chemical shift has been recently clarified in this 
laboratory [50-52]. 

We further note that the electron flow is diffi¬ 
cult to occur between the clean gold metal and the 
adcluster because it does not satisfy the condition 
clarified by the DAM [31]. However, as the energy 
at n = 1.0 is lower than that at n — 0.0, the elec¬ 
tron transfer can be realized if the chemical poten¬ 
tials fi of the gold metal can be lowered by an 
addition of promoters, etc. Experimentally, the re¬ 
cent works performed by Haruta et al. [53, 54] 
show that gold is remarkably active for the oxida¬ 
tion of CO and hydrocarbons when it is supported 
on suitable metal oxides. This may be explained by 
such effect of the metal-oxide support. 
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BRIEF SUMMARY 


small substitution of hydrogen with methyl group, 


The electron transfer from metal to 0 2 is a key 
factor for the chemisorption of oxygen on a metal 
surface, as shown previously [32-37], The differ¬ 
ences in the relative stabilities of different oxygen 
species on different metal surfaces are attributed 
to the differences in the electron-donating ability. 
The present DAM calculations show that, for Cu 
and Ag surfaces, one-electron transfer from the 
bulk metal into the adcluster occurs after some 
barriers. On the other hand, electron transfer is 
difficult to occur for the Au surface since the gold 
surface has the higher work function. This is why 
molecular adsorption of 0 2 does not occur on the 
Au surface. 

On Cu surface, peroxide is much more stable 
than superoxide, and there is essentially no barrier 
for the conversion from superoxide to peroxide, so 
that the lifetime of the superoxide species should 
be too short to react with ethylene. On Ag surface, 
however, the superoxide species would have con¬ 
siderable lifetime on the surface. On Au surface, 
the superoxide species does not exist because the 
molecular adsorption of 0 2 does not occur due to 
the difficulty of the electron transfer from the clean 
bulk metal. We think that this difference in the 
stabilities of the end-on superoxide species on Cu, 
Ag, and Au surfaces is a reason of the observed 
difference in the catalytic activity of these metals 
for the epoxidation of ethylene. 

The above conclusion for the origin of the dif¬ 
ferent catalytic activity of Cu, Ag, and Au surfaces 
for the epoxidation of ethylene suggests a possibil¬ 
ity of making a new catalyst for this reaction. The 
key is .an appropriate lifetime of the end-on super¬ 
oxide species on the metal. Silver realizes it with¬ 
out any special modification. However, if one can 
realize such superoxide species on some metals 
using some modification, we think that the essen¬ 
tial factor is realized. 


H 

H 


C=C 


ethylene 


H 

H 



c=c: 


propylene 


H 

H 


causes such a drastic change of the reaction. It is 
interesting to understand why. To know the differ¬ 
ence of the oxidation mechanism between ethylene 
and propylene will lead to a better understanding 
of the general oxidation mechanism of other olefins 
on silver surface. Further, in the chemical industry, 
propylene oxide is an important material, so that 
to know why silver is so poor for the conversion of 
propylene to propylene oxide is a starting point for 
a design of a new effective catalyst. This section is 
a summary of our recent study on the oxidation 
mechanism of propylene [40]. 

Two possible reasons may be considered for the 
difference in the reaction of ethylene and propyl¬ 
ene on a silver surface. One possibility is that the 
reaction routes are the same but the methyl sub¬ 
stituent causes a large change in the barrier height 
and/or in the stability of the intermediates in the 
course of the reaction. Another possibility is that 
an entirely different reaction route comes out for 
the existence of the methyl group. We have inves¬ 
tigated these two possibilities. As in the case of 
ethylene, we study the reactions of the two active 
oxygen species on a silver surface: the molecularly 
adsorbed superoxide species and the dissociatively 
adsorbed atomic oxygen species. 

The calculational methods are the same as those 
discussed in the second section. The DAM [31-33] 
was used to include the effect of the bulk metal 
such as the electron transfer and the image force. 
The basis sets are the same as before, and the 
geometries of the reactants, intermediates, transi¬ 
tion states, and the products were optimized by 
the Hartree-Fock method and the energies were 
calculated by the MP2 method in order to include 
the effects of electron correlations. 


Oxidation Mechanism of Propylene on 
Silver Surface 

Though silver is an effective catalyst for the 
epoxidation of ethylene (85-87% conversion) [5-7], 
it is a very poor catalyst for the epoxidation of 
propylene (2-5% conversion) [12-16], A relatively 


ATTACK ON OLEFINIC CARBON 

We first study the reaction route, which is es¬ 
sentially the same as that of ethylene [38]: The 
reaction starts from the interaction between the 
adsorbed oxygen and the doubly bonded carbon 
atom of propylene. Figure 8 is an illustration of the 
adcluster for the initial interaction of propylene 
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(a) (b) 


FIGURE 8. Model adclusters for the olefinic carbon 
attack mechanism: (a) on the terminal olefinic carbon, 
(b) on the central olefinic carbon. 


with the superoxide species. Since two carbons in 
propylene are different, two different attacks were 
considered. The interaction with atomic oxygen 
was similarly considered. The reaction may pro¬ 
cess as follows: 



C=C 


H 

H 


+ 0 2 or O 





In (3), aldehyde is a product of the terminal C 
attack and ketone is a product of the central C 
attack, and they are further oxidized to C0 2 and 

h 2 o. 

Figure 9 shows the optimized geometries and 
the energy diagrams for the reaction of the molec- 
ularly adsorbed oxygen with the terminal C of 
propylene. It is similar to Figure 2 for the reaction 
with ethylene: The substitution of one hydrogen 
with methyl group does not cause a large change 
for the reaction route and the energetics. In this 
reaction mechanism, the route going to propylene 
oxide should be quite smooth without any high- 
energy barrier and without any too stable interme¬ 
diate, while the route leading to the aldehyde 
should be essentially blocked up by the existence 
of a high-energy barrier in the hydrogen migration 
step. The diagram for the reaction of the superox¬ 
ide with the central carbon atom of propylene [Fig. 
8(b)] was similar to Figure 9 [40]. Thus, if the 
reaction proceeds along this reaction route, pro¬ 
pylene oxide should be produced in high selectiv¬ 
ity due to the catalytic activity of the superoxide 



FIGURE 9. Energy diagram for the reaction between the terminal carbon atom of propylene and the molecularly 
adsorbed superoxide on a Ag surface. The route leading to propylene oxide and aldehyde are shown by the solid and 
broken lines, respectively. 
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species adsorbed on silver. Also, for the reaction 
with atomically adsorbed oxygen, a similar conclu¬ 
sion as that of ethylene may be obtained as the 
reaction route and the energy diagrams of propy¬ 
lene are similar to ethylene [38]. 

The results shown above mean that if the reac¬ 
tion proceeds through the oxygen attack on the 
olefinic carbon of propylene, silver should be as 
good a catalyst as for ethylene for the epoxidation. 
However, this result contradicts with the experi¬ 
mental observation that the conversion of propyl¬ 
ene into propylene oxide is only 2-5% on the 
silver catalyst [12-17]. We therefore examine an¬ 
other possibility that a reaction route entirely dif¬ 
ferent from those of ethylene may exist. Since 
propylene has a methyl group, but ethylene does 
not, we next examine the reaction route involving 
the reaction of this methyl group with the reactive 
oxygen on a silver surface. 

ATTACK ON ALLYLIC HYDROGEN 

Propylene is different from ethylene by an exis¬ 
tence of a methyl group. We examine here the 
reaction route involving the reaction of this methyl 
group with the superoxide and the atomic oxygen 
on a silver surface. Figure 10 shows the initial 
attack of the superoxide [Fig. 10(a)] and atomic 
oxygen [Fig. 10(b)] on the methyl hydrogen, and 
the reactions process as follows: 

ch 3 H 

> = < + 0 2 /Ag 

H H 

^CH 2 —CH—CH 2 /Ag + H—O—O/Ag 

(4) 

-> CH 2 —CH —CH 2 /Ag + O/Ag + HO/Ag 

(5) 


CH 3 ^ 

_H 



/X =c \ 

+ O/Ag 


H 

H 



—* CH 2 —CH- 

-CH 2 /Ag + HO/Ag 

(6) 


These reactions produce the allyl intermediate ad¬ 
sorbed on the silver surface, which is oxidized 
further to C0 2 and H 2 0. Since allyl radical or 
anion is a rather stable intermediate, this reaction 
route is feasible. 

Figure 11 shows the optimized reaction route 
for the attack of superoxide on the methyl hydro¬ 
gen of propylene as given by (4) and (5). The 
present result was added to the previous one for 
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FIGURE 10. Model adclusters for the allylic H attack 
mechanism: (a) reaction with molecularly adsorbed 
superoxide species, (b) reaction with atomically 
adsorbed oxygen. 


the carbon attack given in Figure 9. The optimized 
geometries of the intermediates and the transition 
states for the present H attack are given in the 
lower side, while those for the previous C attack 
are given on the upper side. The energy levels of 
the intermediates and the transition states for the 
present reaction route are lower than those of the 
previous one. The barrier for the attack on the 
methyl hydrogen was calculated to be 10.8 
kcal/mol, which is lower than the barrier for the 
carbon attack, 13.7 kcal/mol, and then the system 
becomes allyl intermediate and the hydroperoxy 
group adsorbed on silver [Eq. (4)]. This intermedi¬ 
ate is also more stable than the previous one, but 
the hydroperoxy group is unstable and further 
converted into OH and O as given by Eq. (5): The 
barrier in this step is only 2.8 kcal/mol and the 
stabilization energy is as large as 42.6 kcal/mol. 
When two-site interaction between allyl and silver 
surface is further allowed, the system is further 
stabilized by 18.1 kcal/mol. Thus, the reaction 
route starting from the allylic H attack is always 
lower than that starting from the olefinic C attack, 
so that the reactions represented by Eqs. (4) and (5) 
should proceed more easily than the reactions 
given by Eqs. (2) and (3). 

We next examine the similar reaction given by 
Eq. (6) involving the atomic oxygen adsorbed on 
silver. Figure 12 shows the optimized reaction 
route and its energetics. The lower figures give the 
optimized geometries for the allylic H attack and 
the upper ones for the olefinic C attack. The barrier 
for the allylic H attack is essentially zero: only 0.1 
kcal/mol. The allyl intermediate is more stable 
than the intermediate of the previous C attack: 
when two-site interaction between allyl and silver 
surface is allowed, it is further stabilized by 18.1 
kcal/mol. Thus, again, for the atomic oxygen on 
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FIGURE 11 . Comparison of the energy diagrams for the reactions between propylene and the molecularly adsorbed 
oxygen. The route for the allylic H attack is shown by the solid line, and those for the carbon attack are shown by the 
broken and waved lines. 



FIGURE 12. Comparison of the energy diagrams for the two reactions between propylene and the atomically 
adsorbed oxygen on a Ag surface. 
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the silver surface, the allylic H attack is an easier 
reaction than the olefinic C attack. The allyl inter¬ 
mediate thus produced is stable on the silver sur¬ 
face so that it will be further attacked by the 
oxygen species on the surface and would finally be 
converted into C0 2 and H 2 0. 

Thus, for both superoxide and atomic oxygen 
on a silver surface, the reaction starting from the 
allylic H attack proceeds more easily than the one 
starting from the olefinic C attack. Therefore, the 
conversion to propylene oxide does not effectively 
occur on a silver surface. 

The origin of the activation of this allyl hydro¬ 
gen is due to the stability of the preceding ad¬ 
sorbed allyl intermediate. We have investigated 
the nature of the allyl intermediate adsorbed on a 
silver surface. Figure 13 shows the optimized ge¬ 
ometry of the allyl intermediate on Ag 2 calculated 
with n = 0 and n = 1. We see that the adsorption 
energy is larger with n = 1 than with n = 0, indi¬ 
cating that the allyl species on silver is actually an 
allyl anion species. Actually, the charge on the 
allyl group was -0.52 for n = 1, while it was 
— 0.27 in the case of n — 0. The existence of the 
anion allyl species on silver surface had been sug¬ 
gested experimentally by Madix et al. [23] to be 
the intermediate leading to the complete oxidation. 

BRIEF SUMMARY 

Figure 14 is a summary of the reaction mecha¬ 
nism of propylene on a silver surface. The allylic 
hydrogen is more reactive than the olefinic carbon. 


due to the stability of the allyl anion species on the 
silver surface, so that propylene is converted to 
allyl intermediate and further converted to C0 2 
and H 2 0. If an olefin does not have an allylic 
hydrogen, like ethylene, styrene, etc., it is con¬ 
verted to epoxide in high selectivity. Even if it has 
an allylic hydrogen but the allyl intermediate is 
not stable like in norbornene, the route leading to 
epoxide is more preferable. This explains the selec¬ 
tivity of the silver surface for the epoxidation reac¬ 
tions of olefins. 

We note that the present results clearly show 
that the route leading to propylene oxide also 
exists on a silver surface. However, it is not the 
most favorable reaction path. The selectivity is just 
between 2 and 5% experimentally. Since propylene 
oxide is an important material in chemical indus¬ 
try, it is interesting whether we can block up the 
route leading to the allylic intermediate. 


Concluding Remarks 

We reported, in this study, the mechanisms of 
the epoxidation and complete oxidation reaction of 
propylene on a silver surface and the stabilities 
and activities of the oxygen species adsorbed on 
Cu, Ag, and Au surfaces. We performed the (U)HF, 
MP2, and SAC/SAC-CI calculations with the use 
of the DAM which involved the interaction be¬ 
tween bulk metal and admolecules with considera¬ 
tion of electron transfer and image force correction. 



FIGURE 13. Stability of the allyl species adsorbed on Ag 2 cluster in the neutral {n = 0) and anion (n = 1) adcluster 
states. 
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FIGURE 14. Mechanism of the complete oxidation of propylene over a silver surface; (a) with molecularly adsorbed 
oxygen, (b) with atomically adsorbed oxygen. 


For the epoxidation of ethylene, the active 
species is the superoxide, which is molecularly 
adsorbed on the surface in the bent end-on geome¬ 
try. Ethylene attacks the terminal oxygen atom of 
the superoxide leading to ethylene oxide quite 
smoothly. The importance of the silver surface is 
quite evident. On the other hand, the atomically 
adsorbed oxygen species is not selective: Both 
epoxidation and complete oxidation products 
would be obtained. However, to promote the se¬ 
lectivity of the atomically adsorbed species will 
certainly promote the total selectivity to epoxida¬ 
tion product. In the process leading to ethylene 
oxide, the electron back-transfer from the reaction 
adsorbate complex to the metal surface should be 
important. 

The origin of the catalytic activity of the silver 
surface is due to its ability to stabilize the superox¬ 
ide species. On Cu surfaces, peroxide is much 
more stable than superoxide, so that the lifetime of 
the superoxide species should be too short to react 
with ethylene. On Au surfaces, the superoxide 
species does not exist because the molecular ad¬ 
sorption of 0 2 does not occur due to the difficulty 
of the electron transfer from the clean bulk metal. 

For the oxidation of propylene on silver surface, 
two competitive mechanisms exist. The dominant 
mechanism for the epoxidation of olefins is initi¬ 
ated by the activation of the olefinic carbon. The 
dominant mechanism for the complete oxidation 
of propylene is due to the activation of the allylic 
C—H bond. The latter is more favorable due to 
the formation of the stable adsorbed allyl interme¬ 


diate, which is the motivation force for the com¬ 
plete oxidation of propylene. 

We note that the proposed two competitive 
mechanisms not only reflect the difference be¬ 
tween ethylene and propylene but also explain 
rather well the available experimental facts for the 
epoxidation and complete oxidation of olefins. 
Therefore, it reflects a general mechanism to un¬ 
derstand the oxidation of olefins over silver sur¬ 
face. 
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ABSTRACT: We have developed procedures for determining the potential, electron 
density, and current at a planar metal surface with an applied static field in vacuum. 
These calculations are made using density-functional theory within the local density 
approximation (LDA) for the Kohn-Sham exchange and correlation. Several different 
techniques were compared including the use of different expressions for the exchange 
and correlation energies. The steady-state response of field emission to a laser is found 
using Floquet methods to solve the dipole approximation of the time-dependent 
Schrodinger equation with the static potential obtained from density-functional theory. 
These simulations show that there is a resonance in the response that causes a significant 
increase in the tunneling current. This resonance occurs for electrons that are promoted 
above the barrier by absorbing quanta from the laser when the line integral of the 
momentum between the turning points is equal to h /2. © 1997 John Wiley & Sons, Inc. Int 

J Quant Chem 65: 857-865, 1997 


Introduction 

P hotofield emission [1, 2], the study of the 
energy distribution of electrons emitted from 
a metal surface subjected to an intense static elec¬ 
tric field plus a weaker radiation field, is used to 
characterize the surface density of states in metals. 

Lee and Robbins [3] measured the time depen¬ 
dence of the total current from a field emitter 

International Journal of Quantum Chemistry, Vol. 65, 857-865 (1997) 
© 1997 John Wiley & Sons, Inc. 


when the laser beam was chopped with an elec¬ 
tromechanical shutter. They found that under con¬ 
ditions that are typical of the photofield studies, 
the rise time of the current is approximately 1 ms. 
They showed by analysis that the thermal relax¬ 
ation time is within one order of magnitude of the 
rise time, which they measured. Thus, Lee and 
Robbins concluded that the laser-induced increase 
in the current is primarily the result of tip heating, 
and this raises doubt regarding much of the pub¬ 
lished photofield data. However, few references 
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have been made to this study by Lee and Robbins, 
and thermal effects are not mentioned in many of 
the subsequent photofield studies. 

It is the purpose of this study to use numerical 
simulations based on the local density approxima¬ 
tion of Kohn-Sham density-functional theory to 
examine the nonthermal effects of laser-assisted 
field emission in order to clarify the mechanism 
for photofield emission. A number of different 
methods have been used to model laser-assisted 
field emission, but most of these are one-particle 
solutions [4-6] in which the time-dependent 
Schrodinger equation is solved for a specified ap¬ 
proximation of the potential, and all of the effects 
of electron-electron interactions are neglected. A 
many-particle approach [7, 8] that is based on 
nonequilibrium Green's functions [9] shows con¬ 
siderable promise, but we chose to use density- 
functional theory [10, 11] in the present study 
because we consider this procedure to be better 
understood due to a longer and more extensive 
history of application. 


Analysis 

The density-functional formalism, which was 
introduced by Hohenberg, Kohn, and Sham [12, 
13], is a many-particle approach in which the prop¬ 
erties of a system (such as a metal surface) are 
determined as functions of only the electron den¬ 
sity n(x). We use a jellium model in which the 
positive ions within the metal half-space (x < 0) 
are replaced by a uniform positive charge back¬ 
ground with density n + (x). The many-particle 
problem of determining the potential and electron 
density at a metal surface may be solved through 
density-functional theory by finding a consistent 
solution for the following set of four equations: 


— h 1 

— j^ + (4>(x) + V xc (n))^^E 1 4f 1 , (1) 

n(x) = Yj l<M*)| 2 , (2) 

i=i 

£? x 

= — / [n + (x') - n(x')](x'- x) dx', (3) 



-e 2 n^ 

4tts 0 


0.985 + 


0.946 + 8.92 a 0 n 1/3 
[l + 12.6« 0 n 1/3 ] 2 

(4) 


Here, the one-particle Schrodinger equation (1) 
is solved to determine the wave functions for a 
series of energies E, representing the distribution 
of normal energies in the metal. The electron den¬ 
sity n(x) is determined by normalizing the sum¬ 
mation of the probability densities in Eq. (2). The 
Coulomb portion of the effective potential re¬ 
quired in Eq. (1) is determined by integration of 
Poisson's equation as shown in Eq. (3), in which it 
is assumed that the Coulomb potential and electric 
field are zero deep within the metal. A number of 
different methods have been used to approximate 
the exchange-correlation contribution to the effec¬ 
tive potential [14-16]. In most calculations we have 
used the Wigner interpolation form as shown in 
Eq. (4), which was derived using the local density 
approximation (LDA), but we have also tested an 
approximation by Gell-Mann and Brueckner [16] 
which is more appropriate for high electron densi¬ 
ties. More exact calculations have shown [17] that, 
for an electron at a jellium edge, the Fermi hole 
calculated within the LDA approximates the exact 
hole. Thus, the corresponding LDA force field at 
the surface is fairly accurate; thus it is reasonable 
to use this approximation in simulating field emis¬ 
sion. Conventional mks units are used in place of 
atomic units throughout this work, so that Eq. (4) 
contains the symbol a 0 denoting the Bohr radius, 
which is approximately 0.052917 nm. 


Solutions for Static Problems 

In static problems we solve the system of Eqs. 
(1)—(4) by means of iteration. We begin by approx¬ 
imating the effective potential that is required in 
Eq. (1) by using either the Fowler-Nordheim 
model [18] or a closed-form approximation of a 
variational solution [19]. The numerically efficient 
method of back-propagation [20] is used to solve 
the Schrodinger equation without forming a ma¬ 
trix. The solutions are obtained for a large number 
of energies Ej (for 7=1 to N) to simulate the 
distribution of normal energies which was derived 
by Modinos [21]. 

We have used two different methods to intro¬ 
duce an applied static field into the formulation of 
Eqs. (l)-(4) in order to simulate field emission 
from a metal surface, and we have found that 
these two methods give identical results. In the 
first approach the normalization of the electron 
density in Eq. (2) is adjusted so that there is a net 
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charge within the descretized region, and the mag¬ 
nitude of this charge will produce the specified 
value of the applied static field. In this method the 
applied static field does not appear explicitly in 
the calculations. In the second approach the nor¬ 
malization of the electron density in Eq. (2) is 
adjusted so that the descretized region is neutral, 
and the applied static field is specified throughout 
the entire space including the metal. All of the 
examples in this article are for tungsten metal. We 
have assumed that tungsten has a density of 19.30 
g/cm 3 and that there are 6 free electrons per atom. 
Thus, we assume a Fermi energy of 19.1 eV, a bulk 
electronic density of 3.79 X 10 29 /mtr 3 (0.0562 in 
atomic units), and a Wigner-Seitz radius of 0.0857 
ran (1.62 bohrs). 

Figures 1 and 2 show the potential barrier and 
the electron density at a tungsten surface with 
applied static fields of 5 and 2 V/nm, respectively. 
In both of these figures the dashed curves repre¬ 
sent the potential barrier for the Fowler-Nordheim 
model [18], which requires specifying the work 
function (4.5 eV) and includes a correction from 
image theory. The results of our iterative calcula¬ 
tions are shown as solid curves for the potential 
barrier, and dotted curves for the electron density. 
The iterative solutions for these two figures re¬ 


quired under relaxation for convergence, so that at 
each step in the iterations the potential was up¬ 
dated by setting V I+1 (x) — aV/x) + (1 - 
a)Vj„i(x). A relaxation factor (a) of 0.001 was 
used in the iterations with a total of 200 energies. 
A total of 1000 intervals was used on the x axis to 
descretize a length of 2 nm of metal and 6.5 nm of 
vacuum with an increment of A x = 0.0085 nm. 

The iterative static density-functional solutions 
for the potential that are shown in Figures 1 and 2 
converge with consistent results as the depth of 
the metal that is discretized is varied from 2 to 8 
nm, the number of electron energies is changed 
from 50 to 1000, and the number of intervals is 
varied from 200 to 1500. The upper values given 
for these parameters are practical limits for calcu¬ 
lations with a personal computer. The length of 
vacuum that is descretized must be increased for 
very low values of the applied static field so that 
the potential at the far end of the grid is well 
below the Fermi level. There was no significant 
change when the calculations for Figures 1 and 2 
were repeated using the approximation by Gell- 
Mann and Brueckner [16] in place of the Wigner 
interpolation form for the exchange-correlation 
contribution to the effective potential as shown in 
Eq. (4). Figures 1 and 2 show that both the height 
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FIGURE 1 . Potential barrier and electron density at a tungsten surface with a static field of 5 V/nm. The potential and 
electron density are represented by a solid curve and a dotted curve, respectively. The dashed curve is the potential for 
the Fowler-Nordheim model. 
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FIGURE 2. Potential barrier and electron density at a tungsten surface with a static field of 2 V/nm. The potential and 
electron density are represented by a solid curve and a dotted curve, respectively. The dashed curve is the potential for 
the Fowler-Nordheim model. 


of the barrier and the length of the tunneling 
region are decreased, thus increasing the current, 
as the applied static field is increased. 

Figure 3 shows the maximum height of the 
potential barrier at a tungsten surface as a function 
of the applied static field. Values obtained using 


the Fowler-Nordheim model are shown with a 
solid curve, and the results of our iterative calcula¬ 
tions are shown as solid squares connected with a 
dashed curve. As in the first two figures, the 
iterative calculations were made using a relaxation 
factor of 0.001, a total of 200 energies, and a depth 



FIGURE 3. Barrier height at a tungsten surface as a function of the applied static field. Iterative solutions are shown as 
solid squares connected by a dashed curve, and the solid curve is for the Fowler-Nordheim model. 
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of 2 nm of metal. The results in Figure 3 appear to 
show that there is fairly good agreement between 
density-functional theory and the Fowler-Nord- 
heim model. However, it may be seen in Figures 1 
and 2 that while the maximum height of the bar¬ 
rier may be in good agreement for the two models, 
the peak is moved closer to the surface and the 
length of the tunneling region is increased with the 
Fowler-Nordheim model. Thus, we conclude that 
the value of the tunneling current would be under¬ 
estimated with the Fowler-Nordheim model. 


Steady-State Response of Field 
Emission to a Laser 

We begin by replacing Eq. (1) with the dipole 
approximation [ 22 ] for the one-dimensional time- 
dependent Schrodinger equation for a static poten¬ 
tial plus a radiation field: 

-h 2 d 2 % 

—- 7 - + [ V Q + exEr ) cos((ot)] x i r ] = ih -, 

2m dx 2 0 0 1 dt 

(5) 

where the radiation is polarized in the axial direc¬ 
tion, and the static potential V 0 and the magnitude 
of the electric vector of the optical field E 0 may be 
functions of x. Floquet's theorem [23] may be used 
to write the solution of Eq. (5) in the following 
form: 

00 

% = £ (A, N exp[-z(E, + ( 6 ) 

— 00 

where the ifj I N are functions of x that must be 
determined. We have shown previously [ 6 ] that 
consistency of Eqs. (5) and ( 6 ) for arbitrary values 
of time requires that the following ordinary differ¬ 
ential equation is satisfied: 

— h 2 d V/ N 

T 2"[ 0/ / N—l T ^IN+l] X Eq = (7) 

We have used several different methods [24] to 
solve Eq. (5). In the first procedure, which is the 
one used for the examples in this study, shooting 
methods [25] are used to solve the system of si¬ 
multaneous ordinary differential equations result¬ 
ing from Eq. (7). We require that each wave func¬ 
tion is consistent with a single plane wave 


having the appropriate energy incident upon the 
barrier, but a full spectrum of energies is required 
for the reflected and transmitted waves. Recursion 
is used with finite difference approximations to 
march through the barrier, thus determining the 
wave function at the far side of the barrier as 
functions of the known incident wave and the 
unknown reflected waves at the near side. Consis¬ 
tency of the transmitted wave function with only 
outgoing transmitted waves is enforced which re¬ 
sults in a numerical solution for the wave func¬ 
tions at all points in space. This is an extension of 
the numerically efficient method of back-propa¬ 
gation [20], which we use to solve the Schrodinger 
equation for static problems. 

It is possible to replace Eq. (2) with a double 
summation over the indices I and N to determine 
the electron density as a function of position and 
time, and to replace Eq. (3) with an expression for 
the time-dependent scalar potential. However, it is 
not clear as to what should be used in place of Eq. 
(4) to determine the time-dependent exchange-cor¬ 
relation contribution to the effective potential. Sev¬ 
eral different formulations have been presented to 
extend the static density-functional theory to a 
time-dependent formulation [26-30], but we have 
not yet implemented any of them. In particular, 
we are concerned that the random phase approxi¬ 
mation (RPA) may not be valid at optical frequen¬ 
cies. Others have used an adiabatic approximation 
[31, 32] in which they assume that the dynamic 
solution is equivalent to a sequence of static solu¬ 
tions that are obtained for various times during a 
cycle of the optical signal. The adiabatic approxi¬ 
mation has been used to determine optical second 
harmonic generation with metals. However, we 
feel that it is difficult to justify this approach, 
except in the far infrared spectrum, because it 
neglects all effects of the exchange of photons with 
the optical source, a mechanism which is evident 
in Eqs. ( 6 ) and (7). 

As a first approach to the time-dependent prob¬ 
lem, we have simulated the steady-state response 
of field emission to a laser by assuming that the 
time-averaged response by the electrons is the 
same as for the static problem, and the effects of 
the time-dependent response are included in the 
motion of the plasma which causes screening of 
the radiation field. Dominance of the static re¬ 
sponse may be seen in that even with a laser 
power flux density of 10 10 W/m 2 , with a typical 
applied static field of 5 V/nm, the optical electric 
field is 0.055 percent of the static field. Thus, we 
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use density-functional theory to determine the ef¬ 
fective potential considering only the static field, 
and then solve Eq. (5) allowing for the exponential 
decay of the optical field caused by screening 
within the metal. Others have used hydrodynamic 
models to simulate the interaction of an intense 
laser pulse with a metal, and they have used the 
Drude approximation to determine the frequency 
dependence of the complex conductivity in order 
to estimate the penetration of optical fields into the 
metal [33]. We also use the Drude model [34], and 
thus we assume that the depth into the metal at 
which the optical fields decay by 1 /e is given by 

8 = 7 = ==, ( 8 ) 

) 

where co , the plasma frequency, corresponds to a 
wavelength of 54.3 nm for tungsten. For example, 
the depth of penetration in tungsten is 8.65, 8.66, 
and 10.3 nm at wavelengths of 10 /xm, 1/xm, and 
100 nm, respectively. We have found that the re¬ 
sults of our calculations are independent of the 
depth of metal that is descretized when this depth 
is 80 nm or greater, so a depth of 100 nm was used 
in each of the following examples which simulate 
the steady-state response of field emission to a 
laser. 

Figure 4 shows the ratio of the tunneling cur¬ 
rent with a laser to that with no laser as a function 


of the optical wavelength for electrons with nor¬ 
mal energies of 0.95, 0.96, 0.97, 0.98, 0.99, and 1.00 
(from top to base of the figure) of the Fermi level. 
The calculations were made for a tungsten surface 
with an applied static field of 5 V/nm and a laser 
power flux density of 10 10 W/m 2 . 

A resonance in which the current is significantly 
increased by the laser may be seen in each curve of 
Figure 4. The mechanism for this resonance is 
reinforcement of the wave function by reflections 
at the turning points. Thus, the resonance occurs 
for electrons that are promoted above the barrier 
by absorbing quanta from the optical field when 
the line integral of the momentum between the 
turning points is equal to h/2, where h is Planck's 
constant. It may be seen in Figure 3 that with an 
applied static field of 5 V/nm a single photon at 
730 nm has sufficient energy to promote an elec¬ 
tron at the Fermi level to an energy above the 
potential barrier. For the special case of a square 
barrier the resonance requires that the barrier 
length is an integral multiple of one-half the De 
Broglie wavelength for electrons that are promoted 
above the barrier by absorbing single photons [35]. 
This relation of the barrier length to the De Broglie 
wavelength is also the criterion for the transmis¬ 
sion coefficient to be maximum for the static prob¬ 
lem in which the energy of a particle is above a 
square barrier [36]. For square barriers we find 



FIGURE 4. Current ratio with / without a laser vs. optical wavelength for electrons with normal energies of 0.95, 0.96, 
0.97, 0.98, 0.99, and 1.00 (top to base) s F in tungsten with a static field of 5 V/nm and an optical flux of 10 10 W/m 2 . 
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that the resonance has multiple sharp peaks [35] 
instead of the single broad peak that is seen in 
each curve of Figure 5. These differences are con¬ 
sistent with results for the static problem [36] and 
are attributed to the decreased definition of the 
turning points when there are no discontinuities in 
the potential [37]. 

Figure 5 shows the ratio of the tunneling cur¬ 
rent with a laser to that with no laser as a function 
of the optical wavelength allowing for the full 
distribution of energies in the metal at tempera¬ 
tures of — 273, 200, 1000, 1500, and 2000°C, for the 
curves from the top to the base of the figure, 
respectively. The calculations were made for a 
tungsten surface with an applied static field of 5 
V/nm and a laser power flux density of 10 10 
W/m 2 . The resonance is more prominent at lower 
temperatures, and this may be understood because 
of the more narrow distribution of energies. We 
have repeated the calculation for 200°C using the 
Fowler-Nordheim model for the potential barrier. 
The results are similar, but the static current den¬ 
sity is reduced by 22%, and the resonance has a 
current enhancement of 0.227% at a wavelength of 
337 nm instead of 0.242% at 416 run. Each of these 
changes may be attributed to the somewhat greater 
length and height of the barrier with the Fowler- 
Nordheim model. 

Figure 6 shows the ratio of the tunneling cur¬ 
rent with a laser to that with no laser as a function 


of the optical wavelength for a tungsten surface at 
200°C with a laser power flux density of 10 10 
W/m 2 . The value of the applied static field is 4.1, 
4.3, 4.5, 4.7, 5.0, and 5.5 V/nm, for the curves from 
the top to the base of the figure, respectively. This 
figure shows that as the applied static field is 
reduced, the resonance shifts to a shorter wave¬ 
length, but there is also an increased response at 
longer wavelengths. We find that in the limit as 
the applied static field is reduced to values much 
less than 4.0 V/nm the behavior approaches that 
of the photoelectric effect, that is, a sharp all-to- 
nothing transition in the transmission occurs at a 
single wavelength. This may be understood be¬ 
cause, for weak applied static fields, the height 
and length of the barrier are so large that the 
probability of tunneling is negligible so that trans¬ 
mission requires promotion above the barrier by 
the sustained absorption of a photon. 


Discussion 

Our results agree with experimental data from 
photofield studies in that the increase in the emit¬ 
ted current that is caused by a laser is proportional 
to the laser power flux density, maximum for 
polarization normal to the emitting metal surface, 
and from the electrons that absorb a single photon. 
However, it is generally assumed that the spectral 



FIGURE 5. Current ratio with/without a laser vs. optical wavelength for tungsten at -273, 200, 1000, 1500, and 
2000°C (top to base) with a static field of 5 V/nm and an optical flux of 10 10 W/m 2 . 
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FIGURE 6. Current ratio with / without a laser vs. optical wavelength for tungsten at 200°C with static fields of 4.1,4.3, 
4.5, 4.7, 5.0, and 5.5 V/nm (top to base) and an optical flux of 10 10 W/m 2 . 


response of photofield emission is rather simple; 
strongest for illumination near the blue end of the 
visible spectrum and virtually undetectable at 
longer wavelengths. By contrast, our simulations 
suggest that the spectral response is more com¬ 
plex, and that it is quite sensitive to the value of 
the applied static field. We believe that this differ¬ 
ence may be caused by (1) the focus of photofield 
studies being on electron energy distributions 
rather than the spectral response, and (2) the pres¬ 
ence of thermal effects in the data. Our results 
suggest that the nonthermal effects which we have 
simulated occur on a time scale on the order of 10 
fs so that these effects could be separated from 
thermal effects by using lasers that are amplitude 
modulated at radio frequencies in photofield stud¬ 
ies. 
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ABSTRACT: Structural parameters and energy have been calculated for C x _ 3 and C 9 
clusters using density functional theory through the deMon program. The C x _ 3 clusters 
were fully optimized using deMon; different basis sets were tested in order to choose the 
suitable one to be used in the C 9 clusters. In the case of C 2 the results were compared 
with experimental values. DZVP2 basis was selected because it always gave the closest 
value to the experimental data. The C 9 carbon clusters were designed in relation to 
C(100) diamond surface. The energies of the nonrelaxed and relaxed surfaces with and 
without hydrogen were calculated. In the same way CH 3 -relaxed and CH 2 -relaxed 
species were calculated. The diamond growth mechanism proposed in the literature was 
evaluated by an energy analysis. The C—CH 3 distance is reported. © 1997 John Wiley & 
Sons, Inc. Int J Quant Chem 65: 867-875, 1997 


Introduction 

D iamond surface has attracted a lot of atten¬ 
tion because of the technological applica¬ 
tions in thin films grown by chemical vapor depo¬ 
sition (CVD) techniques [1]. Many experimental 
and theoretical studies have been made over the 
last decades. Nowadays several ideas have been 
suggested about the characteristics of the diamond 
surface, the mechanism of diamond growth and 
the precursor agent of this chemisorption process. 

Correspondence to: E. Agacino Valdes. 


Although the first theoretical studies of dia¬ 
mond surfaces focussed in (111) surface, at the 
present, it is known that the (100) face is prevalent 
in diamond growth under conventional CVD con¬ 
ditions [2]. This orientation has the lowest density 
defect, making it more appropriate for electronic 
applications. 

Density functional theory (DFT) will be applied 
to the study of the energies involved in the relax¬ 
ation and chemisorption process of C(100) dia¬ 
mond surface to grow thin film; to do this, C 9 
carbon clusters resembling the C(100) diamond 
surface will be used. Before, small carbon clusters 
C x _3 were calculated using different basis sets in 
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order to select an adequate one to be used in C 9 
carbon clusters. 


Method and Methodology 

In this work, the code deMon based in density 
functional theory was used. This program imple¬ 
ments the linear combinations of Gaussian-type 
orbitals (LCGTO-DF) [3]. These methods have been 
described extensively. The calculations were made 
in local spin density approximation (LSDA). 

The geometries of the C 1-3 clusters and C 9 
(non-relaxed hydrogenated, relaxed, and relaxed 
hydrogenated) were fully optimized by deMon, 
but in other C 9 -carbon clusters only the adsorbed 
species (CH 3 and CH 2 ) were optimized. The start¬ 
ing geometry for the surface cluster was taken 
from other calculations involving surface models 
[4] and therefore it was kept frozen. 

For the small carbon clusters (C 1 . 2 ), four basis 
sets were tested: STO-3G(33/3), DZVP(621/ 
41/1*), DZVP2(721 /51/1*), and TZVP(7111/ 
411/1*). The results of the C 2 cluster were com¬ 
pared with experimental and theoretical data. The 
basis set which produced the best compromise 
between bond energy and bond distance was cho¬ 
sen as suitable to be used in C 9 carbon clusters. 

C ATOM 

It is known that a carbon atom has a ( 2s 2 2p 2 ) 
P ground state with two unpaired electrons. The 
excited state (2s 1 2p 3 ) is 5 S, with four unpaired 
electrons which explain the tetravalency of this 
element and all the chemistry of the hydrocarbons. 

Table I shows the calculated total energy of the 
singlet and triplet states. Notice that, in agreement 
with the experimental evidence, the triplet state is 
the most stable species, but the computed differ¬ 
ence between singlet and triplet states is only 
about 5 X 10 2 for all the basis considered. It is a 
very low value compared with experimental one. 
The DZVP2 basis together with TZVP and DZVP 
gave the closest energy value in relation to the 
experiments. 

C 2 MOLECULE 

Dicarbon molecule has eight valence electrons, 
and the molecular orbital theory establishes a 
KK(a g 2s) 2 2s) 2 (7r u 2p) 4 electronic configuration 


TABLE I_ 

Total energies for a carbon atom. 


Basis set 

E T (hartrees) 

Singlet 

Triplet 

STO-3G 

-36.89954 

-36.95530 

DZVP 

-37.41906 

-37.46365 

DZVP2 

-37.42278 

-37.46727 

TZVP 

— 

-37.46729 

[5] 

— 

-37.68 

[6] 

— 

-37.6886 


which corresponds to a ground state. A 

(7r u 2p) 3 (a g 2p) 1 external electronic configuration is 
the 3 n i( excited state. The spectrum of this 
molecule has been extensively studied and at the 
beginning it was found that the Mulliken bands 
(which arise from the 1 X g lower state) appeared at 
comparable intensity with the Swan bands (which 
arise from the 3 II U lower state). These results raise 
the question about which state is the ground state 
[7]. Today, Ballik and Ramsay's [8] observations 
have fixed 1 2^ at 0.075 eV below 3 II u , and l l< g is 
the ground state for only a small margin. 

Fougere and Nesbet [9] have studied the elec¬ 
tronic structure of this molecule over the range 
R e = 1.5fl 0 to 8.0 a 0 using different Slater-type or¬ 
bitals basis sets and two different levels of config¬ 
uration interaction. The best calculations obtained 
by Fougere and Nesbet were 1.2855 A and 6.76 eV 
for the state; for the triplet state the best bond 
energy result (5.45 eV) was not the best equilib¬ 
rium distance (1.3937 A) because another calcula- 

o 

tion gave 4.00 eV and 1.3261 A, respectively. As it 
can be seen, the x X g state is predicted as the 
ground state. At the present 6.25 ± 0.2 eV [10] and 
6.12 eV [11] are recognized as the best experimen¬ 
tal values of C 2 bond energies. 

Tables II and III show the total energies and the 
bond lengths computed for the triplet and the 
singlet state and the experimental results cited 
above. Other calculations are also tabulated. 

It is important to notice that the total energies 
calculated in both two states and 3 II W are 
very close; this is the reason for which the determi¬ 
nation of the ground state has been controversial 
[ 12 ]. 

The DZVP2 basis gives the best bond energy in 
comparison to experimental and theoretical re¬ 
sults. For the bond length the DZVP2 basis is not 
the best, in this case TZVP basis is better, but since 
the differences in bond lengths are of the order of 


868 


VOL. 65, NO. 5 




THEORETICAL CALCULATIONS OF CARBON CLUSTERS 


TABLE II 


Total energies for a state in dicarbon molecule. 

Basis set 

E t (hartrees) 

r 0 (A) 

D 0 (eV) 

STO-3G 

-74.21381 

1.2637 

8.25 

DZVP 

-75.18474 

1.2734 

7.01 

DZVP2 

75.18873 

1.2786 

6.92 

TZVP 

75.19739 

1.2518 

7.15 

f9] 

— 

1.2855 

6.76 

[10] 

— 

1.243 

6.25 ± 0.2 

[11] 

— 

1.242 

6.12 

[13] 

— 

1.2422 

6.36 

[14] 

75.531 

1.252 

6.24 


2%, this parameter is not as important as the 
energy, and DZVP2 basis is the most suitable to be 
used. These results are in agreement with consid¬ 
erations about a best description C 2 ground state 
must use a large configuration mixing in its ground 
state, therefore, the results can be improved using 
an extended basis [15]. 

C 3 MOLECULE 

The C 3 molecule is just barely linear as is in¬ 
dicated by its very small bending frequency 
(63 cm -1 ). Although Douglas [16] has established 
that C 3 is linear, others say that this molecule 
might be slightly nonlinear [17]. Liskow et al. [18] 
have made self-consistent field (SCF) and configu¬ 
ration mixing calculations and found that a reason¬ 
ably large basis set, including d functions, is nec¬ 
essary to account the small bending frequency, but 
the configuration mixing procedure is not needed; 
they computed the orbital energies as a function of 
the angle at ~ 1.27 A of distance and found that 
the larger basis predicts C 3 to be linear, in agree¬ 
ment the experiment. Peric Radic [19] also studied 
the ground state and the first two excited states of 
C 3 which are experimentally known; they used a 


TABLE III 


Total energies for a 3 n u state in dicarbon molecule. 

Basis set 

E T (hartrees) 

r 0 (A) 

D 0 (eV) 

STO-3G 

-74.21701 

1.3371 

8.34 

DZVP 

-75.20247 

1.3197 

7.49 

DZVP2 

75.20554 

1.3235 

7.37 

TZVP 

75.21277 

1.3081 

7.57 

[9] 

— 

1.3937 

5.45 

[10] 

— 

— 

6.17 

[14] 

75.551 

1.318 

6.16 


TABLE IV_ 

Total energies for a lineal tricarbon molecule. 


Basis set 

E T (hartrees) 

r 0 (A) 

D 0 (eV) 

STO-3G 

-111.51367 

1.334 

17.67 

DZVP 

-112.98469 

1.309 

16.19 

DZVP2 

-112.98888 

1.3132 

16.01 

TZVP 

-113.00586 

1.2962 

16.47 

[18] 

-113.38082 

1.328 

— 

[20] 

— 

— 

13.97 


multiconfiguration SCF, plus extensive configura¬ 
tion mixing calculation and their results were in 
agreement with experiment. Both authors found 
the state is the ground state. 

Different geometrical possibilities were calcu¬ 
lated for the singlet and triplet state: linear, non¬ 
linear, and a triangle (cycle). These last two ge¬ 
ometries, when fully optimized transform to the 
linear geometry. Table IV presents these results. 
The singlet was always the predicted ground state. 
The 1 Xg state is the ground state, in agreement 
with the experimental evidences. As in the case of 
the dicarbon molecule, DZVP2 was the best in 
bond energies, but for the bond length TZVP gave 
the best result. 

Since the bond distance differences are small, 
but not the energy differences, the DZVP2 basis, 
which gave the bond energy closest to the experi¬ 
mental data, was used to model the C 9 carbon 
clusters. 

C 9 CARBON CLUSTERS 

Figure 1(a) shows a bcc unit cell (a = 3.567 A) 
of diamond; notice that the C(100) surface has 
successive parallel chains in (110) direction. When 
the surface is formed, the lack of saturation of the 
surface carbons induce a relaxation process. Hamza 
et al. [21] found that raising the temperature to 
1300 K, C(100) surface produces a LEED pattern 
made up of two domains with 2x1 symmetry; 
forming a tt symmetric dimer bond between two 
surface carbon atoms of two consecutive chains 
[Figure 1(b)] [22]. Atomic force microscopy (AFM) 
[23] and scanning tunneling microscopy [4] has 
also evidenced the C(100) 2 X 1 surface recon¬ 
struction. It is the most accepted geometry, at least 
in experimental conditions of diamond growth. 

The structure of a hydrogenated surface has 
been under discussion for many years; the mono- 
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(c) 


FIGURE 1. (a) Unitary cell of diamond (Ref. [1]); 

(b) relaxed surface C(100) 2 x 1; (c) monohydrogenated 
surface diamond C(100) 2x1. 


hydrogenated, dihydrogenated, and alternating 
structures have been evaluated to establish the 
most stable one. Thomas et al. [24] report that 
atomic hydrogen doses do not remove the 2X1 
LEED pattern even after doses equivalent to 100 
monolayers of hydrogen. Using HREELS and LEED 
techniques, Thomas et al. [24] reported the mono¬ 
hydrogenated dimer row surface as the most sta¬ 
ble. No evidence was observed for other species 
even after a large atomic hydrogen exposure. 

The symmetric dimer with monohydride termi¬ 
nation [Figure 1(c)] is the most likely structure. 
Therefore, the dihydrogen and alternating configu¬ 
rations were not considered. This is consistent with 
theoretical calculations [25]. 

Several authors have used the cluster approach 
to study the chemisorption process involved in 
C(100) diamond growth. It is based in the gener¬ 


ally accepted idea that the chemisorption process 
is a localized phenomenon of orbital interaction. 
C 9 is the best carbon cluster for studying the 
reconstructed C(100) diamond surface. 

One of the first C 9 carbon cluster calculations 
was made by Verwoerd [26]. He studied the recon¬ 
struction of both the clean Si(100) and C(100) sur¬ 
faces and the process of hydrogen chemisorption 
onto these two surfaces by a semiempirical method 
and using a C 9 carbon cluster [27]. Starting from 
the ideal parameters, Verwoerd made a whole 
optimization using a semiempirical MNDO 
method. For clean and monohydrogenated recon¬ 
structed C(100) surfaces, he predicted a symmetric 
dimer; the C-C distances calculated for these 

o 

dimers were 1.43 and 1.67 A, respectively. The 
dihydride structure was predicted as unstable 
(C-C distance in the dimer of 0.74 A), because of 
the repulsive interaction between the electron den¬ 
sities of the two hydrogens. Taking a C 9 carbon 
cluster model similar to Verwoerd's, we calculated 
the relaxed and nonrelaxed surfaces, with and 
without chemisorbed hydrogens and with CH 2 
and CH 3 adsorbed. Verwoerd reported 3.09 eV for 
relaxation energy and 4.39 eV for binding energy 
of hydrogen. Mehandru and Anderson [25b] ob¬ 
tained 2.52 and 4.39 eV, respectively. They use 
semiempirical methods, but Mehandru and An¬ 
derson use a cluster larger than Verwoerd. 

An interesting work that combined STM images 
and theoretical calculations of molecular dynamics 
in a semiempirical DFT was made by Frauenheim 
et al. [4]. They used a supercell of 126 carbon 
atoms for C(001) surface and reported a symmetric 
dimer with a distance of 1.41 A for a clean surface 
and 1.50 A for a monohydrogenated surface. Other 
details of the structural parameters of Frauenheim 
et al. [4] are presented in Table V. We took their 
relaxed structural parameters. 


TABLE V_ 

Relaxed structural parameters of C 9 carbon 
clusters (Ref. [4]). 


Surf, struct. 

Bond lengths (A) 

Bond angles (deg) 

C(100) 2 x 1 

(1-1) 1.41 

1-1-2 = 109 


(1-2) 1.50 

2-1-2 = 113.7 

C(100) 2 x 1:H 

(1-2) 1.07 

1-2-2 = 112 


(2-2) 1.58 

1- 2-3 = 113 

2- 2-3 = 104 
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The C 9 carbon cluster was constructed consider¬ 
ing two consecutive diagonal chains [see the bcc 
unitary cell in Figure 1(a)], with two more layers 
below. The dimensions of the cluster correspond to 
the diagonal (2.5223 A) in the bcc unit cell. As in 
the unit cell, the cluster has four layers with an 
interlayer spacing which is 1 /4 of the lattice con¬ 
stant of the bcc cell. Table VI presents the Carte¬ 
sian coordinates of the nonrelaxed C 9 carbon clus¬ 
ter taken from the bcc cell parameters [Fig. 2(a)]; 
Figures 2(b) and 2(c) present the corresponding 
relaxed and relaxed hydrogenated structures using 
Table V parameters as the starting point. 

In order to saturate the valencies of the nonsur¬ 
face carbons, 12 virtual hydrogens have been used; 

NON-RELAXED CLUSTER C(100) 1x1 
WITH HYDROGEN CHEMISORBED 


TABLE VI 


Cartesian coordinates of the nonrelaxed 
C 9 -carbon clusters. 


Atom 

X 

Y 

Z 

C 1 

0.0000 

0.0000 

0.0000 

C 2 

1.2612 

0.0000 

0.8918 

C 3 

2.5223 

0.0000 

0.0000 

C 4 

0.0000 

2.5223 

0.0000 

C 5 

1.2612 

2.5223 

0.8918 

C 6 

2.5223 

2.5223 

0.0000 

C 7 

0.0000 

1.2612 

-0.8918 

C 8 

1.2612 

1.2612 

-1.7835 

C 9 

2.5223 

1.2612 

-0.8918 


RELAXED CLUSTER C(100) 2x1 
WITHOUT HYDROGEN CHEMISORBED 



FIGURE 2 . Geometrical structure of carbon clusters: (a) hydrogen nonrelaxed; (b) retexedj:and (c) relaxed 
monohydrogenated. 


INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY 


871 



AGACINO VALDES ET AL. 



Table VII shows the total energies computed for 
nonrelaxed hydrogen, relaxed, and relaxed mono- 
hydrogenated C 9 carbon cluster and the energies 
involved in the relaxation and hydrogen 
chemisorption process. 

The experimental results are qualitatively repro¬ 
duced. The relaxation process produces a stabiliza¬ 
tion of the cluster, but notice that the presence of 
hydrogen stabilizes it too much. The computed 


relaxation energy is smaller than that calculated by 
Verwoerd [26] and Mehandru-Andersen [25b]. 


Methyl Adsorption and Growth of the 
First Layer of Diamond 

Because of the unique properties of diamond, 
great efforts have been put into the synthesis of 
this material to be used in advanced materials 
applications. The first attempt to obtain diamond 
was in 1955, but the process required high pres¬ 
sures and temperatures. After 1955 a great number 
of investigators began to explore the possibility of 
synthesizing diamond at low pressure and temper¬ 
ature, that is, in the region of its metastability. The 
chemical vapor deposition (CVD) technique of dia¬ 
mond films was a reality in 1958, but the growth 
rates were very slow and graphitic carbon was 
deposited simultaneously; hydrogen was neces¬ 
sary to remove the graphitic phase. 

In 1975, Deryagin's group [28] reported the use 
of a gas activation technique which resulted in 
dramatic increases in diamond growth rates, while 
eliminating much of the graphitic deposition. 
Starting in the 1980s many reports were published 
of successful diamond growth in low pressure 
using new gas activation techniques. 

It is now well established that diamond can be 
deposited at low pressures (from 1 to 10“ 2 atm.) 
and temperatures between 700 and 1000° C using 
the CVD technique. In this technique, a mixture of 
methane and ~ 98% of hydrogen is activated by 
passing it through a plasma or a hot filament; this 
process produces atomic hydrogen and carbon 
radicals, and the deposition of these species over 
the diamond surface produces the growth of dia¬ 
mond and graphitic phases. A highly preferential 
etching of the graphitic phases is produced by 
atomic hydrogen [1]. 

Although different small hydrocarbon frag¬ 
ments have been analyzed as possible precursor 
species in diamond growth, at present it is ac- 


TABLE VII___ 

Total energies for the nonrelaxed and the relaxed cluster, with and without hydrogen. 


cluster 


(hartrees) 


Nonrelaxed + 2H 
-348.10161 


Relaxed 

346.95654 


Relaxed + 2H 
-348.23939 


^relax. (®V) ^{relaxed + 2H) (^nonrelaxed+ 2H) 
^quims. (H) = ^ I ^^(relaxed + 2H) — ^(relaxed) 


-3.75 

-4.49 
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cepted that CH 3 is the dominant growth species 
[29]. 

Mehandru and Anderson [25b] calculated CH 2 
and CH 3 energy barriers in the presence of the 
hydrogen vacancies on the monohydrogenated 
surface and reported for CH 2 and CH 3 , a C-CH 2 
and C-CH 3 of 1.48 and 1.59, respectively; the 
corresponding binding energies are 4.38 and 3.38 
eV in the most stable sites. We have calculated the 
energy of C 9 -H, CH 3 , C 9 -H, CH*, and C 9 -2H, 
CH 2 using the deMon program. 

Five steps in the growth mechanism were con¬ 
sidered as suggested by Garrison et al. [30]: (1) A 
hydrogen leaves the surface and a radical site is 
formed. (2) The methyl radical is adsorbed over a 
radical site. (3) A second hydrogen belonging to 
methyl radical leaves the surface and a methylen 
radical is formed. (4) The dimer bond is broken by 
a methylene group; a methyliden group and a 
radical site are formed. (5) A methyliden group 
attacks the radical site and a bridge-methylene 
structure is formed. A new layer of diamond has 
been produced and the hydrogens of the methy¬ 
lene group can be substituted for new methyl 
radicals that arrive to the surface. 

The calculated energies confirm that, in the 
mechanism suggested by Garrison et al. [30], step 


TABLE VIII 


Total energies for the relaxed cluster with methyl 
and methylene species adsorbed and energy 
changes involved in the steps of the mechanism. 



C 9 -H,CH 3 

c 9 -h ch 2 

E total (hartrees) 

-386.51 

-385.54 

dCsurf “ C ads (A) 

1.53 

— 


Steps 

A E/eV 

Step 1 

+0.01 

Step 2 

+0.5 

Step 3 

-0.69 

Step 4 

-0.002 

Steps 

+0.89 


3 is an activated process that produces an interme¬ 
diate radical state. Table VIII presents the results 
and Figure 3 presents a scheme of this process. 
Another interesting point in the mechanism is step 
4. The problem is whether the methylen group will 
attack the dimer bond or will displace the top 
hydrogen. In order to choose between these two 
possibilities, an electrostatic potential calculation 
was made for the C 9 -H, CH 2 structure. Figure 4 
presents the electrostatic potential curves in the 
plane bisecting the dimer for this structure. 



FIGURE 3. Steps 1 and 2 of the reaction mechanism. 
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FIGURE 4. Electrostatic potential curves in the plane 
bisecting the C 9 -H, CH 2 cluster. 


The electrostatic potential curves predict that 
the methylen group lbok for interaction with the 
dimer rather than for the fop hydrogen. Then, step 
4 is potentially possible: The methylen group 
breaks the dimer bond and it does not displace the 
top hydrogen. 


Conclusions 

A full optimization of C 9 cluster predicts the 
relaxation of the C(100) surface as the most stable 
and confirm the important role of Hydrogen in the 
stabilization of this surface. 

The energy involved in the step 2 is negative 
and the intermediate C 9 -H, CH 2 has a barrier 
energy of 34.08 eV. The electrostatic potential 
shown the step 4 is potentially possible. In order 
to improve the results quantitatively, a partial 
optimization of dusters larger than C 9 are being 
calculated. 
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ABSTRACT: A spin-averaged Hartree-Fock (SAHF) procedure is examined within 
the framework of the configuration-averaged methodology proposed earlier. The SAHF 
method produces reasonable total energies and can be successfully used for the calculation 
of electronic spectra, especially in the cases of marked energy separation between the 
spin states. An example is given with the spectroscopy of Mn 2+ ions in the ZnS crystal 
where the SAHF procedure provides a more accurate interpretation of the electronic 
spectrum than that obtained by other self-consistent-field methods. © 1997 John Wiley & 
Sons, Inc. Int J Quant Chem 65: 877-884, 1997 


Introduction 

T he applications of quantum chemical meth¬ 
odologies to investigations of open-shell 
transition-metal (TM) systems are complicated by 
the large number of spin states often associated 
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with such systems. The number of spin states in an 
open-shell system depends on the number of 
open-shell orbitals and electrons. In general, the 
number of microstates available to an open-shell 
system is given by Eq. (1), with N and m the 
numbers of open-shell electrons and spatial or¬ 
bitals, respectively; 


[2 m\ = < 2m ) ! 

I N ) N!(2m - N)\ 


( 1 ) 


and the number of states for a given multiplicity 
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by [1] 


N„,, n (S) = 


m 


m 


(2S + N)/2 I (N - 2S)/2 


I m ' 

\(2 S + N + 2)/2j 

I m ^ 

X \(N - 2S - 2)/2 j 
Nn.^iS) = M mN (S) — M mN (S + 1), 


( 2 ) 

(3) 


with n a — number of a spin electrons, n a > n /3/ 
and S = \(n a - rip); N = n a + rip and it is under¬ 
stood that the second term in Eq. (2) is zero should 
(N + 2S + 2) > 2m. Note that (N + 2S)/2 = n a 
and (N — 2S)/2 = rip. The first term in Eq. (3) is 
the number of times n a electrons can be arranged 
in m a-spin orbitals multiplied by the number of 
times rip that electrons can be arranged in m 
/3-spin orbitals. The second term subtracts from the 
first those microstates that have the value S 2 = S 
but that belong to multiplicities greater than S. 

In the case of a single TM center, several hun¬ 
dreds of possible states can arise and this number 
grows quickly to 10 5 -10 6 for TM dimers, 10 n -10 12 
for systems with four metals, etc. This large num¬ 
ber of possible states can create a large number of 
states close in energy to the lowest-energy state 
and often prevents the self-consistent-field (SCF) 
convergence, leading also to spatial or spin insta¬ 
bilities and thus invalidating the pure spin de¬ 
scription of the system. The calculation of the 
ground state is, therefore, a difficult task and re¬ 
quires a special approach to the treatment of 
molecules with a large number of open shells. 

Previous work on TM systems has demon¬ 
strated that an iterative configuration averaged 
Hartree-Fock (CAHF) procedure [1] is suitable for 
the description of electronic properties and geo¬ 
metric characteristics of metal clusters [2-7]. The 
CAHF procedure has been compared with other 
methods which also average over multiplets and 
effective degeneracies, and the CAHF reliability 
has been demonstrated especially for molecular 
systems with extensive open shells [2]. The CAHF 
operator was developed within the restricted 
open-shell Hartree-Fock (ROHF) approximation 
and generalized for systems with an arbitrary 


number of electrons and open-shell orbitals [2, 3]. 
For this purpose, a Fock operator for the averaged 
energy of a configuration is considered. The ener¬ 
gies and the orbitals from such an operator can be 
used to form a reference for a subsequent spin 
projection to obtain the pure spin states using, e.g., 
Rumer diagrams [1, 8, 9] or unitary group tech¬ 
niques [10, 11]. Subsequent configuration interac¬ 
tion (Cl) can also be performed with such conve¬ 
niently obtained orbitals, allowing for a proper 
check of the ground state and a refinement of the 
total wave function and its properties. 

It was the aim of this work to present a further 
development of the CAHF theory that allows for a 
better approach to the spin-state problem through 
a selection of the configuration-averaged states by 
a spin eigenvalue. This leads to a smaller and 
significantly simplified pool of possible states and 
can present a better reference for the subsequent 
CL An example is given with the spectroscopy of 
the Mn 2+ ion in the ZnS lattice where the new 
approach is compared with the ROHF calculations 
performed previously on the same system [12] 
using the ZINDO program and the INDO/S 
Hamiltonian [13]. 


Theory 

In general, we are trying to calculate the aver¬ 
age energy of a system with N electrons in m 
spatial orbitals. The number of possible mi¬ 
crostates is given by Eq. (1). We assume that the 
states are described by spin and spatially adapted 
linear combinations of Slater determinants, \<f> 1 
(f> 2 ... cf) N \ constructed from a set of orthonormal 
spin orbitals {$*},(<£/1 <fy> = 

The number of determinants of N electrons in 
m spatial orbitals, when one of the orbitals </>; is 
doubly occupied, given by 


/ 2 m — 2 \ 

\ N - 2 y 


(4) 


Since each such determinant yields the Coulomb 
integrals ] n in the expression for its energy, this is 
also the number of ] u integrals that appear in all 
possible determinants [Eq. (1)]. 

Accordingly, for i ¥= j, the number of / I; inte¬ 
grals that appears in all possible determinants is 
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given by 



2m — 2 \ 

N-2 y 


(5) 


as the electrons in and (f) ; can be of a and 
spin independently. The general Coulomb integral, 
Jij, is defined as 


hj = //dr(l)dr(2)^f(l)^(2)(l/r 12 )^(l)^(2). 

( 6 ) 


In a simple fashion, the number of exchange 
integrals, when considering all possible deter¬ 
minants of Eq. (1), is given by 


2 


2m - 2 \ 
N-2 )' 



with 


bitals, the next two are those for open-shell contri¬ 
butions to the energy, and the last term gives the 
coupling between the closed and the open shells. 
Here, i and j stand for the doubly occupied 
closed-shell orbitals, and p and n label the open- 
shell orbitals with fractional occupations n and n v 
belonging to open shells jjl and v, respectively. In 
the general case, p and n refer to individual or¬ 
bitals, and each pi and v define a separate open- 
shell Fock operator. Most often, sets of orbitals 
having the same vector coupling coefficients, a 
and b in Eq. (9), are grouped together into shells 
described by the same open-shell operator [1, 14]. 
Comparing Eq. (9) with (10) gives the vector cou¬ 
pling coefficients for a given open shell r, with 
fractional occupation n r = N r /m r : 


2 m r (N r - 1) 
N r (2m r — 1) ’ 


( 11 ) 


K tj - ff rfr(l)dT(2)^f(l)^;(2)(l//- 12 )</.,(2)^(l). 

( 8 ) 


The vector coupling coefficients between the dif¬ 
ferent open shells are equal to 1: 


In this case, both </> z and need to be of a or jS 
spin, respectively, for the factor 2 rather than the 4 
of Eq. (5). The average total energy of the open-shell 
is then obtained through averaging over the total 
number of integrals for all possible states. This 
leads to 


N 

E ave (open) = — X>„ + 
m n 


N(N - 1) 
2m(2m — 1) 


X 


Ulnn + 2 1L ( 2 Jnp 

n n<p 


K np ) 


. (9) 


To use the ROHF operator [14], the energy must be 
cast in the form of 


E = 2Zh,+ LGhi-Ktj) 

i i, j 


4- / n h 

JLj m m 

m 


U rs = brs = 1 - ( 12 ) 

The configuration average thus obtained is a 
mix of various states with different total spin. This 
spin contamination is generic for the CAHF and 
typically presents a difficulty in the determination 
of the pure spin state specified in the ROHF calcu¬ 
lations. When a fraction of the total CAHF states is 
selected by the total spin, one obtains a signifi¬ 
cantly smaller range of configurations; we further 
refer to them as spin-averaged states [see Eq. (2)]. 
This spin-averaged Hartree-Fock (SAHF) proce¬ 
dure, similar to the CAHF methodology, operates 
in the reduced manifold of spin-eigenstates within 
a given configuration, leaving unsampled (in the 
average) a large number of unwanted configura¬ 
tions. 

For the SAHF, the vector coupling coefficients 
have the following form: 


^ fx v p e jit n e v 

1 ^ 

+ rE n / 2 ^ ~ K ip ). do) 

Z i, p 


a(S, m, N) = 


m 


(m 2 - 1 )N 2 


X 


mN(N - 1) + 


N(N - 4) 


The first two terms in Eq. (10) account for con¬ 
tributions from electrons in the closed shell or- 


+ 2S(S + 1) 


(13) 
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advantages of the SAHF approach are illustrated 
in the next section. 


Application 


+ 4S(S + 1) . (14) 

To derive Eq. (14) (perhaps there is a simpler 
way!), we consider four cases and the genealogical 
construction of spin states [1]. Each case gives rise 
to the occurrence of at least one Coulomb integral. 
In case one, there is one up-spin electron in both 
orbitals c p t and </> y . Such a triplet-coupled pair 
gives rise to / /; - K f; and occurs multiplied by the 
number of times the remaining determinant can 
couple to give an eigenstate of spin eigenvalue 
S [more correctly, S 2 eigenvalue of S(S + 1)]. If 
S > 1, this is (N m „ 2 ,N~ 2 ( s ” 1) + N m - 2 ,N- 2 (S) + 
N m _ 2 /N _ 2 (S + 1)) times [see Eq. (2)]. If S = 1/2, 
the multiplicative factor is (N m _ 2N „ 2 (S) + 
N m~ 2 ,N- 2 ( s + 1))/and if S = 0, N m _ 2 N _ 2 (S)(ixom 
the addition rule of angular momentum, S = | S + 
1|, |S + 1| — 1,...|S - 1|). A second case has an 
up-spin electron in </>, and a down-spin in </> y , 
forming an (a/3 - /3a)/ yfl singlet-coupled pair. 
Such a pair contributes J /; - + K x - and occurs 
N w „ 2/N _ 2 (S) times. The third case contains two 
electrons in 4 > i and one in c/> ; or visa versa. Each 
such situation contributes 2(2 J i} — X /; ) and occurs 
(N m _ 2/N _ 3 (S + 1/2) + N m _ 2N _ 2 (S - 1/2)) times 
if S > 1/2 or N m _ 2fN _ 2 (S) if S = 0. Finally, the 
case of two electrons in each of these orbitals 
contributes 4L — 2 and occurs N m _ 2>N _ 4 (S) 
times. Combining these terms and dividing by 
N m N (S\ which is very tedious and best done 
symbolically, yields the average number of /'s and 
K r s. Interestingly, the special cases delineated 
above take care of themselves providing that the 
binomial terms in Eq. (2) with an ill-defined argu¬ 
ment are set equal to zero. The comparison is then 
made with Eq. (10) with all open-shell fractional 
occupancy n„ = n, = N/m to obtain the a-.-, and b., 
of Eqs. (13) and (14). 

} u occurs N m „ 1N _ 2 (S) times, whereas, of 
course, two electrons in one orbital contribute no 
K u . In comparing with Eq. (10), however, J H occurs 
n i n i ( 2a i{ - b u )/ 4 times (since J H = K (i ), allowing 
the same expression to be used for a xi as for a r 
and for b H as for b {]/ i ^ j, and, thus, the SAHF 
procedure is characterized by a single open-shell 
operator as it was for the CAHF procedure. The 


We illustrate the SAHF approach for the case of 
two electrons in two orbitals, which is represented 
by the six microstates shown in Figure 1. The 
average energy obtained by the CAHF method 
over all the six states is then split within the SAHF 
framework into two pieces, corresponding to the 
three possible singlets (Fig. 2) and the triplet states 
that form this configuration. Figure 1 shows two 
possible ways to obtain the averaged energies, 
through the averaging over all the individual mi¬ 
crostates (left block) or through the pure spin states 
(on the right). The results are the same as shown in 
the figure. 



FIGURE 1. Configuration averaging illustrated for the 
simplest case of two electrons in two spatial orbitals, h 
stands for the one-electron terms in the energy 
expressions; J and K are the Coulomb and the exchange 
integrals, respectively. At the bottom of the table, the 
CAHF energy is compound with other averaging 
procedures, such as “grand canonical Hartree-Fock” 
(gchf) and “hyper-Hartree-Fock” (hhf) ([15] and [16], 
respectively). 
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The relative energies obtained by various meth¬ 
ods for the similar case of a d 8, TM ion in an 
octahedral ligand field (two electrons in the upper 
doubly degenerate e g orbitals, d z i and d x 2 _ y2 ) are 
depicted in Figure 3. On the right side of Figure 3, 
the Cl energies of the ground-state triplet and the 
first two excited-state singlets that originate from 
the three dominant configurations are shown. We 
see that the open-shell triplet and singlet SCF 
calculations, ROHF-3 and ROHF-1, respectively, 
represent a good reference for the corresponding 
lower-lying spin states. SAHF-1 gives closer en¬ 
ergy for the higher-lying singlet than the other 
methods. In all the cases, RHF is a poor reference 
for either singlet or triplet states. Upon projection, 
the CAHF produces higher energies for the 
ground-state triplet and singlets, but is a good 
reference for higher-lying triplets, as, e.g., those 
originating from mixed (f 2 , e ) configurations, not 
included in any of the SCF schemes shown here. 
Note that the ROHF-3 and SAHF-3 correspond to 
the same triplet state which is unique, (see also 
Fig. 1). 

We conclude from these simple considerations 
that configuration-averaging models provide a 
good reference for the spectroscopic Cl calcula¬ 
tions. The advantages offered by the SAHF ap- 


Energy (eV) 

I — RHF 


5 



— SAHF-1 

"tf — 

S2 

— ROHF-1 

— CAHF 

g | 

i t 

SI 


0 


ROHF-3 _A_ JR 


FIGURE 3. Relative energies of the states produced by 
various SCF methods (left) compared with the Cl 
energies for the three lowest states originating from d 2 
configuration (right). 


_ I 1_ t 

11 \_L + J_ 

{I Ei + ^£- 5+6 + 1 E2}/3 

SAHF 

- /I— -Li 11 JL /1_ _1_\ 1_ 

11 \J—L/_L \l_L/ _L 

El + 1 E5+6 + 1 E2 + 3 E3 + 3 E5-6 + 3 E4>/6 

CAHF 

FIGURE 2. Comparison between the SAHF and CAHF in the case of two electrons in two orbitals. The numbering 
corresponds to the pictures on top of Figure 1. 
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proach can be demonstrated with relative ease in 
the cases where there is a distinct separation be¬ 
tween the ground and the excited spin states. We 
illustrate this idea with the examination of the 
electronic spectrum of a d 5 ion in a cubic crystal 
field (Fig. 4), which has an unique sextet ground- 
state 6 A l originating from the high-spin e 2 t\ con¬ 
figuration in a tetrahedral field. 

The electronic spectrum of ZnS:Mn has been 
thoroughly investigated in the last several decades 
[17-24]. Five distinct bands have been observed in 
the visible region corresponding to d-d transitions 
from the 6 A X ground state to the manifold of 
quartet states originating from the configurations 
e y t\~ y , with y = 1-4. The t\ configuration con¬ 
tributes to to the doublet excited states and re¬ 
quires excitation of two electrons from the ground 






b 1 - 

4 t 

fb'l 


T 

b 2 


e 2 t\ configuration. This requires a double spin-flip 
during the electronic transition and imposes a strict 
spin prohibition on such bands. Doublet states are, 
therefore, considered unlikely to contribute mea¬ 
surably to the d-d absorption spectrum of ZnS:Mn. 

In a previous work [12], we calculated the d-d 
bands of the ZnS:Mn spectrum using the ROHF 
ground state as a reference for the subsequent Cl 
that forms the quartet states using the spectro¬ 
scopic intermediate neglect of differential overlap 
(INDO/S) model [13]. We applied the same ap¬ 
proach here using the SAHF states as a reference. 
The results are given in Table I. We see that the 
level of accuracy is considerably improved by the 
SAHF method and the average error decreases to 
the level of the vibrational broadening of the spec¬ 
troscopic bands: 0.14 eV. Following the SAHF cal¬ 
culations, the 4 A 1 level is again calculated above 
the 4 E state by 160 cm -1 , as compared to the 100 
cm -1 that we calculated earlier using the ROHF 
approximation [12]. The relative positioning of 
these two states is controversial [18, 20]: The first 
doublet state that we calculate using Cl single and 
double excitations is at 3.72 eV, beyond the ob¬ 
servable spectroscopic range for ZnS with a band 
gap absorption starting at nearly 3.6 eV. Therefore, 
the possible contribution of doublet states to the 
ZnS:Mn electronic spectrum should be ruled out. 



FIGURE 4. Term level diagram for a d 5 ion in a cubic 
ligand field and the sextet-quartet transitions related to 
the ZnS:Mn absorption spectrum. 


TABLE I_ 

Calculated and experimental band energies (eV) 
for ZnS:Mn; ab initio and experimental data are 
taken from [22]. 


Transition 

Ab initio 3 

ROHF b 

This 

work 

Experiment 

6 A h a 4 T 

2.68 

2.72 

2.61 

2.34 

a 4 7 2 

2.83 

2.76 

2.68 

2.49 

^ a 4 E 

2.96 

2.78 

2.69 

2.67 


2.96 

2.78 

2.71 

2.67 

b 4 T z 

3.31 

3.18 

3.07 

2.89 

->b 4 T, 

3.39 

3.47 

3.15 

3.19 

b 4 E 

3.52 

3.56 

3.46 

— 

Average 

absolute 

error 

0.31 

0.24 

0.12 



a MR-CIS calculations employing Watcher’s (14,9,5) basis 
with extra diffuse functions for Mn (14,11,6) [8, 7, 4] and 
Roos-Siegbahn’s (10, 6) [7, 4] basis for S; from [25]. 
b ROHF stands for transition energies calculated using ROHF 
sextet state as a reference for the subsequent configuration 
interaction (see [2]). 
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The absorption spectrum is essentially reproduced 
from sextet-to-quartet d-d excitations and the 
agreement with the experiment (see [24]) is im¬ 
proved significantly by the SAHF procedure. 

The SAHF procedure is the same as the ROHF 
approximation for the case of maximum multiplic¬ 
ity, i.e., S = s z = m. It appears that it is a reason¬ 
able starting position for any Cl of the multiplicity 
over which the average is taken, but need not to be 
the best average for the cases of lower multiplicity. 
Returning for the moment to the case of two elec¬ 
trons in two orbitals, the singlet average contains 

I <t>r <f>il [\4>,4>j\ ~ \4>i4>j\}/ hi and \4’^> j \. The \<f>j<f>j\ 
state may be of such high energy as to render the 
average orbitals obtained not as good for the low¬ 
est-lying singlets as that for the CAHF case of 
Figure 1, since the energy there is lowered by 
including the three triplet states in this average 
(see, e.g.. Fig. 2). Indeed, in such cases, it can be 
difficult to converge the SCF. 

It is interesting also to develop an operator that 
presents the average of a given multiplicity for an 
open-shell structure of N electrons in m = N or¬ 
bitals. This would be the case, e.g., of the singlets 
or the triplets possible from through of 
Figure 1. There are 


N(S) = 1 with S = S max = N/2 


N(S) - 


N 

{IS + N)/2 


) 


f N \ 

^ (2 S + AT 4- 2)/2 j 

with S < S max (15) 


such states (one singlet, and one triplet for i// 3 
through i {/ 6 of Figure 1). Thus, the coupling coeffi¬ 
cients are obtained in the form 

2 a H = b iit a { : = 1 all i,j 
b u = 2 all i 

b ij = 2 [Vp(Vp - 1 ) + VdVa ~ 1 ) 

-2ri a rip/N(S)\/ m (m - 1) all i ¥= j. 

(16) 


This development, unfortunately, generates a dif¬ 
ferent operator for each orbital; for example, in the 
common case of the ROHF singlet operator for two 
electrons in two orbitals, this is two open-shell 
operators. Such an approach quickly becomes un¬ 
wieldy. For the case of antiferromagnetic spin cou¬ 
pling, however, where electrons of a given spin are 
"frozen" in the corresponding orbitals of a given 
center, say C and D, this approach again becomes 


simple, leading to 

djj = 1 all i, j 
bij = 2 ieC( T), jeD(t); or 
ieC(f), jeD(l) 
bjj = 0 otherwise. (17) 

This leads to two open-shell operators, one for all 
up-spin domains, C( T) and D( T X and one for the 
down-spin spin domains, C( i ) and D( i ): 

* — (! i)/(i D* B -(o 2l) «» 

For this to be useful, the orbitals must be localized, 
as they often are for d orbitals in open-shell situa¬ 
tions, and electrons of up- and down-spin must be 
assigned to different domains as well as specifying 
the overall S z value, i.e., A( T X B( i ), C( T)... . This 
operator has been used several times by us with 
good success [26]. 
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ABSTRACT: The red shift of the first absorption band of benzene in the condensed 
phase is calculated. The solid-phase calculations are performed using the experimental 
crystal structure. For the liquid two different approaches are considered. One is the 
herringbone structure suggested by X-ray experiments, and the other uses supermolecules 
generated by Monte Carlo simulation. Attention is given to the contribution of the usual 
parallel and T-shape conformations of the benzene dimers obtained in the simulation. 
Comparison with experiment shows a relatively poor result for the herringbone structure 
and good agreement for the supermolecular structures generated by the simulation. 

© 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 885-891, 1997 


Introduction 

I n this study we calculate and analyze the red 
shift of the first absorption band of benzene in 
the condensed phase as compared to the gas phase. 

The interest in the structure [1] and spectra of 
clusters and condensed-phase benzene is related to 
several topics of interest in chemistry, physics, and 
biology. It naturally relates to the importance of 
the interaction between aromatic 7r-systems. This 
present study also relates to our interest in solvent 
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effects in molecular absorption spectroscopy [2, 3]. 
For the calculation of the solvatochromic shift of 
liquid benzene, we use two structures. One is the 
well-known herringbone structure [4] and the other 
is a disordered structure obtained from Monte 
Carlo (MC) simulation. For completeness, we also 
consider the first absorption transition of the solid 
benzene. All quantum mechanical calculations are 
made using the INDO/CIS model [5] as imple¬ 
mented in the ZINDO program [6], and we use the 
so-called supermolecular (SM) approach. In the 
case of the solid and the herringbone structure of 
the liquid, two configurations are used (Fig. 1). In 
the first we use 14 benzene molecules both for the 
solid and the liquid, as seen iq Figure 1(a) (top). In 
the second, we use only 13 benzene molecules. 

CCC 0020-7608 / 97 / 050885-07 
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of solid and liquid benzene. The top figures show 14 
benzene molecules. The bottom figures show instead 13 
molecules and includes one bulk benzene in the center. 
Molecules 1,2,3, and 4 generate the unit cell as defined 
by Narten [4], 


This corresponds to including one benzene and its 
12 nearest neighbors of the well-known Ybca space 
group [4]. This second case, as seen in Figure 1(b) 
(bottom), as opposed to the first case, includes a 
central benzene molecule to represent the "bulk." 
Calculations are then performed at these four 
structures but in each case using a fixed geometry. 

For the case of the disordered liquid benzene 
the structures generated by the MC simulation [7] 
are used. Each SM is composed of 14 benzene 
molecules. As we shall see, this corresponds to 
including the first coordination shell as obtained 
from the radial distribution function (RDF). In 
addition, however, several SM calculations are 
performed. The actual number of SM structures 
used takes into account the statistical correlation 
between MC structures. The final excitation energy 
is obtained averaging over the transition energies 
obtained for each structure. A comparison of these 
different procedures will be made. 


Monte Carlo Simulation 


The MC simulation is performed using standard 
procedures for the Metropolis sampling technique 
[8] in the canonical ensemble, where the number of 
molecules N, the volume V , and the temperature 
T are fixed. As usual, periodic boundary condi¬ 
tions in a cubic box [9] are used. In our simulation, 
we use 125 molecules of benzene. The volume of 
the cubic box is determined by the experimental 
density of the liquid, p = 0.8990 g/cm 3 [10], and 
the temperature is T = 25°C. There has been much 
discussion about the quadrupolar interaction of 
benzene and its relation to the structure [11] (see 
next section). In our theoretical experiments, we 
are willing to generate a disordered structure for 
the liquid. Therefore, the benzene molecules inter¬ 
act only by the Lennard-Jones potential: 


on a on b 

U ab = E L 4 e ti 
i i 



( 1 ) 


where E fl is the sum over the atoms of molecule a, 
E & is the sum over the atoms of molecule b, r { - is 

the distance between atoms i and j, = ^e-e- , 

cr 2 / = and a i and e { are the parameters of 

atom i. The potential parameters used were devel¬ 
oped by Jorgensen and co-workers [12]: cr c = 3.75 

o 

A, e c = 0.11 kcal/mol, and a H = s H = 0. As the 
environment of the liquid system has no signifi¬ 
cant effect on the conformational equilibrium for 
the benzene molecule [4, 14], we consider, in the 
simulation, rigid conformations of the benzene 
molecules in the D 6h structure with r cc = 1.40 A, 
r CH = 1*0^8 A, and all angles 6 = 120°. We recog¬ 
nize that distortions of the benzene structure would 
lead to considerably enhanced intensity of the for¬ 
bidden 1 B 2u transition we examine. We do not 
believe, however, that the position of the band 
maximum will be much effected [15]. 

In the calculation of the pairwise energy, each 
molecule interacts with all other molecules within 
a center-of-mass separation that is smaller than the 
cutoff radius r c = 13.11 A. For separations larger 
than r c , we use the long-range correction of the 
potential energy [13]. Over the 125 benzene 
molecules included in the simulation, each mole¬ 
cule interacts effectively with ~ 60 neighbors. 

The initial configuration is generated randomly, 
considering the position and the orientation of 
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each molecule. A new configuration, or one MC 
step, is generated by selecting all molecules se¬ 
quentially and trying to translate it in all the 
Cartesian directions and also rotate it around a 
randomly chosen axis. The maximum allowed dis¬ 
placement of the molecules is auto-adjusted after 
50 MC steps to give an acceptance rate of new 
configurations around 50%. The maximum rota¬ 
tion angle is fixed during the simulation in 80 = 
±15°. The full simulation [7] involves a thermal- 
ization stage of about 5000 MC steps followed by 
an averaging stage of 48,000 MC steps. During the 
averaging stage some thermodynamical proper¬ 
ties, as the internal energy and the heat capacity at 
constant volume, are calculated, and they are 
in complete agreement with that calculated by 
Jorgensen and co-workers [12]. The radial distribu¬ 
tion function is also calculated during the averag¬ 
ing stage in the simulation. 


Results and Discussions 

The structure of the liquid benzene has been of 
interest for many years. Ever since the X-ray study 
of Narten [4], the herringbone structure is believed 
to prevail. This structure is illustrated in Figure 1 
along with the known experimental structure of 
solid benzene. In the case of the benzene dimer the 
interest is basically related to the relative stabiliza¬ 
tion of the parallel and T-shape structure [16]. A 
herringbone structure for the dimer has also been 
proposed [17]. More recent studies [18] have indi¬ 
cated that a parallel-displaced configuration seems 
to be more stable than the T-shape favorite struc¬ 
ture for the dimer. It seems clear from all these 
studies that the parallel-displaced, T-shape, and 
T-shape-displaced structures are very close in 
energy. 

From these considerations it seems appropriate 
to consider SM calculations of the liquid benzene 
with all these possibilities, not only the herring¬ 
bone structure. Thus several structures generated 
by the MC simulation will be used. As these struc¬ 
tures are statistically correlated, we will select 
structures that are separated by a certain number 
of MC steps. This number results from the time 
correlation function and the statistical inefficiency 
analysis [19]. In fact, we select structures that are 
separated by twice the correlation time (2%), ob¬ 
tained after the integration of the time correlation 
function. Clearly, "time" here should mean MC 


step. The calculated time correlation function, C(0, 
and the statistical inefficiency, s, for the energies 
of the liquid benzene structures are shown in Fig¬ 
ure 2. The calculated C(0 function gives the corre¬ 
lation time value of ET~ 400 steps. This is related 
to the statistical inefficiency and the relation s ~ 2% 
holds. Thus we use only MC structures that are 
separated by 800 MC steps. As the entire simula¬ 
tion took 48,000 MC steps, only 60 are statistically 
relevant and used in our studies. 

Figure 3 shows a comparison between the calcu¬ 
lated RDF from the MC simulation and the experi¬ 
mental RDF of Narten [4]. As can be seen, the 
overall agreement between these two results is 
very good. The results show a first broad peak 
ranging from 3.0 to 7.6 A. Integration of this peak 
gives the coordination number of N s = 12.5. This 
theoretical result is of interest in two different 
aspects. First, the experimental result of Narten 
gives N s = 12, in close agreement with our theo¬ 
retical value. This suggests that N s is not enough 
to uniquely characterize the liquid as having a 
herringbone structure. The second aspect is of more 
technical interest. N s = 12.5 signifies that, on the 
average, each benzene molecule has 13 (rounding 
off) neighbors and therefore the super molecule 
should have 14 molecules. Hence all three systems, 
including the solid and the herringbone, will be 
calculated with the same number of molecules. 
The difference is, however, that in the case of the 
MC liquid a total of 60 SM quantum mechanical 
calculations are performed. 

In all cases the excitation energy to the B 2u (rr — 
7i*) state of benzene is calculated including all the 
e lg —> e 2u transitions of each individual benzene; 
that is, from 28 occupied molecular orbitals into 28 
molecular orbitals of the virtual space. 

Table I summarizes the calculated numerical 
results and compares them with the experimental 
values for the red shift of benzene in condensed 
phase. 

Figure 4 shows the calculated result for the 
excitation energy of the MC liquid together with 
the final average (-306 cm -1 ) and the experimen¬ 
tal [20] ( — 320 cnC 1 ) values for the shift compared 
to the gas phase. It should be noted that the shift 
for an individual structure may differ by a few 
hundreds of cm -1 , but the "ensemble average" 
final result is in very good agreement with the 
experimental result. It is interesting to analyze, 
wherever possible, the contribution of the individ¬ 
ual benzene molecules. This is, of course, not al¬ 
ways possible. However, we note that in many 
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FIGURE 2. Time correlation function (top) and the statistical inefficiency (bottom) of the MC simulation. See text. 



2 5 8 11 

r [Angstrom] 


FIGURE 3. Comparison of the calculated and experimental [4] carbon-carbon radial distribution function of the liquid 
benzene. 
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TABLE I_ 

Summary of the calculated and experimental 
results for the B 2u ('n - tt*) energy shift (cm -1 ) 
in the absorption spectra of benzene in 
condensed phases . 3 


System 

N 

^ ^calc. 

A£ exp . [20] 

Gas 

1 

0 

0 

Solid b 

13 

-260 

— 260 + 30 


14 

-240 


□quid (herringbone) 0 

13 

-100 

-320 ± 30 


14 

-60 


Liquid (herringbone) d 

13 

-100 

-320 ± 30 


14 

-40 


Liquid (MC) e 

14 

-306 

-320 ± 30 


a A/ is the number of molecules included in the quantum 
calculations. 

b Lattice parameters [4] (7.460, 9.666, 7.034) in A. 
c Density of 0.874 g/cm 3 corresponding to the lattice pa¬ 
rameters [4] (6.75, 9.74, 9.03) in A. 

d Density of 0.899 g/cm 3 (used in the MC simulation) corre¬ 
sponding to the lattice parameters (6.75, 9.74, 8.78) in A. 
e Average value over 60 SM structures. 


structures the transition is somewhat localized in 
two benzene molecules. Interesting enough, these 
two molecules are found to be in well-known 
conformations, like the T-shape-displaced and the 
parallel-displaced (see Fig. 5 for illustration). 
Therefore we now look at systems composed of 


300 


0 



-1200 1 —*— A —*— 1 —*— 1 —— 1 —^‘ 

0 10 20 30 40 50 60 

Configuration Number 

FIGURE 4. Calculated red shift obtained from the 
disordered MC structures of the liquid benzene. Also 
shown is the average value (solid line) and the 
experimental result (dashed line). 


T-shape-displaced 



Parallel-displaced 



FIGURE 5. Illustration of MC structures showing 
the T-shape, displaced, and parallel-displaced dimer 
conformations. These dimers give a large contribution to 
the total red shift. See text and Table II. 


only two benzene molecules. We find from the MC 
simulation that ~ 25% of the benzenes have the 
nearest neighbor within an angle between 0 and 
20°, corresponding to the parallel and parallel- 
displaced conformations of the dimer. Similarly, 
~ 15% have its nearest neighbor within an angle 
between 70 and 90°, corresponding to the T-shape, 
T-shape-displaced, and L-shape conformations. The 
average contribution of these "ordered" dimer 
conformations is compared, in Table II, with the 
result obtained using all dimers generated in the 
Monte Carlo simulation irrespective of the angle. 
Dimers alone are not able to give a good result for 
the red shift, but it is clear that those ordered 
dimer conformations make contributions that are 
significant. We note, in particular, that these paral¬ 
lel conformations give larger shifts than the per¬ 
pendicular ones. This seems to suggest that these 
ordered dimer conformations of benzene are of 
importance in understanding the spectroscopic red 
shift of the liquid phase. 
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TABLE II_ 

Summary of the average contribution of different 
dimer conformations to the calculated shift for 
the B 2u (tt - it*) absorption transition of 
liquid benzene . 3 


System 

N 

A£ ca | C , (cm ^ 

MC T-shape 

2 

-98 

MC T-shape-displaced 
and L-shape 

2 

-135 

MC parallel 

2 

-179 

MC parallel-displaced 

2 

-197 

MC dimer 

2 

-103 

MC supermolecule 

14 

-306 


a N is the number of molecules included in the quantum 
calculations. 


In the case of the herringbone structure with 14 
molecules, the calculated shift is too small, only 
— 60 cm -1 . This discrepancy, however, is not found 
in the similar structure of the solid. In this case the 
calculated result is -240 cm -1 in good agreement 
with the experimental value of -260 cm" 1 [20]. 

A possible deficiency of the use of the herring¬ 
bone structure shown in Figure 1(a) is that there is 
no benzene molecule in the center of the box to 
represent the "bulk." To look at this we have used 
the structures shown in Figure 1(b) for the herring¬ 
bone and the solid. The calculated results, how¬ 
ever, changed only from the previous — 60 to —100 
cm -1 showing only slight improvement. For the 
solid it changed from the previous - 240 to - 260 
cm" 1 , bringing the shift to a better agreement with 
the experimental result. 


Summary and Conclusions 

In this work we have calculated the red shift of 
the first absorption band of benzene in the con¬ 
densed phase. The herringbone structure used for 
the liquid gives a red shift that is only 15-25% of 
the experimental value. In turn, the use of a disor¬ 
dered liquid obtained from Monte Carlo simula¬ 
tion gives a red shift that is in very good agree¬ 
ment with experiments. In this disordered liquid 
most of the usual structures of the benzene dimer, 
such as the parallel, parallel-displaced, T-shape, 
T-shape-displaced, etc., are included naturally, but 
there is little long-range order. The calculated red 
shift for crystalline benzene is also in very good 
agreement with the experimental shift. 
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ABSTRACT: The Kohn-Sham density functional theory "exchange" potential v x (r) = 
8Ef s [ p]/8p(r), where Ef s [ p] is the "exchange" energy functional, is composed of a 
component representative of Pauli correlations and one that constitutes part of the 
correlation contribution to the kinetic energy. The Pauli term is the work done W x KS (r) in 
the field ^ x KS (r) obtained by Coulomb's law from the Fermi hole charge distribution 
constructed from the Kohn-Sham orbitals. The correlation-kinetic term is the work done 
W^Kt) in the field Z^(r) derived from the kinetic-energy-density tensor involving the 
first-order correction to the Kohn-Sham single-particle density matrix. The sum of these 
fields is conservative, so that the total work done is path-independent. There is no 
explicit correlation-kinetic contribution to the "exchange" energy Ef s [ p]. Its contribution 
is manifested via the Kohn-Sham orbitals generated via the potential v x (r). The functional 
Ef s [ p] is thus expressed in virial form entirely in terms of the Pauli field #^ s (r). In this 
article, we determine and study the structure of the correlation-kinetic component field 
Z^Kr) and work W^Or) for the nonuniform electron density system in atoms and at 
metal surfaces. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 893-906, 1997 


Introduction 

T he Kohn-Sham [1] (KS) density functional 
theory [2, 3] "exchange-correlation" energy 
functional Ef^[ p] of the density p(r) and its func- 
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tional derivative (potential) v xc (r) = 8Ef^[ p]/ 
8p( r) incorporate correlations due to the Pauli ex¬ 
clusion principle and Coulomb repulsion as well 
as the correlation contribution to the kinetic en¬ 
ergy. The functional and derivative also have a 
rigorous physical interpretation [4-7] in which they 
are described more insightfully in terms of these 
correlations. The interpretation involves a field 
r), which, in turn, is a sum of two fields. The 
first of these & xc (x) accounts for Pauli and Coulomb 
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correlations and is determined by Coulomb's law 
from its source which is the quantum mechanical 
Fermi-Coulomb hole charge distribution. The sec¬ 
ond Z f (r) accounts for the correlation-kinetic con- 
tribution, and its source is the difference between 
the kinetic-energy-density tensors for the 
Kohn-Sham noninteracting and Schrodinger inter¬ 
acting systems. The corresponding field is the 
derivative of the resulting tensor. The functional 
derivative v xc (r ) is the sum of the work W xc (r) and 
W t (r) done in the fields & xc (t) and Z f (r), respec¬ 
tively. Since the field &(x) is conservative, the sum 
of the work done is path-independent. The quan¬ 
tum mechanical exchange-correlation E xc [ p] and 
correlation-kinetic T c [ p] energy components of the 
functional E^ c s [ p] can also be expressed in virial 
form in terms of the respective fields % xc (r ) and 
Z f (r) which give rise to them. (As in Schrodinger 
theory, there is no self-interaction in the physical 
description [4, 5] of the Kohn-Sham theory elec¬ 
tron-interaction energy functional and its deriva¬ 
tive.) 

In a similar vein, the KS theory "exchange" 
energy Ef s [ p], and its functional derivative y x (r) 
= 8Ef s [ p]/8p( r), can also be described [8] in 
terms of a field representative of Pauli correlations 
and one representing part of the correlation- 
kinetic effect. Further, within the density func¬ 
tional theory representation [3, 9] of Hartree-Fock 
(HF) theory, it is also possible to define [5, 10] an 
"exchange" energy functional E X SHF [ p] and po¬ 
tential y^ F (r) = 8Ef SHV [ p]/8pix), whereby the FIF 
density and energy are obtained. This functional 
and derivative can also be described [4, 5, 10] in 
terms of a field representative of Pauli correlations 
as well as one representing the corresponding cor¬ 
relation-kinetic effects. Finally, in exchange-only 
[11, 12] density functional theory, the correspond¬ 
ing "exchange" energy functional £° PM [ p] and 
potential y° PM (r) are defined [11] as those of the 
optimized potential method [13] (OPM). Once 
again, this energy and potential can be described 
[14] in terms of a field representative of Pauli 
correlations and one representing kinetic effects. 
The sum of these fields is again conservative. In 
each of the above cases, the field which accounts 
for Pauli correlations is obtained by Coulomb's 
law from a Fermi hole charge distribution. The 
resulting potential is then the work done to move 
an electron in the field. This field and potential 
have been studied previously [6, 15-21] for atoms 
and metal surfaces and their structure for these 
nonuniform density systems understood. The defi¬ 


nitions of the corresponding kinetic components 
differ, but the fields and potentials are all derived 
in the same manner from kinetic-energy-density 
tensors. In this article, we determine and study the 
structure of the kinetic component of the KS "ex¬ 
change" potential as applied to atoms and metal 
surfaces. 

We begin in the next section by defining the 
"exchange" energy functionals and potentials for 
the cases discussed above. Although, in principle, 
the KS, HF, and OPM orbitals are different, in 
practice, their differences are minimal in that the 
structure of the corresponding local "exchange" 
potentials is essentially equivalent. In our calcula¬ 
tions on atoms (third section), we, therefore, as¬ 
sume the KS "exchange" potential and orbitals to 
be those of the optimized potential method. For 
the metal surface calculations (fourth section), we 
employ an accurate representation of the KS "ex¬ 
change" potential derived [18, 19] via restricted 
differentiation of the functional Ef s [ p]. This po¬ 
tential does, however, have the exact [22-24] 
structure of the KS "exchange" potential both in 
the asymptotic vacuum and metal bulk regions. A 
summary of the results and concluding remarks 
are made in the fifth section. 


Definitions of “Exchange” Energy 
Functionals and Potentials in Terms 
of Fields 

In this section, we define the "exchange" energy 
functional and potential in terms of fields within 
(a) the Kohn-Sham theory of the fully interacting 
system, (b) the Kohn-Sham theory representation 
of Hartree-Fock theory, and (c) the exchange-only 
Kohn-Sham theory. 

KOHN-SHAM THEORY 

In KS theory, the "exchange" potential v x (r), 
which is the derivative of the functional Ef s [ p], is 
[8] the sum of a Pauli W x KS (r) and correlation- 
kinetic W, (1) (r) component: 

8Ef s [ p] 

v x (r) = * - W KS (r) - W f (1) (r). (1) 

8p( r) 

Here, W x KS (r) is the work done to move an electron 
in the field ^ x KS (r) determined via Coulomb's law 
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from the Fermi hole charge pf s (r, r'X Thus, 

W x KS (r) = - ■ dV, (2) 

*co 


where 


pKS 

X 


M-J 


pf s (r,t')(r - r') 


dr'. 


(3) 


In turn, the Fermi hole pf s (r, r') at r' for an elec¬ 
tron at r is defined in terms of the idempotent 
Dirac density matrix y s (r, r') constructed with the 
KS orbitals ^(r) as 


p^ s (r,r') = — |y s (r, r')| 2 /2p(r), (4) 


where 


0. The expression for the exchange potential v x (r) 
of Eq. (1) is exact [8], because it incorporates all 
terms linear in the coupling constant. Higher-order 
perturbation theory leads to the correlation com¬ 
ponent of the KS energy and potential. 

There is no explicit correlation-kinetic contribu¬ 
tion [8] to the KS "exchange" energy Ef s [ p]. The 
contribution, however, is manifested indirectly via 
the KS orbitals t^(r), so that Ef s [ p], which is also 
related to its functional derivative ^(r) by the 
virial theorem [25], is expressed [4] entirely in 
terms of the field ef x KS (r) as 


[ p] 


Irr p(r)p*(r,r') 
?//——— ■**'■ 


r — r 


- j p( r)r-Vv x (r) dr, 


( 10 ) 

( 11 ) 


y s (r,r') = L</'*( r ) , Ai( r '), (5) 

i 


= J p(r)r • ^ KS (r) dr, (12) 


such that y s (r, r) = p(r). The work W, (1) (r) is that 
done to move an electron in the field Zi 1} (r): 

l c 


W f O>(r) = - fzf\r') ■ dV, (6) 

*^oo C 

where 



z(r,[y{]) 
p( r) 


(7) 


The field Z^\r) is further defined in terms of a 
field z(r;[y[J) whose component z a (r) is derived 
from the kinetic-energy-density tensor t af3 (r;[y{]) 
as 


z a {r,[y{]) = 2 £ —t aP (r;[y {]), (8) 

p=l uVp 


where 




1 

4 


dr' a dr; 


+ 


dr’pdrl j 


7i c (r',r")lr' = r"=r/ (9) 


and y x c (r, r') is the first-order correction to the KS 
density matrix y s (r, r') as obtained via perturbation 
theory by an expansion of the system wave func¬ 
tion in terms of the electron-interaction coupling 
constant. The sum of the work [W x KS (r) - W f (1) (r)] 
is path-independent since V x [i^ KS (r) - Zf\r)] = 


with 


j p(r)r * Z£ } (r) dr = 0. (13) 

KOHN-SHAM REPRESENTATION OF 

HARTREEFOCK THEORY 

The KS "exchange" energy functional Ef SHF [ p] 
which leads to the HF theory ground-state energy 
and density is defined as [4, 5, 10] 

Ef HF [ p] = E™[ p;y HF ] + T C HF [ p], (14) 

where E^ F [p;y HF ] is the HF theory exchange 
energy determined from the idempotent Dirac 
density matrix y HF (r, r') constructed from the HF 
orbitals 0^ F (r). [The definitions of these properties 
are the same as Eqs. (4), (5), and (10) with ^(r) 
replaced by <£h F (r).] The correlation-kinetic en¬ 
ergy T c hf [ p] is defined as 

T c HF [p] = T HF [ p ] - T s [ p], (15) 

where T HF [ p] is the HF theory kinetic energy, and 
T s [ p], the kinetic energy of noninteracting fermions 
obtained from orbitals of the KS equation which 
leads to the HF theory density and energy. 

The exchange energy E^ F [ p; y HF ] component is 
defined in terms of an electric field i^ HF (r) derived 
via Coulomb's law from the HF theory Fermi hole 
p^ F (r, r') in the same manner as Eq. (12). 
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The correlation-kinetic energy T C HF [ p] compo¬ 
nent is expressed in terms of a field Z FIF (r) as 

T C HF [ p] = - j drp(r)r • Z^ F (r), (16) 

where 

Z™(r) = [z(r; [y s HF ]) - z(r; [y HF ])], (17) 

the field z(r) is as defined by Eqs. (8) and (9), and 
y s HF (r, r') is the idempotent density matrix con¬ 
structed from the orbitals of the KS equation which 
leads to the HF density and energy. 

The KS exchange potential i^ F (r) which gener¬ 
ates the HF theory density and energy is 

££KSHFr l 

0™(r) = * >.: - = W HF (r) + W, HF (r), (18) 

5p(r) 

where W x HF (r) and W/ 1F (r) are, respectively, the 
work done in the fields Jf x HF (r) and Z[^ F (r). [See 
Eqs. (2) and (6).] Once again, the sum of the work 
[ W X HF (r) + W^ HF (r)] is path-independent since V x 

[ar™(r) + Z*f (r)] = 0. 

KOHNSHAM EXCHANGE-ONLY THEORY 

The exchange-only [11, 12] version of KS theory 
corresponds to the optimized potential method 
[13] (OPM). The resulting exchange energy 
E x OPM [ p;% OPM ] an d potential i?° PM (r) are deter¬ 
mined by a self-consistent solution of the OPM 
integral and differential equations. The exchange 
energy is obtained through Eq. (10) via the Fermi 
hole p° PM (r,r') or density matrix y s OPM (r, r') con¬ 
structed from the OPM orbitals </>P PM (r). The po¬ 
tential z;° PM (r) may be expressed as [14] 

i£ PM (r) = W x ( r) + W t (x), (19) 

where W x (r) is the work done in the field £f x (r) 
determined via Coulomb's law [see Eq. (2)] from 
the Fermi hole p x (r, r') constructed from the or¬ 
bitals </>™ F (r) of the Work Formalism [21, 26] (WF) 
differential equation. [This equation is the same as 
the KS differential equation with v xc (r) replaced by 
W x (r).] With y s WF (r, r'), the corresponding idempo¬ 
tent density matrix, the work W f (r) is that done in 


the field Z,(r), where 

l s 

z ,, <r) -^)K r: [T,°™])-z(r ; [r, WF ])] GO) 

and where the field z(r) is as defined previously in 
terms of the appropriate kinetic-energy-density 
tensor. [See Eqs. (8) and (9).] In this exchange-only 
case, the virial theorem [25] leads to the energy 
relation 

-£,”[(>;r, WF ] 

= j p(r)r • Z t (r) dr. (21) 

We note that in the derivation of Eq. (19) it is 
assumed that the OPM and WF densities are 
equivalent, which is essentially the case for both 
finite and extended systems. However, the off-di¬ 
agonal elements of y s OPM (r,r') and y s WF (r, r') are 
not assumed to be equivalent so that the field 
Zdr) is finite. Furthermore, the work W x (r) and 
W t (r) are separately path-independent. 

To summarize, the exchange potential of KS 
theory [Eq. (1)] and that of its representation of HF 
theory [Eq. (18)] and of the OPM [Eq. (19)] are all 
composed of two components. The first, represen¬ 
tative of Pauli correlations, is the work done in a 
field determined by Coulomb's law from the Fermi 
hole charge constructed with the corresponding 
orbitals. The second, representative of correlation- 
kinetic effects, is the work done in a field deter¬ 
mined from a corresponding kinetic-energy-den¬ 
sity tensor. 


Structure of Correlation-Kinetic Field 
and Potential in Atoms 

To determine an accurate representation of the 
correlation-kinetic field Z^\x) and potential 
W f (1) (r) in atoms, we assume the KS exchange 
potential z; x (r) to be that of the optimized potential 
method z^ PM (r). We then determine the work W x (r) 
by a self-consistent solution of the Work Formal¬ 
ism differential equation. In other words, we as¬ 
sume the "exchange" problem to be defined by the 
case described by Eq. (19). We, however, refer to 
the potentials z?!p PM (r) and W x (r) as the KS ex¬ 
change potential t; x (r) and its Pauli component 
W x KS (r), respectively, since, as noted, the difference 
between the two sets of potentials is minimal. The 
differences [Vz? x (r) - ^ x KS (r)] and [u x (r) - W x KS (r)] 
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are then the correlation-kinetic field and 

potential W f (1) (r), respectively. 

In Figures 1 and 2, we plot the Pauli component 
field <^ KS (r) due to the Fermi hole charge for Ne 
and Ar atoms. Note the shell structure as exhibited 
by the field and the fact that it exists primarily 
within the K-shell, becoming less significant in the 
outer shells, decaying asymptotically as -1/r 2 . In 
Figures 3 and 4, we plot the corresponding correla¬ 
tion-kinetic component field Z f (1) (r) for these 
atoms. In these graphs, we also indicated the ra¬ 
dial probability density, so that it is evident that 
the field exists principally in the intershell 

region and is oscillatory there. Furthermore, the 
magnitude of this field is greatest in the region 
between the K and L shells, becoming much 
smaller in the outer in ter shell regions. The field 
Z t (1) (r) is also short-ranged in comparison to the 
Pauli field lf x KS (r), vanishing at about the com¬ 
mencement of the outermost occupied shell. It is 


also interesting to note that there is a small compo¬ 
nent of this field present in the interior of the 
atom. Finally, observe that the field Z f (1) (r) is much 
smaller, about a fifth of the magnitude of the Pauli 
field ^/ KS (r). 

In Figures 5 and 6, we plot the Pauli W/ S (r) 
and correlation-kinetic W, (1) (r) components of the 
KS "exchange" potential v x (r) for Ne and Ar, 
respectively. As known [6], the Pauli component 
W/ S (r) exhibits a shell structure and is monotonic 
with a positive slope since the field s (r) is 
negative throughout space. It decays asymptoti¬ 
cally as - 1/r in the classically forbidden region 
as does v x (r). Since the correlation-kinetic field 
ZP>(r) is oscillatory and exists in the intershell 
regions, the corresponding potential W/ 1} (r) ex¬ 
hibits a dip in these regions, vanishing well within 
the outermost occupied shell. It is this correlation- 
kinetic component which then gives rise to the 
well-known bumps of the KS potential v x (r) in the 
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intershell regions. Again, we note that there is a 
small component of this potential in the interior of 
atoms. Finally, observe that W/ a) (r) is an order of 
magnitude smaller than the Pauli potential W x KS (r). 
Thus, the correlation-kinetic effects contributing 
to the KS potential v x (r) are small. As such, it is 
justified to represent the potential v x (r) in finite 
systems by its Pauli component W x KS (r). The high 
accuracy of this approximation with regard to total 
ground- and excited-state energies, ionization po¬ 
tentials, and electron affinities has been demon¬ 
strated in the literature [4, 5, 15, 21, 26]. 


Structure of Correlation-Kinetic Field 
and Potential at Metal Surfaces 

For the jellium [27] and structureless pseudopo¬ 
tential [28] models of a metal surface, there is 
translational symmetry in the plane parallel to the 


surface and the effective potential of the electrons 
is local. Thus, the KS orbitals are of the form 

»A k (r) = e ik i' x «cf> k (x), (22) 

where (k^Xy) are the momentum and position 
vectors parallel to the surface, and (k, x), the com¬ 
ponents perpendicular to it. The metal is assumed 
to occupy the negative half-space. For these mod¬ 
els, the exact asymptotic structure of the KS "ex¬ 
change" potential v x (r) and of its Pauli component 
W x KS (r) in the classically forbidden vacuum region 
has been determined [20, 22-24]. The analytical 
expressions for these structures which are valid for 
fully self-consistently determined orbitals show both 
potentials to decay in an -x _1 image-potential-like 
manner, but with different decay coefficients which 
depend upon the metal Fermi energy and surface 
barrier height. Thus [see Eq. (1)], the asymptotic 
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FIGURE 3. The correlation-kinetic component field Z/ 1} (r) for the Ne atom. The radial probability density is plotted as 
the dashed line. 


structure of the correlation-kinetic component 
W/^Cr) is also known exactly to decay as 


/3) 

W«(x) - fc 

l C 


(23) 


oo 


where the decay coefficient is 




p 


2/3 


1 - 2 


j8 2 - 2 

/3 2 




4 — 3j8 2 

X (1 H-=— ln( /3 2 — 1) 

J3 


(24) 


to be exactly 


= — k F / 7T . 

-*> — 00 


For the KS ''exchange" potential v x (r), we em¬ 
ploy an accurate representation of it derived [18, 
19] by restricted differentiation of the energy func¬ 
tional Ef s [ p]. The resulting expression for 

the potential which is in terms of the density p(r) 
and Slater potential [29] V/Or) is 


y^ 0) (r) = 


1 d 
2dp{t)/dk T dk F 


(p(r)V/(r)), (25) 


where V x (r) is defined as 


and where /3 2 = W/e f , with W the barrier height, 
e F = fcp/2 the Fermi energy, k F = l/ar s the Fermi 
momentum, a -1 = (97 t/ 4) 1/3 , and r s the Wigner- 
Seitz radius. The asymptotic value of the KS "ex¬ 
change" potential in the metal bulk is also known 


r pf s ( r,r') 

V x s (r) = f-p- —dr'. (26) 

Thus, the potential i4 0) (r) incorporates the intrinsic 
nonlocality of the problem through the dynamic 
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FIGURE 4. The correlation-kinetic component field Z^(r) for the Ne atom. The radial probability density is plotted as 
the dashed line. 


Fermi hole charge. It also possesses [18, 22, 24] the 
correct asymptotic structure of v x (r) in both the 
asymptotic vacuum and metal-bulk regions. 

Now for the orbitals of Eq. (22), the expression 
for the Slater potential in dimensionless coordi¬ 
nates normalized to the Fermi momentum is 
[22-24] 

V x s (z) 4 

(3k F /2lT ) 7 Tp n (z) 

X f 1 dk' f k 'dk<Pt(z)<t> k ,(z)G(k,k'; z), (27) 
•'o ■'o 

where 

G(k, k'; z) = 7tA' 2 A ^ dqj{q,z) 

J o 

+ f A+A dqj(q , z){S x (q ) + S y (q)}, 

•'A-A' 


/ oo 

dz'e-^- z '^ k {z')<j>l{z’), 

— 00 

( X 2 — X 2 ) 1//2 

S k iq) = A 2 tan” 1 -—*- X a (A 2 - X A 2 ) 1/2 , 

a a 

S y (q) = S a (^)Ia-. y, 

1 ( A 2 - A' 2 \ 

X A,y(^ = ~ 9 ±--- 

z \ 7 / 

and where p n (z ) is the density normalized to the 
bulk value p = fc|/37r 2 : 

p„(z) = 3 r 1 (1 - k 2 )\<t> k (z)\ 2 dk. (28) 
J o 

In these equations, A = (1 — fc 2 ) 1/2 , A' = (1 — 
fc' 2 ) 1/2 , and z = k F x. The expression for the Pauli 
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0.01 0.1 1 10 

r (a.u.) 


FIGURE 5. The Pauli W x KS (r) and correlation-kinetic W t ^(r) potentials for the Ne atom. 


component field X’, k "'(r) is [20] 


The work done l'V KS ( 2 ) is then 


^ KS (z) 4 

Okj/ln) vp„(z) 


W KS (z) r z gL KS (z') 

—--= - / —-- dz' 

(3k F /2n) L (3kj/2w) 


(30) 


x C dk' \ k 'dk4>*{z)<t> k ,{z)H{k,k'-,z) l (29) 
•'0 J o 

where 

H(fc, k'-, z) = 7tA' 2 f (A A } dqqM(q,z) 

J o 

+ /* A+A dqqM(q, z){S K (q) + S x .{q)}, 
J x-y 

M(q, z) 

/ CO 

dz’sgniz - z t )e~^ z ~ z ^(f) k (z , )(f)p(z'). 

— 00 


For orbitals generated by model effective poten¬ 
tials, the spatial integrals J(q, z) and M(q, z) above 
can be determined analytically. Thus, for such 
orbitals, the Slater potential V x s (z) and the field 
rf s (z) are then entirely in terms of finite-region 
momentum-space integrals and therefore easily 
determined for ^//-electron positions throughout 
space. 

The expressions for the Slater potential V x s (z) of 
Eq. (27) and of the work W x KS (z) of Eq. (30) are 
valid for arbitrary orbitals (f> k (z ). In our calcula¬ 
tions, we employ those of the jellium and finite- 
linear-potential model [30] of a metal surface. The 
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FIGURE 6. The Pauli W x KS (r) and correlation-kinetic W t { ^(r) potentials for the Ar atom. 


advantage of employing these orbitals is that they 
are semianalytical and lead to results essentially 
equivalent [17] to those of fully self-consistent cal¬ 
culations. Furthermore, as noted above, the inte¬ 
grals J(q, z) and M(q, z) can then be obtained 
analytically. The KS effective potential for this 
model is 

v e// (z) = Fz[6(z) - 0(z - z b )] + W0(z - z b ), 

(31) 

for which the orbitals <f> k (z ) are 

<f> k (z) = sin[A:z + 8(k)]d( — z) 

+ [B k Ai{£ k ) + C k Bi(£ k )] 
x[0(z) - 0(z - z b )] 

+ D k exp(-K k z)0(z - z b ), (32) 


where k = ]/2E , K k = V2(W — £), l k = zz F l/3 — 
U, U = k 2 z\ /3 f F = (kj/2)/z F , z F = lkj/2)z h /W, 
E is the energy, W is the barrier height, and 
Ai( £ k ) and Bi( £ k ) are the Airy functions. The phase 
factor 8(k) and the coefficients B k , C k , and D k are 
determined by the requirement of continuity of the 
wave function and its logarithmic derivative at 
z = 0 and z = z h . 

The correlation-kinetic component field Z\ l \z) 
and potential W^Kz) are determined in a manner 
similar to that for atoms, except that the same 
orbitals cf> k (z) are employed in the calculation of 
the potentials vfKz) and W x KS (z). In Figures 7 and 
8, we plot for Li metal (r s = 3.24) the Pauli lf x KS (z) 
and correlation-kinetic Z^Xz) fields, respectively, 
employing the orbitals of Eq. (32). The relationship 
between r $ and j3 is determined via energy mini¬ 
mization in the local density approximation. The 
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z (in Fermi wavelengths) 


FIGURE 7. The Pauli component field ^ x KS (z) at the surface of the Li metal. 


corresponding value of /3 = 1.246. (See Appendix 
of [17].) [The field &f s (z) is also plotted elsewhere 
[16, 17, 20] for r s = 2.0 and 6.0. For purposes of 
completeness, we include it here for Li metal.] 
Observe that both fields are concentrated in the 
surface region and possess the requisite Bardeen- 
Friedel [27] oscillations in the metal bulk. How¬ 


ever, as in the atomic case, the correlation-kinetic 
field is an order of magnitude weaker than is the 
Pauli field. Observe that both fields decay many 
Fermi wavelengths into the vacuum region. (For Li 
metal, one Fermi wavelength is 5.61 A.) The cor¬ 
rectness of the asymptotic structure of the field 
Z f (1) (z) in the vacuum region is demonstrated in 



z (in Fermi wavelengths) 

FIGURE 8 . The correlation-kinetic field ZP(z) at the surface of the Li metal. 

L c 
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z (in Fermi wavelengths) 

FIGURE 9. Comparison of the asymptotic structure of the correlation-kinetic field Z/JMz) at the surface of the Li metal 
with its exact analytical asymptotic structure o^ } ( fi) Iz 2 . 


Figure 9 where the exact analytical structure 
a^Kp)/z 2 is also plotted. The calculated and ex¬ 
act structures can be observed to have merged at a 
distance of about four Fermi wavelengths from the 
surface. 


In Figure 10, we plot the approximate KS ex¬ 
change potential vfKz) and the work done W X KS ( z). 
The potential vf\z) which possesses the exact 
[22-24] asymptotic structure of the KS exchange 
potential v x (z) in the vacuum region achieves this 



z (in Fermi wavelengths) 


FIGURE 10. The approximation \/ x (0) (z) to the Kohn-Sham exchange potential as determined by Eq. (25). The Pauli 
component W x KS (z) of Eq. (30) is also plotted. 
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limit by about one Fermi wavelength from the 
surface. As is evident from the figure, it also has 
the correct limiting value of -2/3 (in units of 
3k F /2ir ) of v x {z) in the metal bulk. The asymp¬ 
totic structure of W t KS (z) in the vacuum differs 
from that of the KS potential v x (z) as has been 
shown [20] analytically. The limiting value of 
W/ S (z) in the metal bulk is also different [20], 
being -0.757 (in units of 3k F /2ir) for Li metal. 
The resulting correlation-kinetic potential compo¬ 
nent W^ X) (z) is plotted in Figure 11. Note that this 
potential is entirely positive and possesses the 
correct asymptotic structure of Eq. (23) in the vac¬ 
uum. The potentials vf\z), W/ S (z), and W^ 1} (z) 
all exhibit the Bardeen-Friedel oscillations within 
the metal. Once again, the potential W f (1) (z) is an 
order of magnitude smaller than W x KS (z). How¬ 
ever, as the asymptotic structure of the KS poten¬ 
tial v x (z) and its Pauli component W/ S (z) differ 
in both the vacuum and metal bulk regions, the 
significance of the correlation-kinetic term W t (1 \z) 
should be greater in the metal surface case than it 
is for atoms. 


Conclusions and Summary 

The KS "exchange" potential v x (r) is composed 
of a term W/ S (r) representative of Pauli correla¬ 
tions and another W f (1) (r) which constitutes part of 


the correlation-kinetic effects. These components 
of the potential v x (r) arise, respectively, from fields 
^ KS (r) and Z^ X) (r) representative of these correla¬ 
tions. In this article, we studied the structure of the 
correlation-kinetic field Z ( f X) (r) and potential 
W^ X) (r) as applied to finite atomic and extended 
metal surface systems. For these systems, this field 
and potential are about an order of magnitude 
smaller than the corresponding Pauli components. 
In atoms, the field Zf-\r) exists principally in the 
intershell regions and is oscillatory there. This then 
gives rise to a trough in the resulting potential 
W f ^(r) in these regions. Since the Pauli component 
W^ KS (r) is monotonic with a positive slope every¬ 
where, the intershell bumps of the KS potential 
v x (r) can now be understood to be entirely a conse¬ 
quence of correlation-kinetic effects. Although 
there is a small component of the field Z ( f X) (r) in the 
deep interior of atoms, this field is short-ranged , 
vanishing within the last occupied shell. The 
asymptotic structure -1/r of v x (r) in the classi¬ 
cally forbidden region is therefore due entirely to 
its Pauli component W x KS (r). Since the field Z^ !) (r) 
in atoms is small and short-ranged, the KS poten¬ 
tial v x (r) can be approximated accurately by its 
Pauli component W/ S (r). This is borne out in the 
literature [4, 15]. For example, total ground-state 
energies of atoms with this approximation lie 
within 25 ppm of those of the OPM, being within 5 
ppm for atoms heavier than Kr. 



z (in Fermi wavelengths) 

FIGURE 11. The correlation-kinetic component W t { Hz) of the approximate Kohn-Sham exchange potential u x (0) (z). 
The function of a^( fi) /z is also plotted. 
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At a metal surface, the correlation-kinetic field 
Z ( f 1} (r) is concentrated mainly in the surface region. 
The field, however, is long-ranged. It decays 
asymptotically in the vacuum as [20] a^/x 2 , with 
the coefficient a { / } dependent upon the metal Fermi 
energy and surface barrier height. It also extends 
deep into the metal, exhibiting the requisite 
Bardeen-Friedel oscillations there. The resulting 
correlation-kinetic potential W^ 1} (r) is therefore 
also long-ranged in the vacuum with a structure of 
(a^/x). Further, it too exhibits the Bardeen- 
Friedel oscillations in the metal and has a finite 
positive limiting value in the metal bulk. The Pauli 
component W x KS (r) is also long-ranged [20], with 
an — x _1 image-potential-like structure in the vac¬ 
uum similar to that of the KS potential v x (r), but 
with a decay coefficient that is different. In the 
metal bulk, W/ S (r) also has a finite (negative) 
limiting value which is close to but less than that 
of v x (r) which is -k ¥ /ir. Thus, although the con¬ 
tribution of p t (r) to properties such as the surface 
energy and work function will be due principally 
to its Pauli component, we expect the 
correlation-kinetic contribution in this instant not 
to be insignificant. The long-range of both the 
Pauli and correlation-kinetic components of v x (r) 
in the vacuum is also of significance from the 
perspective of various surface probes such as the 
scanning-tunneling microscope as well as for the 
interaction of electrons, ions, and positrons with 
the surface. 
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ABSTRACT: Density functional theory for the pair potential in liquid Na near melting 
is first compared with that extracted by liquid structure inversion. All the main features 
are in quantitative agreement. Quantum chemical representations of metallic crystalline 
energies are then treated as a function of local coordination number and near-neighbor 
distance. K and Cu are especially referred to, in different lattice structures. Finally, the 
Tersoff potential based on density functional theory is discussed in relation to the 
cleavage force in ideal crystalline Si, and is related to bond breaking. © 1997 John Wiley 
& Sons, Inc. Int J Quant Chem 65: 907-917, 1997 


Background 

T he search for force fields in condensed matter 
goes back to at least the so-called rigid-ion 
model in the 1930s. The idea was, say for a simple 
metal crystal like Na or Al, to settle on a model for 
the charge density to associate with a particular 
nucleus, the superposition of which on each nu¬ 
cleus would lead to the periodic density in the 
crystal. But the screened ion was to be chosen 
judiciously so that as the ions vibrated about their 
equilibrium lattice positions the blobs of localized 
charge density moved rigidly with the nuclei. This 
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was understood at that time to imply pair forces 
between nuclei. This idea was formalized in the 
1960s through pseudopotential theory. The essen¬ 
tial equation embodying these approaches can then 
be written for the internal energy E as 

£ = X>(r l7 ,n). (1) 

icj 

Thus, the energy is divided into two parts: a vol¬ 
ume-dependent part E n , with fl denoting specifi¬ 
cally the atomic volume, and a pair potential con¬ 
tribution (f>, which is itself density-dependent as 
explicitly noted in writing Eq. (1). Clearly, Eq. (1) 
is valuable for comparing, say, crystalline phases 
in which atomic rearrangements are made at a 
constant volume. In the next section, density func¬ 
tional theory (DFT) will be used, in the manner of 
Perrot and March [1], to obtain the pair potential 
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4>(r) in the liquid phase of metallic Na, just above 
its freezing point. Reference will also be made, but 
now briefly, to both K and Be. However, emphasis 
will be placed on the fact that, for Na, contact can 
be established with an "experimental potential" 
derived by inverting the measured liquid structure 
factor S(q), derived from either X-ray or neutron 
diffraction experiments. To date, this has not 
proved possible for K and Be, although, in the 
former case. X-ray diffraction data are available in 
the work of Greenfield et al. [2]. 

Attention will then move in the third section to 
a different type of representation of the ground- 
state energy of crystals, namely, through direct 
appeal to the dependence on local coordination. 
This approach goes back at least to the formulation 
of Poshusta and Klein [3] and was worked out in 
the alkali metals by Malrieu et al. [4]. The specific 
form to be discussed in the third section follows 
directly the work of March et al. [5]. Instead of Eq. 
(1) for a crystal with a coordination number and 
near-neighbor distance r 0 , one writes 

E(c, r„) - (2) 

where it is of the essence of the method that R(r 0 ) 
and g(r 0 ) are determined explicitly by free-space 
dimer potential-energy curves. The detailed forms 
to be chosen for the functions R, g, and / are 
discussed in the third section. Then, in a similar 
vein, the directionally bonded Si crystal will be 
discussed, the example taken being the Tersoff 
potential [6] which is fitted to the density func¬ 
tional calculations of Yin and Cohen [7]. The type 
of description used in the third section is, in fact, 
characteristic of the so-called glue models of inter¬ 
atomic forces used by solid-state theorists. The 
Tersoff potential will be illustrated specifically by 
using it to calculate the cleavage force in Si, fol¬ 
lowing the very recent work of Matthai and March 
[8]. We must note here, however, that there is 
already a body of work in which high-precision, 
all-electron, local spin density approximation 
(LSDA) calculations on films and slabs (especially 
metals) exist, from which cleavage energies can be 
deduced. At least some of these calculations have 
reported what amounts to the first derivative of 
the energy with respect to the interplanar distance 
(see, e.g., Boettger et al. [9], Rosch et al. [10], Smith 
et al. [11], Gay et al. [12], and others). 

Then, in the fourth section, some attention is 
given, reverting back to the type of philosophy 


underlying the old rigid-ion model, to the precise 
formulation of lattice dynamics in relation to force 
fields, via DFT. Since phonon dispersion relations 
turn out to be described in terms of two densities, 
obviously, that of the perfect periodic crystal and 
also the change due to "freezing in" a phonon, one 
cannot only formulate a correct "vector rigid-ion" 
model in lattice dynamics but also one can, but 
now in the elastic region only, formulate the cleav¬ 
age force in terms of phonon dispersion relations, 
as in the original work of Zaremba [13], which was 
extended by Kohn and Yaniv [14]. With this as 
background to the present review, let us turn 
immediately to discuss the calculation of the pair 
potential entering Eq. (1) for metallic Na in the 
liquid phase, just above its freezing point. 


Density Functional Theory (DFT) of 
Pair Potentials in Simple sp Liquid 
Metals, Especially Na Near Freezing 

The treatment below follows closely that of Per- 
rot and March [1]. These workers adopted, essen¬ 
tially, within DFT, the philosophy of the rigid-ion 
model referred to above. Their building block was 
therefore the screening charge around a single 
Na + ion plunged into a bath of conduction elec¬ 
trons precisely equal to that in liquid metal Na 
near freezing. This screening charge was calcu¬ 
lated by DFT, from the Kohn-Sham equations 
using a local density approximation (LDA). Figure 
1 shows that the total charge Q(r) enclosed within 
a sphere of radius r centered on the Na nucleus is 
approaching the asymptotic value of unity at large 
r. This is a requirement imposed by the fact that 
long-range electric fields cannot exist in a metallic 
medium. The so-called Friedel oscillations in the 
screening charge are in evidence; they fall off 
grossly as cos(2 kjr)/r 3 , with 2kj denoting the 
diameter of the Fermi sphere, determined, in turn, 
by the mean conduction electron density into which 
the metal ion Na + is plunged. 

Perrot and March then assumed that the 
ground-state electron density of the two-center 
problem is the superposition of the one-center den¬ 
sities. They thereby calculated the change in ki¬ 
netic energy, A T(R) say, as the pair of screened 
ions brought from infinite separation to the speci¬ 
fied internuclear separation R. This change in ki¬ 
netic energy is naturally calculated from the 
Kohn-Sham wave functions. One also must calcu- 
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FIGURE 1. Integrated displaced charge Q(r) contained 
with a sphere of radius r centered on a Na + ion 
embedded in an initially uniform electron gas of density 
equal to the conduction electron density in liquid metal 
near freezing. Note that (a) Q{r) -> 1 for perfect 
screening of the monovalent ion Na + in this conducting 
medium and (b) the corresponding pair potential 4>(r) 
given by Perrot and March [1] have a prominent turning 
point at the position marked by the arrow (see also 
Fig. 2). 

late the potential energy change A U(R) in assem¬ 
bling the pair from infinite separation. This latter 
quantity is, in fact, determined solely by the total 
charge Q(r) defined above. 

To be more specific, Perrot and March showed 
from DFT based on the superposition density that 
if one denotes the kinetic energy change A T(R), 
plus exchange and correlation energy change in 
LDA, by the sum A G(R), then the basic equation 
for the pair potential has the structure 

= AG(R) + AIKjR). (3) 

As already remarked, the quantity A U(R) is speci¬ 
fied uniquely in terms of Q(R). The reader requir¬ 
ing fuller details is referred to the articles of Perrot 
and March [1]. Their pair potential obtained using 
LDA for the exchange-correlation potential V xc (r ) 
in the Kohn-Sham equations is shown for Na 
liquid metal at freezing in the upper curve at large 
r in Figure 2. We note briefly that a similar pair 
potential was derived for divalent liquid Be metal 
[1] and subsequently also for liquid metal K [15]. 

COMPARISON OF DFT PAIR POTENTIAL 
FOR Na WITH DIFFRACTION POTENTIAL 
FROM LIQUID STRUCTURE FACTOR 
INVERSION 

The purpose of this section is to outline the way 
in which this DFT pair potential for liquid Na can 
be brought into direct contact with an "experimen¬ 



FIGURE 2. Pair potentials c p(r) in liquid Na just above 
freezing point, in units of k B T m = /3 “ 1 , with T m the melting 
temperature. Upper curve at large r: DFT 0(r); lower 
curve at large r: “diffraction” potential obtained by Reatto 
[17] by inverting liquid structure factor S(q) measured 
using X-rays by Greenfield et al. [2]. 


tal" diffraction potential. The idea of inverting the 
experimentally measured liquid structure factor 
S(q) goes back to the proposal of Johnson and 
March [16]. These workers pointed out that a pair 
potential <£(r) could be extracted, at least in prin¬ 
ciple, from the force equations of the BGY classical 
statistical mechanical hierarchy. If we write for the 
(classical) nuclei in a liquid metal near freezing the 
pair distribution function g(r u ), essentially, the 
Fourier transform of the measured structure factor 
S(q), in the Boltzmann form 

g(r 12 ) = exp(-U(r u )/k B T), (4) 

where U(r 12 ) is the potential of mean force and k B 
is Boltzmann's constant, then the force equation 
reads 


dU(r 12 ) 

dr 1 


<?</> 0 12 ) 

dt 1 


r g (3) (r 1/ r 2 ,r 3 ) <?<£(r 13 ) J 

/ -7—T- dr 

J g(r n ) dty 


(5) 


This equation is readily interpreted. The left-hand 
side is evidently the total force on atom 1. The 
right-hand side expresses this as a sum of two 
parts: The first is the direct pair interaction from a 
second atom at distance r 12 from atom 1. The 
second term comes from the force exerted by an 
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atom at r 3 , weighted with the probability g 3 /g, 
where g (3) is the three-atom correlation function, 
while g(r 12 ) in the denominator takes care of the 
fact that there is certainly a second atom at dis¬ 
tance r n from atom 1. In early work [16], g (3) had 
to be approximated (e.g., as a product of three 
g(r)'s), and this was uncontrolled. However, Reatto 
et al. [17] brought the scheme to full fruition for 
Na by using computer simulation to bypass the 
need to approximate the three-particle correlation 
function, and their pair potential for liquid Na 
near freezing is compared with the density func¬ 
tional pair potential of Perrot and March [1] in 
Figure 2. Although small quantitative differences 
exist, it is remarkable that all the main features of 
the diffraction potential are reflected in the DFT 
4>(r). This is made plain in Table I, where the 
principal features of the DFT potential are com¬ 
pared with those of the diffraction potential. We 
stress that there were no adjustable parameters in 
the DFT calculation, the input merely being the 
valence of unity for Na, the experimental density 
of the liquid at freezing, and, of course, the LDA 
approximation to V xc (r ) inserted into the 
Kohn-Sham equations. 

It is worth adding here that the asymptotic form 
of 4>(r) has subsequently been studied further in 
the work of Blazej and March [18]. These workers 
took the "spherical" blobs of screening charge 
around the pair of ions at separation R and al¬ 
lowed the blobs to float off the nuclei, although 
remaining undeformed. They then show that a 
correction to the usual asymptotic form of 4>(r) 
results, to yield the result 

A cos(2Ayr + a) B 

<£0) = -- r -> 0. (6) 

In Eq. (6), A denotes the amplitude of the Friedel 
oscillatory form and a is a phase shift. Blazej and 
March displayed the form of the next term as 
proportional to r" 4 . However, more work has to 


be done on the range of validity of Eq. (6), which 
probably will come into its own at larger separa¬ 
tion r than is plotted in Figure 2. The generaliza¬ 
tion of Eq. (6) to forces between a pair of atoms 
outside a metal surface has been given by Blazej et 
al. [19]. 

We turn from this discussion of pair potentials 
within the framework of Eq. (1) to the very differ¬ 
ent quantum chemical model expressed in Eq. (2). 
Of course, the present use of Eq. (2) is for crystal 
lattices with different local coordination c, while a 
merit of Eq. (1) in the above context was that it 
could be applied not only to long-range ordered 
assemblies but also to condensed metallic phases 
with short-range order. 


Coordination Number-dependent 
Force Fields 

COLD METAL CRYSTALS 

March et al. [5] very recently used a near¬ 
neighbor Heisenberg Hamiltonian approach, based 
on results of Malrieu et al. [4] for alkali and noble 
metals to set up a coordination number-dependent 
pair potential <£(r, c), with c denoting the coordi¬ 
nation number. The Heisenberg Hamiltonian used 
is basically characterized by properties of the free- 
space dimer, say Na 2 in the alkalis or Cu 2 in the 
noble metal. Both ground-state and triplet-state 
potential-energy curves are, however, now in¬ 
volved—the latter presenting some practical diffi¬ 
culties for DFT, which is still mainly useful as a 
ground-state theory (see, however, Theophilou 
[ 20 ]). 

Let us follow these brief comments by summa¬ 
rizing the quantitative results of Malrieu et al. [4] 
for three (say alkali) metal structures: fee, bcc, and 
simple cubic (sc). One then has for the cohesive 
energy E c (r 0 ) at near-neighbor distance r 0 , with 


TABLE I_ 

Principal features of DFT potential compared with “experimental” diffraction potential for liquid Na 
near freezing. 


Positions of turning 

Principal 

First 

Second 

First 

points and nodes (A) 

minimum 

maximum 

minimum 


DFT a <Mr) 

3.73 

5.67 

7.37 

3.20 

Diffraction 13 </>(r) 

3.9 

5.76 

7.44 

3.3 


a Perrot and March [1]. 
b Reatto [17]. 
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the coefficients of R and g in Eq. (2) taken from 
[4]: 

(i) fee(c = 12) : E u (r 0 ) = 6 R(r 0 ) - 4 g(r 0 ) (7) 

(ii) bcc(c = 8) : E 8 (r 0 ) = 4 R(r 0 ) - 4 g(r 0 ) 

(iii) sc(c = 6) : E 6 O 0 ) = 3 R(r 0 ) - 3g(r 0 ) 

Within the philosophy of this work (compare 
also the Tersoff work [6] on Si), March et al. [5] 
effected an approximate generalization of Eqs. (7) 
to read 

E c (r 0 ) = 2 cR(To) ( g ) 

The form of the first term on the right-hand side of 
Eq. (8) may be motivated as a simple additive sum 
of the interactions between nearest-neighbor pairs. 
The "correction" term involving the coordination 
number-dependent function /(c) is less easy to 
justify, although there are formulas given in Mai- 
rieu et al. accurate through second-order of pertur¬ 
bation theory. In Eq. (8) as applied to metallic Na 
say, R(r 0 ) is the triplet 3 X U potential curve of the 
free-space dimer, while g(r 0 ) represents the "ex¬ 
change" part. g(r 0 ) is precisely one-half of the 
difference between the triplet 3 X M and the singlet 
x X^ potential-energy curves. March et al. noted 
that, given these potential-energy curves (say from 
DFT), the procedure set out fully by Carlsson et al. 
[21] can then be used to derive a pair potential 
</>(r, c) having the shape 

4> c (r) = c//r) -/(c)/ 2 (r). (9) 

Concerning the function /(c), it is relevant here 
to note, following March et al. [5], that Robertson 
et al. [22] took a fixed near-neighbor distance r 0 
equal to that in fee Al and performed DFT calcula¬ 
tions as a function of the coordination number. 
They can fit their results for the energy E(r 0 ) by 
writing 

E(r 0 ) = cA x + c 1 / 2 A 2 . (10) 


While it must not be assumed that the c x/1 fit will 
work away from their r 0 , the sublinear depen¬ 
dence of /(c) in Eq. (8) is already evident (but now 
for Na say) from Eq. (7). 

DFT AND TERSOFF POTENTIAL FOR Si 

We turn from sp metals, with essentially nondi- 
rectional bonding, to covalently bonded Si. The 
focus will be the work of Tersoff [6], who pre¬ 
sented convincing evidence that the atomic coordi¬ 
nation number c is the main variable determining 
the now directional bonding properties of Si in 
different structures. 

The central idea that the bond order, a measure 
of bond strength, depends upon the local environ¬ 
ment can already be found in the study of Abell 
[23]. He argued that an atom with numerous 
neighbors makes weaker chemical bonds than an 
atom will with only a few neighbors. For more 
recent work on bond-order potentials, the reader 
may refer to Alinaghian et al. [24], where extensive 
references are given. 

What we must stress in the present context is 
that the above ideas have been made fully quanti¬ 
tative for Si using DFT. Thus, the work of Yin and 
Cohen [7] is based on self-consistent pseudopoten¬ 
tial calculations using LDA. From their work ([7]; 
see also Tersoff [6]), we constructed Table II which 
records DFT values for the cohesive energy of Si 
with different atomic coordination numbers. Both 
cohesive energy/bond and per atom are collected 
in Table II. As to the former case, the first show 
row of this table shows a clear, monotonic de¬ 
crease as the coordination number increases, which 
is a quantitative account for Si of the arguments of 
Abell [23]. The cohesive energy per atom, in con¬ 
trast, goes through a maximum as the coordination 
number c increases through diamond, leveling off 
at higher values of c. 

Below, we focus on the use of the Tersoff poten¬ 
tial for Si, constructed to embody almost all the 
DFT results in Table II, for calculating the cleavage 


TABLE IK__ 

DFT values for cohesive energy of Si (in eV). a 



Dimer 

Graphite 

Diamond 

Simple cubic 

fee 

Cohesive energy / bond 

3.2 

2.6 

2.3 

1.4 

0.7 

Cohesive energy / atom 

1.6 

3.9 

4.7 

4.3 

4.1 


a Constructed from Yin and Cohen [7]; see also Tersoff [6]. 
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force as a function of interplanar separation [8]. 
But two points should be made first in this connec¬ 
tion: First, there are, as the author himself empha¬ 
sizes, overly soft bond-angle forces in Tersoff's 
parametrization of the DFT results. He himself 
stresses the need for an improved treatment of 
bond-angle forces (see, again, the recent study of 
Alinaghian et al. [24]). Second, Tersoff [6] also 
emphasized the merits of his potential as applied 
to Si(100) and Si(lll) surfaces: These are a focal 
point of the study of Matthai and March [8] to be 
summarized later. 


Cleavage Force Between Planes of 
Atoms in sp Metals and in Si 


POSITION AND HEIGHT OF MAXIMUM IN 

CLEAVAGE FORCE F{z) FOR Si: RELATION 

TO BOND BREAKING 

Matthai and March [8] discussed the cleavage 
force in a model of Si with a chemically bonded 
force field. In their model, a cleaved piece of Si has 
parallel faces at separation a + z, with a the origi¬ 
nal interplanar spacing perpendicular to the cleav- 
age plane. 

These workers then presented an estimate for a 
block of Si of the maximum value of F(z) and its 
position, z max say, using the coordination-depen- 
dent interatomic potential of Tersoff [6], already 
discussed above. 

Matthai and March considered two cases (see 
also Fig. 3): 


JELLIUM MODEL FOR sp METALS: ELASTIC 
REGIME 

We shall refer to the phonon theory below to 
summarize the way to calculate the cleavage force 
in the elastic regime from appropriate dispersion 
relations, following Zaremba [13] and Kohn and 
Yaniv [14]. However, let us first obtain for simple 
sp metals the gross variation of the cleavage force 
F(z ) in the elastic regime, where 

F(z) = Az, (11) 

with z the interplanar separation. The work of 
March and Paranjape [25] used the simplest DFT, 
namely, the linearized Thomas-Fermi (TF) approx¬ 
imation, combined with a semi-infinite jellium ap¬ 
proach to obtain (see also Heinrichs [26]) 


(i) a Si(lll) surface, with one bond cut and 

(ii) a Si(100) surface, with breaking of two 
bonds. 


The modeling of the surface was carried out by 
having 12 planes of movable atoms on top of a 
fixed substrate. The system was allowed to relax 
by molecular dynamics. The atoms on the surface 
planes were held fixed, with the remainder per¬ 
mitted to relax. As the energy E(z) is a function of 
the separation z of atoms in the two surface planes, 
the cleavage force F(z) then is given by 


F(z) 


dE(z ) 
dz 


(14) 


A = , (12) 

with r s the mean interelectron separation, related 

to density n 0 by n 0 = 3/47rrf. Their value of the 
constant t was refined (and substantially in¬ 

creased) according to the subsequent work of 
Streitenberger [27] who gave the result 

fl6 

t = 0.229—Try. (13) 

m v 

Having summarized this simple approach to treat 
the elastic regime in sp metals, we note that the 
same model can be used on a jellium cylinder to 
predict size effects in atomic force microscopy (see 
the Appendix). Let us turn next to the recent work 
on the cleavage force in ideal Si crystal, treated by 
Matthai and March [8]. 


The form of E(z) obtained by Matthai and March 
is shown in Figure 4. 

The maximum in cases (i) and (ii) occurs when 
z lies between 0.8 and 0.95 of the original interpla¬ 
nar separation a . To gain insight, Matthai and 
March considered the H 2 molecule as it is pulled 
apart from its equilibrium-internuclear separation, 
R e say. Using the Coulson-Fischer wave function 
[28], designed to Allow a "transition" from a delo¬ 
calized molecular orbital (LCAO) behavior at R e 
to a (unsymmetrized) Heitler-London character at 
large internuclear separation R, Matthai and March 
pointed out that after extension of 0.6 R e from the 
equilibrium separation the LCAO MO is no longer 
the lowest-energy state, the electrons becoming 
localized. We identify this "transition" with bond 
breaking. From Figure 4, we see that the situation 
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(a) 


(b) 

FIGURE 3. Si(111) surface with one bond cut; (b) 
Si(100) surface with breaking of two bonds. (After Matthai 
and March [8].) 

is only quantitatively different 'for Si, a somewhat 
larger extension now being required for bond 
breaking. Of course, this treatment is restricted to 
ideal crystalline Si: In real materials, defects will 
lead to important changes from such ideal behav¬ 
ior. 

We turn to the area of lattice dynamics, where 
some precise statements relating to interatomic 
forces can be made. 



FIGURE 4. Energy E{z) as a function of separation z 
of atoms in two surface planes of Si. Cleavage force 
F(z) is then to be obtained from Eq. (14). (After Matthai 
and March [8].) 


Phonon Theory in Conducting Crystals, 
Including Cleavage Force in Elastic 
Region 

We start with simple metals and adopt the 
treatment of Jones and March [29]. For pair poten¬ 
tials, one has an equivalent "rigid-ion" model (see 
first section) in which the ground-state density 
p(r) in the presence of a phonon frozen into the 
lattice is given by 

p(r) = £o-(|r - l - u,|), (15) 

where / are the direct lattice vectors and u, denote 
the small displacements from the lattice positions 
due to the frozen phonon. The spherical localized 
"blobs" cr(r), centered on each nucleus in the 
perturbed lattice, are assumed to move "rigidly," 
i.e., without deformation, as the ions suffer small 
displacements from their equilibrium lattice posi¬ 
tions l n . 

Jones and March [29] showed, from the concepts 
of DFT, that while the above is, of course, approxi¬ 
mate, although a good approximation for bcc Na 
metal from the earlier discussion of liquid Na near 
freezing, a vector rigid-ion picture is formally ex¬ 
act for phonon theory. Thus, one writes for the 
first-order density change p 2 (r) when the ions are 
moved by small displacements u ; from the sites Z, 
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following Jones and March, 


Pi(r) 



(16) 


This change is given by linear response theory as 


Pl (r, E) = f AV m (t')F(rr'E) dt ', (17) 


where (see, e.g., Stoddart et al. [30]) F can be 
calculated from the one-body periodic potential 
V(r) of DFT from 


dF 

Je 


2 Re 


G 0 (rr 1 E + ) 


dp^r^E) 

dE 


(18) 


where G 0 is the perfect lattice Green function 
while p 0 (rr'E) is a Dirac (idempotent) density ma¬ 
trix whose diagonal element p 0 (r) is the exact 
crystal density. 

One can (Jones and March [29]) express the 
change in the one-body potential AV (1 ) Gt) in Eq. 
(17) due to " freezing in a phonon" as 


AV (1) = AV$ ctrostatic + / U(rr')pi(r') dr'. (19) 


Writing Eq. (16) next as 

Px(r) = r E/(r), (20) 

/ 

Jones and March demonstrated that R/(r) = R(r - 
l ) and they obtained an integral equation for R(r) 
from DFT. This was solved approximately for Al 
and for Si (Claesson et al. [31]). The relation to the 
"rigid-ion" model, or pair potential theory, was 
fully discussed by Jones and March [29] and also 
the way phonon dispersion curves can be calcu¬ 
lated. 

FERMI SURFACE IN METALS AND 

LONG-RANGE FORCES 

Flores et al. [32] used the above formulation to 
treat the effect of Fermi surface topology on long- 
range oscillatory interactions in metals, generaliz¬ 
ing the results for simple sp metals discussed 
earlier in this article. When, for instance, there are 
necks on the Fermi surface, obvious anisotropic 
effects arise, and in certain directions, an interac¬ 
tion energy-oscillatory function /r, with r the 
range of the interaction, can occur, contrasted with 
cos2 k F r/r 3 in simple metals. 


The writer (March [33]) has emphasized a re¬ 
lated point, namely, that the one-body potential of 
DFT, V xc (r ), plus li(rr') discussed above, are suffi¬ 
cient to map out the Kohn [34] anomaly in the 
lattice vibrations. This will then yield the correct 
Fermi surface. Direct use of the one-electron eigen¬ 
values from the solution of the Kohn-Sham equa¬ 
tions with potential V(r) does not, on the other 
hand, give the correct Fermi surface, as pointed 
out by Gunnarsson and Schonhammer [35]. 


ELASTIC REGIME OF CLEAVAGE FORCE IN 

TERMS OF PHONON THEORY 

We return briefly at this point to the elastic 
regime of the cleavage force F(z), where F(z ) — 
Az. The work of Zaremba [13] and Kohn and 
Yaniv [14] gave A explicitly in terms of the phonon 
dispersion relations in the perfect lattice before 
cleavage and this is a very precise route for the 
calculation of the constant A. One expects nearly 
free-electron metals to regain grossly A ar~ 11/2 as 
obtained by March and Paranjape [25] as well as in 
the later work of Streitenberger [27]. We stress that 
all the above is within the context of DFT, al¬ 
though in constructing AV (1) appearing in Eqs. 
(17) and (19), Jones and March emphasized that 
one requires the "exchange-correlation" func¬ 
tional U( rr')/ which transcends knowledge of the 
one-body potential of DFT. Thus, to complete the 
phonon calculation, as Jones and March [29] 
proved, one also needs the further functional 
derivative 


8V XC ( r) 

U(r,r') = w (21) 

6p(r ) 

with V xc (r) the usual exchange-correlation poten¬ 
tial of DFT. Claesson et al. [31] discussed approxi¬ 
mate forms of F and U for (a) Al and (b) Si and 
the interested reader is referred to their article for 
the detailed forms. However, it is noteworthy that 
a shell model treatment of Si is equivalent to a 
factorization of the response function E(r, r'), 
which, of course, affords great simplification. 


Summary and Future Directions 

Different DFT approaches have been discussed 
for treating interatomic force laws in condensed 
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metallic phases and for dealing with interplanar 
interactions in suitably cleaved crystals. It has been 
stressed in the latter case that the calculations 
presented here are for "ideal" materials. Natu¬ 
rally, defects will have to be incorporated before 
the theoretical predictions can be brought into 
direct contact with observation. 

Returning to the central Eqs. (1) and (2), for 
simple liquid metals such as Na near freezing, 
contact has been made between a DFT pair poten¬ 
tial calculated from first principles using LDA and 
that extracted using only classical statistical me¬ 
chanics to invert the measured liquid structure 
factor S(q). As Figure 1 and Table I show, the 
agreement is already rather remarkable, although 
some purely quantitative differences remain. As 
this article was nearing completion, work by 
Rajagopalan and Srivastava [36] appeared on liq¬ 
uid Cu. This was used subsequently by March and 
Ray [37] to compare it with the density functional 
calculations of Carlsson et al. [21], who extracted a 
pair potential for Cu. March and Ray [37] also 
used the coordination number representation (2) 
for Cu to extend the Carlsson et al. work on fee Cu 
to give E(c, r 0 ) also for bcc, sc, and diamond 
structures, but for these latter cases, to our knowl¬ 
edge, there is not as yet DFT work for comparison. 
For the simpler sp metal K, however, Rubio and 
March [38] very recently brought the DFT calcula¬ 
tions for a variety of structures (in LDA) into 
contact with the quantum chemical model (2). 
However, they used semiempirical representations 
of the singlet and triplet potential-energy curves 
for the K 2 dimer. While the potential-energy 
curve is obviously accessible through DFT, more 
problems exist in calculating the triplet curve from 
DFT, but approaches such as that set out by 
Theophilou [20] and later developments should be 
studied further in the context of the quantum 
chemical representation (2). 

Finally, phonon theory has been posed here in 
density functional language, with illustrative ex¬ 
amples given for Al and Si crystals and also for the 
cleavage force following [13, 14]. The Appendix 
treats size effects on the cleavage force in nearly 
free-electron metals and the establishment of a 
self-consistent field from the simplest DFT for fu¬ 
ture studies of such size influences. Atomic force 
microscopy may, perhaps, in the future, throw 
further light on the relevance of such studies to 
real materials, e.g.. Si and W [39, 40]. 


Appendix: Size Dependence of 
Cleavage Force Between Parallel 
Planar Jellium Metal Surfaces 


Razavy et al. [41] very recently discussed size 
effects in the jellium model already referred to in 
the main text. As there, the simplest DFT, namely, 
the linearized Thomas-Fermi (TF) model, was em¬ 
ployed. The force between two parallel planar con¬ 
ducting cylinders of infinite extension was consid¬ 
ered by these workers. 

In such a system, with cylindrical symmetry, 
the main achievement was to establish the self- 
consistent field analytically, by solving the lin¬ 
earized TF equation 


d 2 V dV d 2 V 

—— + p 1 - + - — 

dp 2 dp dz 2 



(Al) 


with q 1 the TF screening length. Separating the 
variables through 

V(p,z) = F(p)Z(z) (A2) 

leads to two differential equations: 

d 2 Z 

77 = -p 2 Z (A3) 

dz 

and 

d 2 F dF 

7-2 +P- 1 — = ( P 2 + ( ? 2)F ' (A4) 

dp dp 


with p being a real separation constant. Specifying 
regions I and II as within the two semi-infinite 
jellium cylinders, Razavy et al. [41] found in these 
regions 


V(p , z) 



C( p)cos pzl 0 [ Q( p)p] dp, p < R, 


(A 5) 


with R the radius of the cylinders and I Q the usual 
Bessel function. To specify C( p), Razavy et al. [41] 
matched V in Eq. (A5) to the electrostatic potential 
in region III, where this denotes the whole region 
outside the two jellium semi-infinite cylinders, in¬ 
cluding the gap of width d between them. 

The energy stored in the electrostatic field was 
then calculated as the sum of two terms, A E l and 
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A E 2/ and the force F from 

<?A Ei d A E 2 

p — _1 _j_ _i 

dd dd 


(A6) 


Razavy et al. showed that the first term is indepen¬ 
dent of d, while all the dependence of A E 2 on d 
can be subsumed within the simple factor 
sin (kd/2), with k a Fourier transform variable. It 
would be of obvious interest if this size depen¬ 
dence predicted by the simplest DFT could be 
tested using the atomic force microscope and work 
along these lines is in progress (see Allegrini et al. 
[40]). 

In concluding this Appendix, it is worth com¬ 
menting that the work of Razavy et al. discussed 
above makes available an (of course, approximate) 
self-consistent field as a function of cleavage sepa¬ 
ration d. In the future, this may well afford a 
"zeroth-order" field to use in solving the 
Kohn-Sham equations in cylindrical symmetry for 
a cleaved jellium cylinder. Of course, an LDA 
contribution to V(r), namely, V xc (r ) would have to 
be added, since the TF approximation used by 
Razavy et al. [41] does not include exchange and 
correlation. 
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ABSTRACT: Electronic structures and properties of carbides MC, M 5 C (M = 

Fe, Ni, Cu), and TiC and Ti 7 C were studied using density functional methods (DF), 
particularly the local density approach (LDA) with the Vosko-Wilk-Nusair (VWN) 
correlation functional, the generalized gradient approximation (GGA) of Becke and 
Perdew (using Becke's 1988 exchange functional and Perdew's 1986 correlation functional 
—BP86), and the ADF program of Baerends et al. In the first part of this report, we 
studied equilibrium geometries and dissociation energies for the process MC -> M 4- C 
involving the doublet ground state for M = Cu, singlet and triplet states for M = Ni, and 
triplet and quintuplet states for M = Fe, Ti. Charge distributions by population analysis, 
dipole moments, and vibrational frequencies were also evaluated. All calculations were 
done using triple-zeta basis sets, with frozen-core orbitals and the GGA corrections. In 
the second part of this report, we consider the doublet states of Cu 5 C and singlet and 
triplet states of Ni 5 C, taking planar and nonplanar models for the M 5 clusters. The triplet 
and quintuplet states of Fe 5 C were studied with a nonplanar model for the Fe 5 cluster, 
whereas in the case of the triplet and quintuplet states of Ti 7 C, a Ti 7 planar model was 
chosen. Optimization of the position of the C atom in the cluster, dissociation energies, 
distribution of charges in the molecule, and dipolar moments were also analyzed, and 
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comparisons with the results obtained for the corresponding carbides (MC) were made. 
These results, together with the ones of the corresponding carbonyls—that have 
previously been done by the same methodology—provide us with an interesting 
comparison of the M—C bond dissociation energy in carbides and carbonyls. Calculations 
of the C—O bond dissociation energies for the MCO and M n CO compounds were also 
performed. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 919-928, 1997 


Introduction 

C arbonyls and carbides (and also carbenes) of 
transition elements are a very important class 
of compounds for several reasons: One of these is 
that carbonyls are considered to be the first step in 
the reaction of CO with H 2 , catalyzed by transition 
elements (Fischer-Tropsch synthesis). From the in¬ 
dustrial point of view, this is of paramount impor¬ 
tance due to the large number of chemicals that 
can be obtained from it. 

The initial step of the chain of reactions is the 
chemisorption by materials which contain such 
elements. The simplest model for these materials is 
one atom, which is where the importance of the 
MCO compounds lies. Carbides (MC) are obtained 
by cleavage of the C—O bond from carbonyls, and 
carbenes (MCH 2 ) are obtained by hydrogenation 
of the carbides or carbonyls. In previous reports 
[1-3], we considered carbonyls and carbenes of Ti, 
Ni, Fe, and Cu using DFT methods. In this report, 
we focus on carbides as a complementary study. 

As a further step, it is natural to consider—as 
suitable models for real materials—clusters or 
crystals of the elements involved and to use the 
appropriate computational techniques. To compare 
the results obtained in the MC and M ?J C calcula¬ 
tions, we must use the same computational level 
in both cases. Therefore, the cluster size cannot be 
too large. 

These calculations should provide accurate pre¬ 
dictions, at least for the equilibrium molecular 
geometries, dissociation energies, and vibrational 
frequencies. For this reason, we studied—besides 
the MC compounds—the M 5 C structure with 
symmetry C 4v (for M = Ni, Fe,Cu) and Ti 7 C with 
symmetry C 6v . They all involve transition-atom 
clusters with geometries taken from their crystals. 

The transition elements considered in this re¬ 
port (Ti, Cu, Ni, and Fe) present quite different 
difficulties. Cu is the simplest because of its elec¬ 
tronic structure, as compared to Ni, Fe, and Ti; Ni 


is the most difficult. We focused our interest in the 
geometrical and energetical aspects of the systems 
under consideration. 


Methods 

The calculations reported in this article are based 
on the Kohn-Sham approach to density functional 
theory (DFT) [4], with the methodology of Baerends 
et al. The program used was ADF (version 2.0.1) 
[51 

The density functional, also called the ex- 
change-and-correlation (XC) functional, consists of 
a local part (the local density approach, LDA) and 
a so-called gradient correction (the generalized 
gradient approximation, GGA). Initially, the LDA 
used for the calculation of the exchange energy 
Ex(LDA) was the one proposed by Slater [6] (Xa 
approximation), but a version that also includes a 
correlation part—the one used in this report—is 
that proposed by Vosko, Wilk, and Nusair (VWN) 
[7]. 

The GGA introduces terms depending on the 
gradient of the electron density that account for 
the inhomogeneities necessary to describe the 
nonuniform electron gas in the case of atomic and 
molecular systems. GGA corrections are the sum 
of two terms: exchange and correlation. The intro¬ 
duction of these, sometimes called nonlocal correc¬ 
tions, significantly improve the results of the 
calculation of several properties, such as bond 
energies, molecular geometries, and dipole mo¬ 
ments. 

We have several possibilities using the ADF 
program with the GGA correction calculations. The 
one used in this report is due to Becke [8] for the 
exchange term and that of Perdew [9] for the 
correlation. The method is called, accordingly, 
LDA/BP. 

We used the Slater-type orbital (STO) basis set 
for the atomic orbitals (AO), which are internally 
stored in the ADF program. We selected the one 
called IV, which is a triple basis set with polariza¬ 
tion. 
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Integrals involving electronic density (such as 
Coulomb potentials) were evaluated using these 
auxiliary bases and then taking linear combina¬ 
tions of these integrals. The coefficients of these 
linear combinations were evaluated in each SCF 
cycle. 

The matrix elements of the overlap and Fock 
matrices were calculated by numerical integration. 
The success of DFT methods—such as the one 
implemented in ADF—is due, among other rea¬ 
sons, to the existence of accurate and very efficient 
numerical methods for the calculation of integrals 
using STO functions [10]. 

Instead of calculating total energies, as most 
methods do, ADF calculates bond energies accord¬ 
ing to Ziegler's procedure [11]. These bond ener¬ 
gies are the difference between the total energy of 
a molecule (or atom) and those of the atoms (or 
fragments) in established reference states. 

Optimization of the M—C bond length at the 
LDA/BP level was done for all the compounds 
considered in this report. In all cases, the lowest 
energy of the M and C fragments were obtained, 
and the adiabatic M—C bond dissociation energy 
was calculated in every case using the sum-of- 
fragments energies ( E so and the energy at the 
equilibrium geometry of the MC compounds. Mul- 
liken population analysis was carried out, and the 
charge distribution in the molecule and the related 
dipole moment were obtained, as well as the vi¬ 
brational frequencies of the normal modes. 

Regarding the M 5 C compounds, we have two 
possibilities for the M 5 clusters: Ni and Cu crystals 
have fee (face-centered cubic) elementary cells, 
whereas Fe has a bcc (body-centered cubic) and Ti 
has an hep (hexagonal close-packed) one. In the Ni 
and Cu cases, we selected two cluster models. The 
first one, called "planar" and symbolized by (5,0), 
had all five atoms located on the vertices of a 
square and its center. This would correspond to 
the top face of the elementary cell. The other, 
called "hollow" and symbolized by (4,1), was 
formed by taking the four-center atoms of the 
lateral faces of the cube and the fifth atom on the 
center of the bottom face, thus forming a square 
pyramid with the four M atoms at the corners of 
the base and the fifth M atom at the top. This was 
the only one employed in the case of Fe. We are 
using the nomenclature given by Post and 
Baerends [12]. 

The geometry of these clusters was taken from 
experimental crystal data. The distance from the 
center of the square to one of the M atoms was 


2.54 A in the case of Cu, 2.50 A in the case of Ni, 

o 

and 2.03 A in the case of Fe, and for Ti, the center 

o 

of the hexagon was 2.91 A from the other Ti atoms. 

The central M atom in (5,0), which we will call 
Mj, was displaced from the M 4 plane a distance 
equal to 1.80 A for the Cu 5 (4,1) cluster and 1.77 A 
for the Ni 5 (4,1) cluster. In the case of the Fe 5 
cluster, the distance from Fe 1 to the Fe 4 plane was 
1.44 A. 

All calculations were carried out at the unre¬ 
stricted approximation level. We checked the elec¬ 
tronic configuration after SCF convergence in or¬ 
der to see if the LUMO orbital was lower in energy 
than was the HOMO, both for a and /3 spin. Every 
time we found this not to be the case, we changed 
the configuration accordingly, verifying the de¬ 
crease in energy and continuing until the lowest 
energy was attained. In some cases of difficult 
convergence, we had to resort to the level-shifting 
technique, increasing the energy of the virtual or¬ 
bitals a little bit (in some cases up to 0.5 au). 


Results and Discussion 

All calculations reported in this article were 
done at the LDA/BP level. It is well known [13, 
14] that predictions for molecular geometries are 
improved when GGA corrections—such as 
BP—are included in the LDA methodology, result¬ 
ing in values very close to experimental ones. In 
general, DFT methods are superior to Hartree-Fock 
(HF) ones, particularly for metal-ligand bond 
lengths. 

CO CASE 

One of the objectives of this article was to eval¬ 
uate the dissociation energy of the C—O bond 
when the carbonyl group is attached to one or 
several transition-element atoms and then to com¬ 
pare this value to that of the free CO molecule. In 
that way, we can rationalize the action that differ¬ 
ent metal catalysts may have on reactions that 
involve the cleavage of that particular bond. 

For the free CO molecule, the C—O bond en¬ 
ergy obtained—with an optimized C—O bond 
distance of 1.1394 A—was -0.540653 au. For the 
fragments O (triplet), O (singlet), and C (singlet), 
the bond energies were - 0.057444, - 0.00875, and 
— 0.000206 au, respectively. Therefore, the sum of 
the lowest bond energies for the resulting frag- 
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merits of the dissociation of CO was E so f [C(l) + 
0(3)] = — 0.057650 au, which gave us a value for 
the (adiabatic) dissociation energy equal to 303 
kcal/mol. 

We found in the literature [15] an experimental 
value of 11.09 eV (256 kcal/mol) for this dissocia¬ 
tion energy and a C—O bond distance of 1.127 A. 
In [16], a calculated value of 277 kcal/mol is 
reported, as well as a bond distance of 1.127 A. 
They also provided an experimental value of 259 
kcal/mol, so our result is about 44 kcal/mol too 
high (17% higher). As far as bond distance is 
concerned, the agreement is satisfactory. 

We calculated values for the CO dissociation 
energy and bond distance using different bases 
and DFT parameterizations. These values were in 
good agreement with the ones just referred to. 

In any case, since we focus our interest on 
energy differences to be able to assess the influ¬ 
ence of the nature and number of the different 
metal clusters bonded to the CO molecule on the C 
—O bond dissociation energy, we used in our 
calculations the value just quoted (303 kcal/mol). 


MC AND MCO (M - Ti, Fe, Ni, Cu) CASES 

In Table I, we show the bond energies (in atomic 
units) for the optimized distances d( MC) as well as 
the adiabatic dissociation energies and electronic 
configurations of the different carbides MC. We 
evaluated these dissociation energies by subtract¬ 
ing from the bond energies the ones corresponding 
to the M and C atoms in the configurations and 
multiplicities that yield the lowest energies. 


The bond energies for the different atoms that 
we used to calculate the E so j are given below: 


Doublet Cu 
Triplet Ni 
Quintuplet Fe 
Quintuplet Ti 
Singlet C 


(2 3 5/1 3 SXsV 0 ) 
(2 3 5/1 3 4)(s 1 d 9 ) 
(2 3 5/2 3 1 )(s 2 d 6 ) 
(2 3 3/1 3 OXs 1 ^ 3 ) 
(1 1 0/1 1 0)(s 2 p 2 ) 


— 0.009972 au 
-0.021278 au 
-0.137846 au 

— 0.042003 au 

— 0.000206 au 


In Table II, we provide values for the equilib¬ 
rium geometries, bond energies, and dissociation 
energies of the M—C bonds of the carbonyls given 
in [1] (for the Fe, Ni, and Cu compounds) and [2] 
(for the Ti compounds). We also included, under 
the column A D e , the difference between the values 
of the M—C bond dissociation energies in the 
carbide and the corresponding carbonyl. This dif¬ 
ference can be easily shown to be identical to the 
difference between the values of the C—O bond 
dissociation energy in free CO and the carbonyls, a 
key quantity in this report. This reduction is very 
important, since it shows the decrease in energy 
needed to dissociate the carbonyl—i.e., to break 
the C—O bond—when going from free CO to CO 
bonded to a metallic ligand. It explains why in 
catalytic processes, and especially in heteroge¬ 
neous catalysis with transition elements, reactions 
whose rate-determining step involve the cleavage 
of that particular bond are enhanced compared to 
their noncatalyzed counterparts. The correspond¬ 
ing electronic configurations for the carbonyls are 
the ones detailed in the publications just refer¬ 
enced, so we shall not repeat them here. 


TABLE I_ 

MC compounds. 


Molecule 

d 

E 

De 

Configuration 

Esof 

CuC(2) 

1.799 

-0.154060 

90.3 

(5 5 2/5 4 2) 

-0.010178 

NiC(1) 

1.624 

-0.246783 

141.4 

(5 4 2/5 4 2) 

-0.021484 

NiC(3) 

1.625 

-0.230529 

131.2 

(5 5 2/4 4 2) 

-0.021484 

FeC(3) 

1.565 

-0.386087 

155.6 

(5 4 2/4 4 1) 

-0.138052 

FeC(5) 

1.628 

-0.357984 

138.0 

(5 5 2/4 4 0) 

-0.138052 

TiC(3) 

1.679 

-0.279043 

148.6 

(5 4 0/3 4 0) 

-0.042209 

TiC(5) 

1.829 

-0.244165 

126.7 

(5 4 1/3 3 0) 

-0.042209 

d: M—C distance (A); E: bond energy (au); D e : adiabatic bond dissociation energy (kcal/mol); E sof : sum of fragments energy 

(au). 
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TABLE II_ 

MCO compounds. 


Molecule 

d(MC) 

d(CO) 

a (MCO) 

E 

D e 

AD e 

D'e 

CuCO(2) 

1.903 

1.160 

136.8 

-0.575241 

15.4 

75 

228.2 

NiCO(1) 

1.661 

1.167 

180.0 

-0.659746 

61.4 

80 

223.1 

NiCO(3) 

1.861 

1.164 

142.5 

-0.605833 

27.5 

104 

199.5 

FeCO(3) 

1.698 

1.173 

180.0 

-0.753609 

47.1 

109 

194.6 

FeCO(5) 

1.864 

1.166 

163.7 

-0.741656 

39.6 

98 

204.7 

TiCO(3) 

2.013 

1.171 

180.0 

-0.626791 

27.7 

121 

182.2 

TiCO(5) 

2.034 

1.171 

180.0 

-0.653341 

44.4 

82 

220.7 


d{ MC): M—C distance (A); c/(CO): C—O distance (A); a(MCO): MCO angle (degrees); E: bond energy (au); D e : M—C bond 
dissociation energy (kcal/mol); AD e : difference between D e values from Tables I and II (kcal/mol); D' e : C—O bond dissociation 
energy (kcal/mol). 


The interaction energies for MC + O corre¬ 
spond to dissociation energies of the C—O bond 
in the reverse reaction, i.e., MCO -> MC + O. 
These values are obtained by adding the energy of 
0(3) (— 0.057444 au) to the one of the correspond¬ 
ing carbide and then subtracting the energy of the 
carbonyl, listed in Table II. The optimized distance 
for the carbide was used; therefore, the result 
corresponds to the adiabatic dissociation energy 

D' e - 

It is interesting to compare the optimized dis¬ 
tances M—C for the corresponding carbides (Table 
I) and carbonyls (Table II). In every case, we see a 
reduction in that distance when going from the 
carbonyl to the carbide. This is to be expected, 
since the strength of the C—M bond increases 
when only the M atom is the one bonded to the C 
atom, compared to the case when there is also a C 
—O bond involved. According to Tables I and II, 
this reduction goes from 0.037 A for NiC(l) to 
0.334 A for TiC(3). 

In the same fashion, we see a concomitant in¬ 
crease in the dissociation energy of the M—C 
bond when going from the carbonyl to the corre¬ 
sponding carbide, i.e., it is more difficult to re¬ 
move the C atom from the metal surface when it is 
present as a carbide than as a carbonyl. This in¬ 
crease, which is presented under the column A D e 
in Table II, also reaches its maximum at TiC(3). 

For NiC, like NiCO, the singlet state turns out 
to be more stable than is the triplet. The differ¬ 
ences in the distances d(Ni—C) are 0.037 A for the 
singlet and 0.236 A (much greater) for the triplet. 
Note that the distances d( Ni—C) in the case of the 


carbides are almost the same, whereas they are 
significantly larger for the triplet in the case of the 
carbonyls. 

The reduction in D e is 80 kcal/mol (similar to 
that of CuCO) for the ground state (singlet), but it 
increases to 104 kcal/mol for the excited state 
(triplet). When we go onto FeC, we see that—in 
the same manner as in FeCO—the triplet state is 
more stable than is the quintuplet and the values 
of AD' are 109 and 98 kcal/mol, respectively. For 
TiC, we find the triplet state to be more stable than 
is the quintuplet, as opposed to the corresponding 
carbonyls. 

According to these values, the compound in 
which the C—O bond is strongest (of the ones 
considered here) is CuCO(2), followed closely by 
NiCCXD and TiCO(5). On the other hand, it is 
weakest in TiCO(3), followed by FeCO(3). If we 
consider ground states only, then the weakest 
C—O bond is the one in FeCO(3), followed by 
TiCO(5) and NiCO(l). The strongest bond is still 
that of CuCO(2). 

In Table III, we tabulate the charge transfer, 
dipole moment, and overlap for the carbide species, 
obtained via a Mulliken population analysis of the 
different compounds listed. Appropriate values for 
the carbonyls are also included. We observe an 
important charge transfer from the metal atoms to 
carbon, which—for the ground states—oscillates 
between 0.29 au for NiC(l) and 0.40 au for TiC(3). 
These values are always slightly higher for the 
corresponding excited states. The same order is 
obeyed by the dipole moments, as is to be ex¬ 
pected. When compared to the carbonyls, we see 
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TABLE III_ 

MC compounds. 


Molecule 

q 

q' 

f 1 

S 

S' 

m 

f'(l') 

CuC(2) 

0.3164 

0.1032 

2.433 

0.37 

0.28 

621 (0.5) 

386 (7) 

NiC(1) 

0.2868 

0.1376 

1.983 

0.43 

0.42 

928 (25) 

617 (0.2) 

NiC(3) 

0.3248 

0.1132 

2.752 

0.43 

0.33 

877 (26) 

460 (9) 

FeC(3) 

0.3412 

0.1939 

2.471 

0.45 

0.40 

962 (57) 

574 (26) 

FeC(5) 

0.3977 

0.1896 

3.013 

0.46 

0.37 

887 (42) 

462 (6) 

TiC(3) 

0.3969 

0.2951 

3.151 

0.51 

0.33 

982 (83) 

431 (10) 

TiC(5) 

0.4082 

0.2675 

3.265 

0.37 

0.34 

865 (52) 

402 (25) 


q : Atomic charge transferred from the M to the C atom; fju: dipole moment (D); S: total M—C overlap; f : vibrational frequency 
(cm" 1 ); /: intensity (km/mol). Primed quantities refer to the corresponding carbonyl. 


that the charge transfer is fairly smaller than in the 
carbide case, but follows essentially the same or¬ 
der. 

In the same table, we display the vibrational 
frequencies and intensities of the M—C bonds for 
the carbides MC (/) and for the carbonyls MCO 
(/') (the latter were taken from [1, 2]). We can 
infer from them that there is a marked increase in 
the vibrational frequency of the M—C stretching 
mode when going from the carbonyl to the carbide 
(which goes from 235 cm 1 for CuC(2) to 551 
cm -1 for TiC(3), concomitant with a reduction in 
the M—C bond distance. 

For the ground states of NiC, FeC, and TiC, the 
frequencies are fairly close to one another and 
significantly greater than that of CuC. This denotes 


stronger M—C bonds in those compounds, in 
agreement with the dissociation energies given in 
Table I. The overlaps follow a similar trend. 

In the case of TiC, comparing the triplet ground 
state to the excited quintuplet, we see that the 
vibrational frequency of the M—C bond in the 
first is significantly larger than in the second. This 
is also true for the dissociation energies (Table I). 

M„C AND M n CO (n = 5 FOR M = Fe, 

Ni, Cu; n = 7 FOR M = Ti) CASES 

In Table IV, like in Table I, we show the equilib¬ 
rium geometries and bond energies of the carbide 
species with the C atom in the field of the M w 
clusters as well as their adiabatic dissociation ener- 


TABLE IV_ 

M n C compounds. 


Molecule 

d 

E 

De 

Configuration 

^sof 

Cu s C(2)(5,0) 

1.750 

-0.393986 

93.7 

(11 3 7 4 25/11 3 7 4 24) 

-0.244523 

Cu 5 C(2)(4,1) 

0.05 

-0.425183 

157.4 

(12 3 7 4 24/11 3 7 4 24) 

-0.174142 

Ni 5 C(1)(5, 0) 

1.625 

-0.606384 

145.8 

(11 3 6 3 24/10 3 6 3 24) 

-0.373782 

Ni 5 C(1)(4,1) 

— 0.050t 

-0.587068 

183.8 

(11 2 7 4 23/10 3 6 4 24) 

-0.293903 

Ni 5 C(3)(5, 0) 

1.65 

-0.611966 

149.3 

(11 2 7 4 24/11 3 6 4 22) 

-0.373782 

Ni 5 C(3)(4,1) 

-o.iot 

-0.595919 

189.4 

(10 3 7 4 24/10 3 6 4 23) 

-0.293903 

Fe 5 C(3)(4,1) 

0.50 

-1.274801 

186.0 

(11 2 4 5 21/10 2 3 6 20) 

-0.978150 

Fe s C(5)(4,1) 

0.40 

-1.281033 

189.9 

(11 2 4 5 22/10 2 3 5 20) 

-0.978150 

Ti 7 C(3)(7,0) 

1.82 

-1.086318 

129.9 

(10 1 3 3 1 5 13/10 1 3 3 16 10) 

-0.878152 

Ti 7 C(5)(7,0) 

1.79 

-1.101740 

139.5 

(10 1 3 3 16 13/10 1 3 2 16 10) 

-0.878152 

d : C distance (A); f: negative distances indicate central atom Mi and C atom are on the same side of the M 4 

energy (au); D e : bond dissociation energy (kcal/mol); E sof : sum of fragments energy (au). 

plane; E: bond 
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gies. To calculate these dissociation energies, one 
must know the bond energies for the M„ clusters 
in their configurations of minimum energy, which 
are listed below: 


Cu 5 (5, 0) (10 3 7 4 24/10 3 7 4 23) 

Cu 5 (4, 1) (10 3 7 4 24/10 3 7 4 23) 

Ni 5 (5,0) (9 3 7 4 22/9 2 7 4 23) 

Ni s (4, 1) (9 3 7 4 22/9 2 7 4 23) 

Fe 5 (4,1) (8 3 3 6 22/10 1 3 6 18) 

Ti 7 (7,0) (9 1 3 3 16 12/8 1 3 2 16 10) 


- 0.244317 au 
-0.173936 au 
-0.373576 au 
-0.293697 au 
-0.977944 au 
-0.878946 au 


Adding to these energies the one of the C atom 
(which has already been quoted as — 0.000206 au), 
we get the sum of fragments energy E so p shown in 
Table IV. Similarly, in Table V, we show the equi¬ 
librium geometries, bond energies, and dissocia¬ 
tion energies of the M n —C bond for the carbonyls, 
which have also been given in [1] for M = Fe, Ni, 
Cu and in [2] for M = Ti. These are obtained from 
the sum of fragments energy given above and the 
energy of CO at the equilibrium distance (1.139 A), 
which is - 0.540653 au. We also provide values for 
the sum of fragments (M„ and CO) energy, E so j, 
but do not include the corresponding electronic 
configurations because they have already been 
given in prior reports, the only exception being 
those of the Ti clusters. These are (11 1 3 3 18 
12/11 1 3 3 18 10) and (11 1 3 3 18 13/11 1 3 2 18 
20) for Ti 7 CO(3)(7,0) and Ti 7 CO(5)(7,0), respec¬ 
tively. Note that the values reported in this table 
were calculated after optimization of the C—O 
distance, unlike in earlier publications [1]. 


Cu 5 C Case 

Let us consider first the Cu 5 C cluster and its 
corresponding carbonyl in both (5,0) and (4,1) 
geometries. 

Cu 5 CO is more stable in the geometry (5,0) by 
47 kcal/mol, whereas Cu 5 C is more stable in the 
(4,1) geometry by 20 kcal/mol. The Cu^—C dis¬ 
tance in Cu 5 C (5,0) is 1.750 A, 0.049 A smaller 
than in CuC. The dissociation energy of the Cu 5 C 
bond is 93.7 kcal/mol, 3.4 kcal/mol greater than 
in the former case. In this way, we see that the 
interaction of CuC with the square ring Cu 4 brings 
about a strengthening of the C—Cu 5 bond. 

When compared to the corresponding car¬ 
bonyls, we see—in the (5,0) case—a reduction in 
the Cu—C distance: from 0.225 to 0.104 A, which 
shows a greater interaction between the C atom 
and the Cu 5 group than between the C and the Cu 
atoms. The adiabatic dissociation energy of the C 
—O bond, D e , is concomitantly reduced with re¬ 
spect to the CuCO case (from 228 to 215 kcal/mol). 

In the case of the (4,1) cluster, we observe an 

o 

equilibrium distance of 0.05 A to the center of the 
square (on the opposite side relative to the C 
atom), and since the Cu x atom is 1.80 A away from 
the Cu 4 plane, the distance Cu 4 —C is 1.85 A, just 
a little greater than the one in CuC (1.80 A, Table 
I). However, here we have the influence of the Cu 4 
ring, which increases considerably the dissociation 
energy of the Cu 5 —C bond with respect to the 
(5,0) case, going from 93.7 to 157.4 kcal/mol. Thus, 
as the Cu 4 ring approaches the C atom, the inten¬ 
sity of the C-cluster bond is considerably enhanced 


TABLE V_ 

M n CO compounds. 


Molecule 

d 

d( CO) 

E 

^ sof 

»e 

d; 

A ° e 

Diff. 

Cu s CO(2)(5,0) 

1.975 

1.160 

-0.794584 

-0.784970 

6.03 

215.3 

87.8 

12.9 

Cu 5 CO(2)(4, 1) 

1.70 

1.15 

-0.719414 

-0.714589 

3.03 

148.6 

154.5 

79.7 

Ni 5 CO(1)(5,0) 

1.70 

1.17 

-0.950188 

-0.914229 

22.55 

179.7 

123.4 

43.4 

Ni s CO(1)(4,1) 

0.02 

1.23 

-0.876046 

-0.834350 

26.16 

145.3 

157.8 

77.8 

Ni s CO(3)(5,0) 

1.75 

1.17 

-0.958880 

-0.914229 

28.02 

181.6 

121.4 

17.8 

Ni 5 CO(3)(4,1) 

0.05 

1.22 

-0.856103 

-0.834350 

13.65 

127.2 

175.9 

72.2 

Fe s CO(3)(4,1) 

1.05 

1.22 

-1.547066 

-1.518597 

17.86 

134.8 

168.3 

59.8 

Fe 5 CO(5)(4,1) 

0.90 

1.25 

-1.592293 

-1.518597 

46.24 

159.3 

143.8 

45.4 

Ti 7 CO(3)(7,0) 

2.00 

1.16 

-1.472449 

-1.419599 

33.16 

206.3 

96.8 

-24.1 

Ti 7 CO(5)(7,0) 

2.05 

1.19 

-1.484859 

-1.419599 

40.95 

204.4 

98.7 

16.4 

d : M 1 — C distance (A); d( CO): C—O distance (A); E: bond energy (au); E sof \ sum of fragments energy (au); D e : M n - 

-C bond 

dissociation energy (kcal/mol); 
Tables II and V (kcal/mol). 

P : 

o 

1 

-O bond dissociation energy (kcal/mol); 

Diff.: difference between 

A D e values between 
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when going from (5,0) to (4,1). As for the dissocia¬ 
tion energy of the C—O bond (D'), it shows the 
opposite change: it gets reduced from 215 to 149 
kcal/moL 

]\i 5 C Case 

Let us now consider the Ni 5 C case. Here, we 
have four different cases to analyze: singlet and 
triplet combined with (5,0) and (4,1) geometries. 
According to Table V, the triplet state is more 
stable than is the singlet by 3.5 kcal/mol for the 
(5,0) case and by 5.6 kcal/mol for the (4,1) case. 
This differs from the NiC case, where the singlet is 
more stable than is the triplet by 10.2 kcal/mol. 

The dissociation energies D e for the bonds 
Ni—C and Ni 5 —C show tendencies similar to the 
ones of Cu. For the singlet, it increases slightly 
(from 141 to 146 kcal/mol) in the (5,0) case and 
strongly (to 184 kcal/mol) in the (4,1) case. 

For the triplet, we have similar values, amount¬ 
ing to 18.1 and 58.2 kcal/mol increments. Thus, 
the interaction of the Ni 4 ring with the NiC car¬ 
bide is similar to that of the Cu case. 

Now, let us look at the corresponding car¬ 
bonyls. In the case of NiCO, the singlet is more 
stable than is the triplet by 34 kcal/mol, according 
to Table II. In the case of Ni 5 CO, the singlet is still 
more stable in the (4,1) case, but the triplet is the 
more stable one when the structure is of the (5,0) 
type. It is observed, then, that in going from NiCO 
to Ni 5 CO, the difference in energy between the 
two multiplicities is reduced, being inverted in the 
(5,0) case. 

The reduction of the C—O bond dissociation 
energy amounts to 43 and 78 kcal/mol in the 
case of the singlet when going from NiCO(l) 
to Ni 5 CO(5,0) and Ni 5 CO(4,1), respectively. In 
the triplet case, we observe reductions of 18 
and 72 kcal/mol when going from NiCO(3) to 
Ni 5 CO(3)(5,0) and Ni 5 CO(3)(4,1), respectively. 

In [17], values are reported for compounds simi¬ 
lar to the ones that are the subject of this study, 
but of the form Ni(5,4) and Ni(4,5) as well as 
Ni(5,0) (in the nomenclature we are using). For 
Ni(4,5), distances in the order of 0.1-0.3 A are 
mentioned, not too different from the values found 

o 

in the present report (0.05 and 0.10 A, singlet and 
triplet, respectively). As for the Ni„—C bond dis¬ 
sociation energies, they report a calculated value 


of 265 kcal/mol and an experimental one of 170 
kcal/mol. The latter is in excellent agreement with 
our values (184 kcal/mol for the singlet and 189 
kcal/mol for the triplet). However, in the (5,0) 
case, their calculated value for the NT—C distance 
is 1.18 A, substantially smaller than ours (1.625 A), 
and a bond dissociation energy of 117 kcal/mol, 
whereas our values are 146 kcal/mol for the sin¬ 
glet and 149 kcal/mol for the triplet. 

Fe 5 C Case 

In the case of Fe 5 C, we have only the (4,1) 
geometry for the cluster and the triplet and quin¬ 
tuplet states. Regarding the carbide species, we see 
that—according to Table IV—the quintuplet is 
more stable than is the triplet by 3.9 kcal/mol, 
unlike the FeC case, where the triplet is more 
stable than is the quintuplet by 17.6 kcal/mol. This 
shows the great influence that the Fe 4 ring has 
over the FeC species, which is also reflected by the 
fact that the C atom is almost in the Fe 4 plane. 

As in prior cases, let us consider the influence 
that the Fe 4 group has over the Fe 5 —C bond 
dissociation energy in the carbides and over the C 
—O bond in the carbonyls. We observe an incre¬ 
ment in the former energy in the case of the triplet, 
going from 155.6 (FeC) to 186.0 kcal/mol (Fe 5 C). 
In the case of the quintuplet, the corresponding 
change is from 138.0 to 189.9 kcal/mol. Once again, 
the strengthening of the Fe—C bond by the action 
of the Fe 4 ring is clearly manifested. 

If we focus on the C—O bond dissociation en¬ 
ergy, we infer—from Tables II and V—that in the 
case of the triplet it diminishes by 59.8 kcal/mol. 
In the case of the quintuplet, the corresponding 
reduction is 45.4 kcal/mol. The influence of the 
Fe 4 ring on the FeCO species is found to strengthen 
the interaction of the Fe atom with the metallic 
cluster and weaken the C—O bond. 

In [18], the compound Fe 5 C(CO) 15 was consid¬ 
ered. It has three carbonyl groups per Fe atom, and 
it is the first carbide cluster reported in the litera¬ 
ture, being the most similar one to our Fe 5 C. The 
only information that is provided about this car¬ 
bide is the distance from the C atom to the Fe 4 
plane, which is estimated as being 0.08 A. Our 
value is 0.40 A, which is much larger, but one 
should keep in mind the differences between the 
two compounds. 
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Ti 7 C Case 

Lastly, let us consider the Ti carbides and car¬ 
bonyls. Here, we focused only on the (7,0) geome¬ 
try of the cluster. From Table IV, we see that in the 
case of Ti 7 C the quintuplet is more stable than is 
the triplet by 9.6 kcal/mol. This is quite the oppo¬ 
site of TiC, where the triplet is more stable by 21.9 
kcal/mol, as in the Fe cases. When we move onto 
the corresponding carbonyl compounds, we find 
the quintuplet to be more stable in both cases, the 
differences being 7.8 kcal/mol for Ti 7 CO and 16.7 
kcal/mol for TiCO. 

In the case of Ti 7 C(5), which is the ground state, 
there is a 12.8 kcal/mol increment with respect to 
the (excited) state TiC(5). This change is similar to 
those already seen in planar cases. 

However, if we consider Ti 7 C(3) (excited state), 
there is a 18.7 kcal/mol reduction in the Ti—C 
bond dissociation energy. This is an anomalous 
case, since it means that the interaction of the Ti 6 
ring with the TiC carbide strengthens the C-metal 
bond for the ground state, but not for the excited 
state. 

We have a similar situation for the C—O bond 
in the case of the carbonyls. In the quintuplet, the 
C—O bond dissociation energy decreases by 16.3 
kcal/mol when going from TiCO(5) to Ti 7 C0(5). 
In this case, both states are ground states. In the 
triplet, we have again an anomalous case, since the 
C—O bond dissociation energy increases by 24.1 
kcal/mol. 

When we concentrate on the values of charge 
transfer, we see that in the Cu and Ni compounds 
there is a marked increase (as shown in Table VI) 
when going from the (5,0) to the (4,1) geometry, 
whereas a reduction is observed in the dipole 
moments. In the (5,0) and (7,0) cases, this charge 
transfer is similar to those of the carbides MC, 
listed in Table III. 

For the charge distribution analysis, we will 
consider a typical case only. Choosing Cu 5 C as an 
example, a population analysis gives us for the 
atoms on the vertices of the square a positive 
charge of 0.1030 au, whereas that on the central 
one is -0.1586 au. This leaves a total charge for 
the cluster of 0.2535 au, which is the net electronic 
charge transferred to the C atom. In the (4,1) case, 
each Cu atom on a vertex carries a positive charge 
not too different from that of the previous case 
(0.1126 au), but the fifth Cu atom now bears a 
positive charge as well (0.1278 au), which results 


TABLE VI_ 

M n C compounds. 


Molecule 

<7 

V 

Cu 5 C(2) (5,0) 

0.2535 

1.335 

Cu 5 C(2) (4,1) 

0.5782 

0.313 

Ni 5 C(1) (5, 0) 

0.3494 

1.781 

Ni 5 C(l) (4,1) 

0.6035 

0.403 

Ni 5 C(3) (5,0) 

0.3208 

1.565 

Ni s C(3) (4,1) 

0.5695 

0.142 

Fe 5 C(3) (4,1) 

0.4786 

0.868 

Fe s C(5) (4,1) 

0.5507 

0.755 

Ti 7 C(3) (7,0) 

0.4049 

0.988 

Ti 7 C(5) (7,0) 

0.4141 

0.935 

q: Electronic charge transferred from M n 

to C (au). fi\ dipole 

moment (D). 



in a larger positive charge 

on the cluster (0.5782 

au). 




Conclusions 


1. For the carbides considered in this report, the 
most stable states are the singlet for NiC and 
the triplets for FeC and TiC, which is quite 
similar to the corresponding carbonyl com¬ 
pounds, except for TiCO, where the quin¬ 
tuplet is more stable than is the triplet. 

2. M—C bond lengths are significantly reduced 
when going from the carbonyls to the corre¬ 
sponding carbides, whereas bond dissocia¬ 
tion energies increase in like manner. 

3. There is a sizable reduction in the dissocia¬ 
tion energy of the C—O bond of the car¬ 
bonyls with respect to the free CO molecule. 

4. In the case of the M—C carbides, we observe 
an important charge transfer from the metal 
atom to the C atom, which varies between 
0.30 and 0.40 electronic charge units. 

5. The stretching frequencies for the M—C 
bonds of the MC carbides increase, as is to be 
expected, in the same order as bond dissocia¬ 
tion energies. These values are considerably 
higher than the ones of the corresponding 
carbonyls. 

6. Considering now the M„C carbides, we find 
the following results: First, for Cu 5 C, the 
geometry of the (4,1) cluster is more stable 
than is the (5,0) one, as opposed to the car- 
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bonyl case. Ni 5 C, however, shows higher sta¬ 
bility for the (5,0) geometry, and at the same 
time, the triplet becomes slightly more stable 
than is the singlet. NiC displays the opposite 
behavior. In the case of Fe 5 C, the quin¬ 
tuplet is more stable than is the triplet, in a 
similar way to that of the corresponding car¬ 
bonyls and unlike TiC, where the triplet is 
more stable than is the quintuplet. For Ti 7 C, 
and also for Ti 7 CO, the quintuplet is more 
stable than is the triplet. We find the oppo¬ 
site order for TiC. 

7. Looking now at the dissociation energy of 
the C—O bond in the M„CO compounds, we 
notice a reduction with respect to the corre¬ 
sponding MCO compounds. This reduction 
is moderate for the planar geometries, but 
significantly greater for the nonplanar ones. 
This means that the C—O bond is weakened 
considerably by the influence of the M 4 group 
(M = Cu, Ni, Fe). In the case of Ti 7 CO, we 
find a similar reduction in the C—O bond 
energy (compared to TiCO), but this is only 
so in the ground quintuplet state, since in the 
excited triplet state, we find an increase in 
that energy. 
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Introduction 

F or the past decade, the fabrication of ultra¬ 
small devices and associated developments 
in laser technology for exploration of their proper¬ 
ties have raised important theoretical questions, 
both conceptual and practical In particular, the 
short times sampled in femtosecond laser studies 
and the wealth of potential information from dif¬ 
ferent initial preparations call for a description of 
Coulomb kinetics (e.g., electrons and holes) far 
from equilibrium, extending from initial times to 
times long compared to the characteristic collision 
times and dephasing times [1,2]. Such theoretical 
considerations already have been explored in other 
contexts, such as transport in simple classical liq¬ 
uids and plasmas [3] and for nuclear matter [4], 
but their experimental relevance in these cases has 
been less compelling. The objective here is to for¬ 
mulate the problem of Coulomb kinetics in terms 
of the fundamental exact first two hierarchy equa¬ 
tions for the one- and two-particle reduced density 
operators. Practical applications result from an 
approximate closure of the two-particle equation 
subject to important constraints on acceptable ap¬ 
proximations associated with conservation laws, 
representability, stationary solutions, and quan¬ 
tum statistics. These structural properties are inde¬ 
pendent of considerations of classical or quantum 
effects and do not rely on the matrix representa¬ 
tion appropriate to the specific problem. The theo¬ 
retical analysis of a given approximation occurs at 
the compact abstract level for greatest simplicity 
and generality and in a formulation that allows 
exploitation of previous work on classical systems. 

This approach is illustrated for a simple two- 
band model of a semiconductor, to show how 
corrections to the semiconductor Bloch equations 
can be constructed to include the effects of scatter¬ 
ing and dynamic screening. In this discussion, only 
electron-electron and electron-laser interactions 
are considered; the lattice is presumed to be rigid. 
A simple closure approximation including exact 
three-particle exchange correlations and all resid¬ 
ual pair correlations is indicated and its content 
discussed. Solution of the two-particle equation 
gives the two-particle reduced density operator as 
a functional of the one-particle density operator. 
Use of this result in the first hierarchy equation for 
the one-particle density operator gives a closed 


kinetic equation. The resulting non-Markovian ki¬ 
netic equation describes the evolution of an arbi¬ 
trary initial preparation from asymptotically short 
times to the long-time Boltzmann limit. The short 
time evolution includes the buildup of dynamical 
screening (polarization effects); at long times, the 
Boltzmann scattering rates are determined from 
the full T-matrix rather than from the weak cou¬ 
pling Born approximation. 

In this brief presentation, only the theoretical 
structure and method are described as an overview 
of the general approach to carrier dynamics. More 
detailed calculations based on the approximation 
suggested here are in progress. 


Reduced Density Operators and 
Hierarchy 

We consider N electrons interacting with a rigid 
ionic lattice, with overall charge neutrality. The 
electrons interact with the lattice and with each 
other via bare Coulomb potentials. In addition, 
they interact with a classical (i.e., many photon) 
transverse electric field, E(0 via a dipole interac¬ 
tion. The Hamiltonian for the system is then 

N 

Hit) =H 0 + U+ U ext (t), H 0 = £ h 0 (i) (1) 

i= 1 

1 N 

U=-LV(i,j), U ext = — it - E(0, (2) 

^ i*j 

where h 0 (i) is the single-particle Bloch Hamilto¬ 
nian for interaction of electron i with the lattice, 
V(i, j ) is the Coulomb potential for interaction of 
electrons i, j, and fx = Efi 1 fx(i) is the total dipole 
moment due to all electrons. The external field 
E(0 characterizes the laser field, which may have 
a very short time scale in some pump experiments, 
but may also represent a longer time driving field 
with a duration comparable to various dephasing 
mechanisms. The details of this field are not im¬ 
portant for the formal considerations here, but can 
play a role in the numerical method used to solve 
the equations. The eigenvalue problem for the 
Bloch Hamiltonian is described by 

h 0 if/ a (k) = e a (k)0- a (k), (3) 

where a denotes the band index, and k, the mo¬ 
mentum quantum number (here and in the follow¬ 
ing we set ft = 1). 
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The initial state of the system is defined by an 
N-particle density operator, p(l,..., N). This could 
be the Gibbs distribution if the electron-lattice 
system is initially in equilibrium, but it is not 
necessary to specify its form at this point. Im¬ 
plicit in the definition of the density matrix is an 
N-particle symmetrization operator so that a repre¬ 
sentation of its matrix elements in terms of unsym- 
metrized products of single particle states is possi- 
ble [5]. This is an important feature that will be 
exploited in the next section to extract the ex¬ 
change correlation of the reduced density opera¬ 
tors associated with p(l,..., N). The time evolu¬ 
tion of the density operator is governed by the 
Liouville-von Neumann equation 

d t p(t) + i[ H(t), pit)] =0. (4) 

The reduced density operators for m particles are 
defined by 

/ (m) (l,..., m; t) = N m Tr m+l N p(t). (5) 

The notation on the right side indicates a trace 
over the degrees of freedom associated with parti¬ 
cles m + 1 through N. More precisely, in matrix 
representation, it denotes summation over the di¬ 
agonal quantum numbers for these particles. The 
reduced density operators / (m) inherit an M-par- 
ticle symmetrization operator from the corre¬ 
sponding N-particle symmetrization operator in p. 
The BBGKY (Born, Bogoliubov, Green, Kirkwood, 
Yvon) hierarchy equations [5,6] for the time de¬ 
pendence of the reduced distribution functions 
now follow directly from a partial trace of the 
Liouville-von Neumann equation, 

m 

0 + E i[(h 0 (i) - fi(i) • E(f)), 

i= 1 

m; 0] 

m 

+ E Av{i, 1/..m; 0] 

i =£ j 

m 

= - ETr m+1 i[V(i,m + 1 ), 

i = 1 

/ (m + 1) ( 1. m + 1; £)]. (6) 

The left side of this equation is just the 
Liouville-von Neumann equation for m isolated 
particles. The right side expresses a coupling to the 


other degrees of freedom through the Coulomb 
interactions. In the following, it will be sufficient 
to consider only the cases with m — 1 and 2, 

t) + z[H(l;f),/ (1) (l;0] 

+ Tr 2 ![y(l,2),/ (2) (1,2;0]=0 (7) 

<? f / (2) (l, 2; t ) + i[ H( 1,2; t), / (2) (1,2; t )] 

2 

+ E Tr 3 z[ V 7 (z, 3),/ (3) (1,2 / 3; f)] = 0, (8) 

i= 1 

where the single-particle and two-particle Hamil¬ 
tonians are given by 

H(l; t) = h Q ( 1) - jxQ) * E(f) H(l,2; t) 

2 

= E (ft 0 (O - MO • E(f)) + Ml,2). (9) 

i= 1 

These equations are exact but formal since they 
are not closed in terms of / (1) and alone. It is 
necessary to construct a suitable approximation 
that expresses / (3) as a functional of / (1) and / (2) . 
Then Eqs. (7) and (8) provide the means to calcu¬ 
late / (1) and / (2) . In particular, if the equation for 
/ (2) is solved as a functional of / (1) , i.e., 

/ (2) (1,2; t) = F (2) (l,2; t l/ (1) ), (10) 

then use of this solution in the first hierarchy Eq. 
(7) provides a closed kinetic equation for / (1) : 

d t f (l) (l;t) + f[HQ;0,/ (1) (l;0] 

+ Tr 2 f[V(l,2),F®(l,2;M/ (1) )] = 0 . ( 11 ) 

This is the approach proposed here for obtaining 
the appropriate kinetic equation for charge carri¬ 
ers. The procedure is representation-independent, 
and as a first-order equation in time, it is appropri¬ 
ately posed as an initial value problem. This is in 
contrast to the alternative nonequilibrium Keldysh 
Green's function method which requires a detailed 
reconstruction of the specified initial conditions 
from an artificial past history [7]. 


Exact Exchange Effects and 
Correlations 

There are two important sources of correlations 
among the charge carriers. One is due to the long- 
range Coulomb interaction, while the other is due 
to the exchange symmetry among particles of the 


INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY 


931 



DUFTY ET AL. 


same species. It is useful to extract explicitly the 
dominant effects due to exchange symmetry by 
identifying the exchange operators implicit in the 
definitions of the reduced density operators. This 
will allow a more controlled identification of the 
residual correlations due to Coulomb interactions. 
The two- and three-particle reduced density opera¬ 
tors are expressed as 

/®(l,2;t) E/( 2 )(l,2;[)y(l,2) 

= ^(l,2)/ (2) (l,2;f) (12) 

/ (3) (1,2,3; f) = /< 3) ( 1,2,3; t)S*(l,2, 3) 

= Ml,2,3)/ (3) (l,2 / 3;0, (13) 

where j/(I, 2) and ,:/'(], 2,3) are the two- and 
three-particle antisymmetrization operators that 
convert product states for two and three electrons 
into antisymmetric states, respectively. Such sym- 
metrization factors occur necessarily from the defi¬ 
nition of the underlying N-particle density matrix. 
The last equality follows from the permutation 
symmetry of the reduced density operators and of 
the symmetrization operators. Next, the correlation 
operators associated with / (2) (1, 2; t) and 
/ (3) (1, 2,3; 0 are introduced: 

/ (2) (1,2; 0 = / (1) (1; t)/ (1) (2; t) + £(1,2; t) (14) 
/ (3) (1,2,3;0 

= / (1) (l; 0/ (1) (2; f)/ (1) (3; t) 

+ / (1) (1; t)g( 2,3; t) +/ (1) (2; t)g( 1,3; t) 
+ / (1) ( 3; t)g( 1,2; 0 + £(1,2,3; 0. (15) 

Since the symmetrization operators have been ex¬ 
tracted explicitly in (12) and (13), the correlation 
functions £(1,2;£) and £(1,2,3; f) are primarily 
measures of correlations due to the Coulomb inter¬ 
actions. Conversely, even when there are no corre¬ 
lations in / (2) (1,2; t) and / (3) (1,2,3; 0, the distri¬ 
bution functions / (2) ( 1,2; 0 and / (3) ( 1,2,3; 0 have 
two- and three-particle correlations induced by the 
antisymmetrization operators. Therefore, it is use¬ 
ful to rewrite the first two hierarchy equations in 
terms of £(1,2; t ) and £(1,2,3; 0* The first equa¬ 
tion becomes 

<? f / (1) (l; t) + H(l; f),/ (1) (1; 0] 

+ Tr 2 i[ V s (l,2),/ (1) (1; 0/ (1) (2; t)] 

+ 7V 2 i[ V s (1,2), £(1,2; 0] = 0, (16) 


where Vg(l,2) = V(l, 2)^(1,2) is the pair potential 
with exchange effects. The left side of this equation 
is recognized as generating the time-dependent 
Hartree-Fock dynamics so the equation becomes 

£? f / (1) (l; f) + i[H hf (l;t),f m (l;t)] 

= -Tr 2 i[V s (l,2), g(1,2; t)], (17) 

where the Hartree-Fock Hamiltonian is 

H hf ( 1; t ) = H(l; f) + V fc/ (1; f), 

Vfc/1; t) = TrJ w ( 2; f)V s (l,2). (18) 

This is the expected result: The mean-field 
Hartree-Fock dynamics is modified by a coupling 
to other electrons due to the presence of Coulomb 
correlations. 

The equation for the correlations, £( 1,2; t) f fol¬ 
lows from the second hierarchy equation. The ef¬ 
fects of the symmetrization operators are evalu¬ 
ated in detail in the Appendix leading to the exact 
equation for £(1,2; t ): 

d t g(l,2; t) + z'[h(1,2; t)g(l,2; t) 

-g(l,2;t)HHl,2;t)] 

+ 7> 3 z{[ V s (l,3),/ (1) (1; f)] 

Xg(2,3;(M2,3) 

+ [ V s (2, 3), f m (2; f)] 

X|(U;()y(l,3)) 

= — i{V(l,2; f)/ (1) (l; f)/ (1) (2; f) 

-/ <1) (l;0/ (1) (2;0V t (l / 2;0} 

2 

- E Tr 3 i[ V(/, 3), £(1,2,3; f) 

;=i 

X(1 -P 13 -P 23 )], (19) 

with the definitions 

H(l,2; t) — H hf (l; t) + H hf (2} t) + F(l,2;0 

( 20 ) 

V(l,2;0 = {1 -/ (1 >(1;0 -/ (1) (2;OMl,2). 

( 21 ) 

The first two terms on the left side describe pair 
dynamics generated by the Hamiltonian H(l, 2; t ). 
This Hamiltonian differs from H(l, 2) in two im¬ 
portant ways: First, the single-particle energies are 
renormalized to the mean-field Hartree-Fock 
energies. Second, the pair potential is modified 
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by the prefactor {1 ~/ (1) (l;f) -/ (1) (2;0) describ¬ 
ing "blocking" effects of the statistics. To see 
this, note that {1 - / (1) ( 1; t) - / (1) (2; 0} = 
{/z (1) (l; 0^ (1) (2; 0 - / (1) ( 1; 0/ (1) (2; 01 where 
/z (1) (l; 0 = 1 ™/ (1) (l;0 is the "hole" occupancy. 
Thus, the matrix elements of V(l,2;t) are re¬ 
stricted according to the occupation of carrier den¬ 
sity in the nonequilibrium state. Both the blocking 
and mean-field effects in H( 1,2;0 account for 
significant many-particle correlations in this effec¬ 
tive pair dynamics. The third and fourth terms on 
the left side describe polarization or screening ef¬ 
fects that are essential for a proper treatment of the 
long-ranged Coulomb interactions. In the classical 
limit, these are the linearized Vlasov operators 
leading to the Debye-Huckel dynamic screening 
and pair correlations. Here, these operators gener¬ 
ate the random phase or chain approximation in¬ 
cluding exchange effects. The first term on the 
right side of (20) is a source of correlations due to 
the "commutator" of V(l,2;t) with the uncorre¬ 
lated pair operator / (1) ( 1; 0/ {1) (2; t ); equivalently, 
the time derivative of the uncorrelated state is 
correlated due to the Coulomb interactions. Here, 
this source of correlations is due to the pair inter¬ 
action including blocking effects. Finally, all resid¬ 
ual many-body effects are contained in the last 
term on the right side describing three-particle 
correlations not associated solely with three-par¬ 
ticle exchange effects. 

Equations (17) and (20) are still exact and fully 
equivalent to Eqs. (7) and (8). However, the analy¬ 
sis of the exchange effects and the transformation 
to the correlation functions provides a form in 
which the dominant mean field, polarization, and 
blocking effects are made explicit. This is a more 
convenient form for introduction of appropriate 
approximations or application to specific prob¬ 
lems. 


Bloch Representation 

The above analysis has used only the abstract 
operator form of the reduced density operators 
and associated hierarchy equations. Specific appli¬ 
cations require a particular matrix representation. 
For example, a fully ionized plasma might best be 
described in terms of single-particle momentum 
states. To show the relationship of the density 
matrix approach to the standard equations of 
semiconductor physics, we consider a representa¬ 


tion using Bloch states *A ak (r) = (r I ak), h(l) I ak> 
= c a (k) | ak). Only the first hierarchy equation 
will be considered; the analysis for the second 
equation is similar: 

<? f / (1) (l; f) + i[H hf ( 1; f),/ (1) (l; 0] + C( 1; t ) = 0, 

(22) 

where C( 1; 0 is the "collision" operator due to 
pair correlations 

C(l; 0 = Tr 2 i[V s (l,2), g(l,2; £)]. (23) 

The band occupation densities, n a ( k; f), and the 
polarization densities, p aa <( k; t), are defined by 

n a ( k; t) = (ak |/ {1) (f) | ak) 

Paa'O*;= (ak I/ (1) (0 I a'k), (24) 

where the band indices a ^ a' in the definition of 
the polarization density p aot ,(k; t). To simplify the 
discussion, we consider the case of only two bands 
(conduction and valence). Also, we consider only 
homogeneous states, so that the one-particle den¬ 
sity matrix is diagonal in the wave vector, k. The 
equations are then found to be 

<? f n a (k; 0 - 2Im{(ak | H hf ( 1; t) | a'k)p a , a (k; t )} 

+ C aa (k; t) = 0 (25) 

d t p aa '( k; t) + /[(ak | H hf ( 1; t ) | ak) 

-<a'k|H h/ (l;0|a'k>] p aa ,(k; t) 

+ /<ak | H hf (l;t) I a'k> 

X[n a ,(k; t) - n a ( k; 0] 

+ C aa Tk; t ) = 0, (26) 

where it is understood that a ^ a'. The matrix 
elements in these equations are easily evaluated: 

<ak | H hf (l; t) \ vk) 

= e a (k)8 atV + (ak | V hf I ^k> - E (t) • tL av , (27) 

where /r av (k) = (ak | /x \ vk). Equations (25) and 
(26) then become 

d t n a (k) t ) - 2Im[[(ak | V hf \ a'k> - E (t) • /r aa> ] 

Xpc'a(k;f)} + c aa (k;f) = 0 (28) 

d t Paa'(k; 0 + z[(ak | H hf (l; t ) | ak> 

-<a'k|H h/ (l;0la'k>] Paa ,(k;0 

+ /[(ak | V h j | a'k) — E(f) • /A ao -] 

X(w a ,(k; 0 - n a ( k; f)) + C tta ,(k; t ) = 0. 

(29) 
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The matrix elements of the Hartree-Fock potential 
are easily evaluated and the resulting equations 
for n a ( k; f) and p aa -( k; t ) are found to be 

d t n c ( k; t) + 2Im(E(f) • /i cv p vc ( k; f)) 

+ 2Im£ V(i 7 )[n c (k + q;f)I cc (k / -q) 

q 

x/* c (k, -q) + n v ( k + q; f) 

XJ ep (k, -q)I 0 * p (k, -q) 

+ p co (k + q; t)l cc (k, -q)I* v (k, -q) 

+ p vc (k + q; t)I cv (k, -q)I* c (k, -q)] p vc ( k; t) 
+ C cc (k; f) = 0 (30) 

<9 f p C j,(k; f) + z'(e c (k) - e B (k) - E(f) 

£ P'cc P'vv 1) PczXk/ ^ 

_ *E^(<?)[ n c(k + q; 0 

q 

X (|J cc (k, — q)| 2 - |/„ c (k, —q)| 2 ) 

+ n v (k + q; f)(|f cc (k, -q)| 2 - \I vv (k, -q)| 2 ) 
+ 2p cv (k + q; f)Re(I cc (k, -q)f*(k, -q) 
-l vc (k, -q)f* 0 (k, — q))] p cv (k; t) 

-illlV(q)[n c (k + q; t) 

^ q 

X I cc (k, — q) I* c (k, -q) + n v ( k + q; t ) 
Xl cv (k, — q)/* 0 (k, -q) 

+Pc V ( k + q; f ) 

Xl cc (k, - q) 4* 0 (k, -q) + p„ c (k + q; f) 
Xl cv (k, — q) I* c (k, -q)] 

+ E(f) • fi cl }j 

X(n v (k;t) - n c (k; f)) + C cv (kyt) = 0. (31) 

Here, J aa ,(k,q[) arises from the Coulomb matrix 
elements 

< a 2 k x ; a 2 k 2 1y | a' 2 k 2 ; ) 

= ^k , 1 + k' 2/ k 1 + k 2 Zl^^)^k , 1/ k 1 + q 

q 

X fa.o^ki,-q)4 2 a’ 2 (k 2 ,q) (32) 


4a'(k'q) = / ^' q ' r Ck(*)^«'k-q( r ) 

= 7*. 0 (k - q,-q) (33) 

and V(q) is the Fourier-transformed pair potential. 
The valence and conduction occupation numbers 
are related by n c ( k; 0 + n q ( k; t) = constant, and 
the polarization densities are related by p cv = p vc f. 
These are the most general hierarchy equations. 

The matrix elements C aa ,(k; t ) describe both 
inter- and intraband collisions and can be ex¬ 
pressed in terms of the matrix elements of the 
correlation function g( 1,2; 0 using (23) 

C aa '( k; t ) 

= <ak|{7> 2 z[ Vg(l,2), £(1,2; £)]}|a'k> 

= i E E {<ak; v 1 k 1 |V s |v 2 k 2 ; v 3 k 3 > 

k 2 ,k 2 ,k 3 vi, v 2 , v 2> 

X(v 2 k 2 ; v 3 k 3 |g|a'k; Vjkj) 

-<ak; v l k 1 \g\v 2 k 2 - / v 3 k 3 ) 
X<z' 2 k 2 ;^ 3 k 3 |V s |a'k;^ 1 k 1 >}. (34) 

Further reduction follows from substitution of the 
Coulomb matrix element (32). 

In practice, the relevant values of q may be 
sufficiently small to justify the approximation 
I« „(k, q) -» I a! ,(k, 0) = 8 a v . Then, these equations 
simplify to the usual forms 

d t n c (k; t) + 2Im(E(f) • ix cv p vc (k; t)) 

+ 2Im£ V(q)p cv (k + q; t)p vc (k; t) 
q 

+ C cc (k; f) = 0 (35) 

d t p cv ( k; f) + z(e c (k) - e v (k) - E(f) 

[ P'aa Aa'a'])Fcy(k/ f) 

-z‘E k ; ( ( ?)[ w c(k + q; t) - n v (k + q; f)] 

q 

Xp cv (k;t) 

-il £ V(q)Pcv(k + q; f ) + E(/ )•• /x cr | 

l q > 

X(n v ( k; f) - n c ( k; f)) 

+ C ct ,(k; t) = 0, (36) 
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with the collision matrix elements 
C cc ( k; f) 

= — 21m Z V(q) 

q^0 / k 1 

X {<ck - q; ckj + q|^|ck; ck x ) 

+ <ck - q; I'kj + q|g|ck; pkj) 

-(ckj + q;ck - qlglck^kj) 

-<»k x + q;ck - qlglckjnkj)} (37) 

C c „( k; t) 

= -21m Z V(q) 

q^0 / k 1 

X {(ck - q;ck x + q|g|uk;ckj> 

+ (ck - q;pkj + q|g|nk; uk x >} 

— Z 17(^){<ckj + q;ck - qlflukjckj) 

q#0 / k 1 

+ (ukj + q; ck - q|g|uk; ukj) 

— (ck; ckjlglckj + q;nk - q> 

— (ck;t)kj|f|®kj + q;nk - q>}. (38) 

Aside from the small q limitation of the 
Coulomb matrix elements, the above equations are 
still exact. The detailed dependence of C acx ,(k; f) 
on n a ( k; t ) and p aa >( k; t ) requires specification of 
£(1,2; f) as a functional of / (1) (1; t). This follows 
from an approximate solution to Eq. (19). To illus¬ 
trate the procedure, consider the weak coupling 
limit in which £(1, 2; t ) is evaluated to first order 
in the Coulomb interactions. Since £(1,2,3; f) oc¬ 
curs in (19) multiplied by X^(l, 2), it is sufficient to 
evaluate £(1, 2,3; t ) to zeroth order in the poten¬ 
tial. However, since the correlations due to statis¬ 
tics have already been extracted, the three-particle 
correlation function vanishes to this order. Also, 
since the source term on the right side is of first 
order in the potential, then £( 1,2; t ) also is of first 
order. Thus, all explicit dependence on V(l, 2) can 
be neglected on the left side of the equation, lead¬ 
ing to the weak coupling equation 

d t g(l,2; t) 4- t[(H(l) +H(2)),f(l,2;f)] 

= - i{V(l,2; 0/ (1) (l; 0/ (1) (2; f) 

~f m ( 1; f)/ (1) (2; t)V f (l,2; t)}, (39) 

$ 


with the solution 

1(1,2; f) = g"«Ha)+H<2» t | (1/2;0)c< (H(l) + H(2))» 

- z(W (H( D+HPiXf-T) 

■'o 

X {17(1,2; r)/ (1) (l; t)/ (1) (2; t ) 

-/ (1) ( l; r) 

X/ (1) (2;r)V t (l,2;T)}e i(H(1)+H(2)Xf - T >. 

(40) 

Since the Hamiltonian H( 1) is diagonal in the 
Bloch representation, it is straightforward to calcu¬ 
late the matrix elements of £(1,2; f) given by (40) 
and obtain the collision operator as a quartic func¬ 
tion of the occupation numbers [recall F(l,2; t) = 
{1 — / (1) (1; 0 — / (1) (2; t)}V(l, 2)]. The result is a 
generalization of the Born-Boltzmann collision op¬ 
erator, extended to include the effects of initial 
correlations and non-Markovian effects at short 
times. In the long time limit, it becomes exactly the 
Uhlenbeck-Boltzmann collision operator with 
scattering calculated in the Born approximation. 
An improved approximation appropriate for most 
current experimental conditions is described in the 
next section, where strong scattering and polariza¬ 
tion effects are accounted for as well. 


Pair Correlation Approximation 

The exact transformed hierarchy Eq. (19) for 
£(1,2; t) is an appropriate form for the introduc¬ 
tion of approximations since it makes explicit the 
physical mechanisms for renormalized single-par¬ 
ticle states, blocking in the pair dynamics, and 
dynamic screening. The approximations entail 
some statement 1 about the three-particle correla¬ 
tions in £(1,2,3; t). More specifically, the approxi¬ 
mation should give £(1,2,3; t) in terms of the pair 
correlations and one-particle distribution function, 
so that the first two hierarchy equations become 
closed equations for / (1) (1; t) and £(1, 2;t). The 
formal solution to the second equation provides 
the functional in (10) required for a kinetic equa¬ 
tion 

F (2) (l,2; f | /W) 

= {/ (1) ( 1; f)/ (1) (2; t ) + 1(1,2; 1 1 / (1) )}^(1,2). 

(41) 
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In general, there is no small parameter on which to 
base a systematic expansion. However, there are 
exact structural properties of the hierarchy equa¬ 
tion that impose important constraints on the class 
of approximations considered acceptable. For ex¬ 
ample, the single-particle reduced density operator 
is representable as a trace over the two-particle 
reduced density operator, Nf (1) ( 1; t ) = Tr 2 / (2) 
( 1 , 2 ; t ) and, therefore, the exact first hierarchy 
equation for / (1) ( 1 ; t) should result from a partial 
trace over any acceptable approximate closure of 
the second hierarchy equation. Additional con¬ 
straints follow from the invariance of the hierarchy 
equations under symmetry transformations de¬ 
rived from the Hamiltonian (e.g., rotations, trans¬ 
lations, Galilean boosts). An important conse¬ 
quence of these invariances is the local conserva¬ 
tion laws for mass, energy, and momentum. Con¬ 
straints imposed by the requirement of exact 
conservation laws are discussed in [5,8,9]. Other 
constraints can be included as well. For example, if 
the reduced density operators for the equilibrium 
state are known, it can be required that they are 
also solutions to the approximate hierarchy equa¬ 
tions. In this way, the class of approximations can 
be assured to have the most important properties 
of the exact hierarchy in a context that does not 
imply weak coupling or other small parameter 
limits. 

For long-range Coulomb interactions, it is im¬ 
portant to describe screening effects. This is in¬ 
cluded explicitly on the right side of ( 20 ), as are 
the mean field Hartree-Fock-renormalized single- 
particle energies and the Pauli blocking effects for 
the pair potential. Thus, all of the important mech¬ 
anisms appear in this form even before consider¬ 
ing three-particle correlations in g( 1,2,3;f). Fur¬ 
thermore, it is shown in [5] that any choice for 
g(l, 2,3; t) that is Hermitian and pairwise symmet¬ 
ric in the particle labels preserves both the repre- 
sentability of / (1) and the exact conservation laws. 
Consequently, we suggest that the residual three- 
particle correlations described by g(l, 2 ,3; 0 can 
be neglected entirely, except for the conditions of 
very strong coupling. The resulting approximation 
preserves the exact local conservation laws for 
mass, energy, and momentum and is time reversal 
invariant. It contains strong collision effects (ladder 
diagrams) through the dependence on the poten¬ 
tial in H( 1 , 2 ; t) and dynamical polarization effects 
in the random-phase approximation (ring dia¬ 
grams). There are no a priori limitations on the 


time scale in this approximation so that a uniform 
treatment of the initial value problem is possible. 

With this approximation, Eq. (19) becomes a 
closed equation for g( 1 , 2 ; t) in terms of / (1) ( 1 ; t), 
with the form 

( d t - L(l, 2 ; f))g(l, 2 ; t) 

= - i[v(l, 2 ;t)f m (l;t)f m ( 2 ;t) 

—/ (1) ( 1; f )/ 0, ( 2; t)VH 1,2; f)]. (42) 

The formal solution provides the functional in (10) 
required for a kinetic equation 

F <2) ( 1,2; t |/W) = { f m ( 1; f}/ (1) (2; f) 

+g(l,2;t\f m )}^(l r 2), (43) 
f(1,2; t | f m ) = Tr 34 (J(l,2,3,4; t, f 0 )f (3,4; f 0 ) 

— iJr M f dtU(l,2, 3,4; t,i) 

X [ V(3,4; f )/ ( 1 ) (3; f )/ ( 1 > (4; t) 

-/< 1 ) (3;D/ ( 1 ) (4;?)V t (3,4;?)]. 

(44) 

Here, 11(1,2,3,4; t, t 0 ) is the two-particle propaga¬ 
tor associated with the generator L(2,2; t). Its form 
and reduction to familiar quantities such as scat¬ 
tering matrices and dielectric functions will be 
discussed elsewhere [ 10 ] and only the structural 
features noted here. The first term on the right side 
of (44) gives the pair dynamics due to initial corre¬ 
lations determined from the system preparation. 
The second term describes the correlation buildup 
even in the absence of such initial correlations. 
Both contributions can be of equal importance for 
initial value problems at short times. 

The collision operator in the kinetic Eq. (22) 
reflects these two contributions: 

C(l; t) - — z‘Tr 2 [ Vg(l, 2), £(1,2; t)] 

= jT c (1; t) + 1(1; t) (45) 

Hl;0 = -zTr 234 [V s (l,2), 

li(l,2,3,4; t, £ 0 )g(3,4; f 0 )] (46) 

1(1; t) = — (TI 234 f*di\v s (l,2), 17(1,2,3,4; t,t) 

x{V(3,4;t)f m (3;t)f m (4;t) 

-f m ( 3; f)/ ( 1 ) (4; f )V f (3, 4; f)}]. (47) 
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The Hartree-Fock dynamics is modified at short 
times by the conditions of the initial preparation 
through I{(0. The other modification is indepen¬ 
dent of the initial preparation and represents the 
buildup of scattering effects. The term 1(1; t) grows 
from zero to an asymptotic value J( 1;°°), while the 
initial correlations term J((0 is expected to de¬ 
crease to zero for most physical preparations. This 
initial transient period represents the transition 
from time scales short compared to the scattering 
time up to times long enough for completed colli¬ 
sions. On the long time scale, C(l; t) -> 1(1; °°) and 
the kinetic equation takes the usual Markov form. 
In this limit, a Boltzmann-Uhlenbeck form is ob¬ 
tained with scattering determined by the full T- 
matrix [11]. During the transient period, the in¬ 
complete scattering is described by an effective 
potential with dynamical screening ranging from 
Coulomb at short times to fully screened at long 
times. 


Discussion 

The objective of this presentation has been to 
give an overview of an approach to formulating 
practical kinetic theories for charged particle dy¬ 
namics, with the potential for application to cur¬ 
rent and proposed experiments on semiconductors 
using femtosecond lasers to prepare and probe the 
charges. The new features of such experiments 
requiring reconsideration of standard kinetic the¬ 
ory are states far from equilibrium and the inclu¬ 
sion of short time scales. The approach here formu¬ 
lates the description in terms of the exact hierarchy 
for the one- and two-particle reduced density op¬ 
erators. These operators determine most directly 
the relevant observables of interest. There are two 
components to the analysis. First, an exact trans¬ 
formation of the first two hierarchy equations is 
performed to isolate the strong correlations due to 
quantum statistics from correlations due to the 
Coulomb forces. This description in terms of the 
excess correlations also makes explicit the mean- 
field effects associated with Hartree-Fock renor¬ 
malization of the single-particle energies and the 
dominant polarization effects necessary to account 
for screening of the long-range Coulomb forces. As 
a single-time Markovian first-order set of differen¬ 
tial equations, the description is formulated as an 
initial value problem, appropriate for the experi¬ 
mental conditions of interest. The second compo¬ 


nent of the analysis is an approximate closure of 
the second hierarchy equation, with the three-par¬ 
ticle correlations expressed as explicit functionals 
of the two-particle correlations and the one-par¬ 
ticle reduced density operator. The resulting ap¬ 
proximation transforms the first two hierarchy 
equations into a closed set of coupled equations to 
determine the one-particle-reduced density opera¬ 
tor and the two-particle correlations. Finally, a last 
step is possible where the approximate second 
hierarchy equation is solved formally to give the 
two-particle correlations in terms of the one-par¬ 
ticle density operator. Use of this result in the 
exact first hierarchy equation gives a closed kinetic 
equation for the one-particle density operator. 
While the coupled pair of equations for the corre¬ 
lations and one-particle distribution is local in time 
(Markovian), the reduced kinetic theory descrip¬ 
tion in terms of the one-particle distribution alone 
is necessarily nonlocal in time. This latter feature is 
essential for a proper treatment of relaxation and 
transport at short times. 

The transformation of the hierarchy and result¬ 
ing explicit representation of the most important 
physical mechanisms simplifies the choice of ap¬ 
proximations considerably. Additional constraints 
are associated with the exact relationship on one- 
and two-particle reduced density operators, the 
exact conservation laws, and the existence of sta¬ 
tionary states (equilibrium, in the absence of driv¬ 
ing fields). It is proposed in the fifth section that 
these constraints are satisfied by an approximation 
obtained by neglecting £(1,2,3; 0. The resulting 
approximate equations retain all of the desired 
physical mechanisms and time scales, excluding 
only conditions of strong Coulomb coupling. The 
detailed solution to the equation for the pair corre¬ 
lations is essentially an effective two-body prob¬ 
lem and will be discussed elsewhere. 

The analysis of the hierarchy and considerations 
leading to this approximation occurs at the ab¬ 
stract operator level and does not require explicit 
attention to the detailed structure of the physical 
system considered (e.g., the number of bands) and 
additional processes are easily accounted for at 
this abstract level. Having completed the many- 
body analysis, the physical properties of interest 
follow directly from an appropriate matrix repre¬ 
sentation of the operator equations, so that manip¬ 
ulation of complex notation is deferred to this late 
stage. Of course, the final measure of any ap¬ 
proach is the ability to describe interesting new 
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phenomena in quantitative applications. We hope 
to report on such results in the near future. 
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Appendix: Transformation of the 
Hierarchy Equations 

Define a superoperator, ^ 12 , that exchanges the 
labels of particles 1 and 2 in all operators to its 
right: 

X(l, 2 ) = X(2,l). (48) 

This operator should not be confused with the 
permutation operator defined over quantum states, 
P 12 , that permutes the quantum labels of the states. 
The second equation of the BBGKY hierarchy then 
can be written in the compact form: 

«? ( /< 2 ) (l,2;f) + i[H( l,2;f),/ ( 2 ) (l,2;f)] 

+ (1 + ^> n )Tr 3 i[V (1,3) f 3) (1,2,3-,t) - he] 

= 0, (49) 

where he indicates the Hermitian conjugate of the 
preceding term in the brackets. The objective of 
this Appendix is to show how the exchange effects 
in the definitions of / ( 2 ) ( 1 , 2 ; f) and / ( 3 ) ( 1 , 2 ,3; 0 
can be evaluated exactly to give the Hartree-Fock 
energies, the blocking factors in the pair potential, 
and the linear operators responsible for polariza¬ 
tion effects. Substitution of (14) into (49) and use of 
the first hierarchy equation gives 

y’H, 2 ){o , ,g ( 2 ) (l / 2 ; f) + z'[H(l, 2 ; f), g ( 2 ) (l, 2 ; f)] 
+ z[y(l, 2 ;f),/ ( 1 >(l;f)/ ( 1 ) ( 2 ;£)]} 

-S*( 1,2)(1 + & 12 )TV 3 z[ V(l, 3) / (2) ( 1,3; t ) 

X/ ( 1 ) (2; 0^(1,3) - he] 

+ (1 + ^ 12 )Tr 3 z 

X[V(1,3)/ ( 3 ) (1,2,3;0 

X^(l,2,3) - he] = 0. (50) 

First, note that the last term can be written with an 
overall prefactor of 2 ) as well using the iden¬ 


tity 27tl, 2,3) = (1 — P 13 — P 23 )c5^(1, 2 ) and the fact 
that cS^Q, 2 ) commutes with any symmetric two- 
particle operator. Consequently, the overall opera¬ 
tor , 5 * 0 , 2 ) in (51) can be factored out, leaving the 
simpler result: 

d t g (2) (l,2;t) + z[H(l, 2 ;£)g< 2 >(l, 2 ;f)] 

+ z[y(l, 2 ;£),/ ( 1 ) (l;f)/ ( 1 ) ( 2 ;f)] 

-(1 +^ 12 )Tr 3 zmi,3)/ ( 2 ) (l,3;f) 
Xf>(2;t)y(l,3) - he] 

+ (1 +^ l2 )Tr 3 i 

X [y(l,3)/ ( 3 ) (l,2,3; £)(1 - P 13 - P 23 ) - he] 
= 0. (51) 

Next, represent / ( 3 ) (1,2,3; t ) in the last term using 
(15) in the equivalent form 

/< 3 ) (l,2,3;f) 

= / ( 2 ) ( 1 ,3; t)f m ( 2 ; t ) + / (1) ( 1 ; t)g(2, 3; t) 

+ / ( 1 ) (3; t)g(l,2; f) + g(l,2,3; f) 

to obtain 

V(l,3)/ ( 3 ) (1,2,3; t)(l - P 13 - P 23 ) 

= V(l,3)/ ( 2 ) (1,3; 0/ ( 1 ) (2; £)(^(1,3) - P 23 ) 
+ V(l,3){g(2,3;£)/ ( 1 ) (l;i) 

+ g(l,2;f)/ ( 1 ) (3) 

+£(1,2,3; f)}(1 - P 13 - P 23 ). (52) 

The contribution proportional to ^(1,3) cancels 
the terms on the second line of (51) and the equa¬ 
tion for g t2> (l, 2 ; t) becomes 

<? f £ ( 2 ) (l, 2 ; t) + z'[H(l, 2 ; t), £ ( 2 ) ( 1 , 2 ; £)] 

+ i[V(l,2;0 / / (1) (l;f)/ (1) (2;0] 

-(1 +^ 12 )Tr 3 z[y(l,3)/ ( 2 > (l,3;t) 

X/ ( 1 ) (2; f)P 23 - ftc] 

+ (1 + ^ 12 )Tr 3 z[V (1,3) 

X {£(2,3; f)/ ( 1 ) (l; t) + f (1,2; f)/ ( 1 ) (3)} 

X (1 — P 13 ~ P 23 ) — 

+ (1 + ^ 12 )Tr 3 z[V(l,3)£ ( 3 ) (l,2,3; f) 

X (1 - P 13 - P 23 ) - he] = 0. (53) 
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The fourth term on the left side can be simplified 
using the identities 

^ 23^23 = 1/ ^ r 3^23 = 1/ / v 

(54) 

P 23 X(l / 2 / 3)P 23 = X(l,3 / 2) 

to write 

Tr 3 V(1,3)/ ( 2 ) (1,3; f)/ ( 1 ) (2; f)P 23 

= Tr 3 P 23 P 23 V(l,3)/ ( 2 >(l,3; f)/ ( 1 ) (2; t)P 23 
= Tr 2 P 23 V(l,2)f (2 Hl,2; t)/ ( 1 >(3; t) 

= Tr 3 f m (3; OP 23 V(l / 2)/P ) (l,2; f) 

= Tr 3 P 23 P 23 fV(3 ; OP 23 y(l,2)/< 2 >(l,2; 0 
= Tr 3 P 23 f m (2; t)V( 1,2)/ (2) (l,2; t ) 

= / ( 1 ) (2; f)V(l,2)/ ( 2 ) (l,2; t). (55) 

The second equation of the BBGKY hierarchy now 
becomes 

*,£(1,2; f) + i[(H(l;t) + H(2; 0), g <2) (l,2; 0] 

+ i[v(l,2; 01(1,2; f)-fcc] 

+ (1 +^ 12 )Tr 3 i[ V(l, 3 ){g( 2 ,3; f)/ ( 1 ) (1; t ) 
+1(1,2; f)/ ( 1 ) (3)}(l - P 13 - P 23 ) - fee] 

+ i[y(l,2; 0/ (1) (l; t)/ (1) (2; f) 
-/W(l;f)/ (1) (2;f)F t (l,2;t)] 

+ (1 +^ 12 )Ir 3 i[ 1 /( 1 ,3)f' t 3 >(l, 2 ,3; t ) 

X (1 - P 13 - P 23 ) - fee] = 0, (56) 

with the definition 

V(l,2; t) = [1 -/ (1 >(l;f) -/ 0) (2;O]V(l,2). 

(57) 

This shows clearly the origin of the Pauli blocking 
factor in the pair potential. 

The remaining exchange terms in the second 
line of (56) can be analyzed in a similar way: 

Tr 3 V (l,3)[g(2,3;0/ a) (l;0 

+g(l,2;f)/< 1 >(3)](l -P 13 -P 23 ) 

= Tr 3 V(l,3)[g(l,2; t)f m (3) 
-f(2,3;f)/ ( 1 >(l;t)P 13 ] 
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+ Tr 3 V(l,3)[g(2,3; f)/ ( 1 ) (l; f)(l - P 23 ) 
-g(l,2;t)/ ( 1 ) (3)(P 13 + P 23 )] 

= Tr 3 V(l,3)[f(1,2; 0/ ( 1 ) (3) 
-P 13 P 13 g(2,3;f)/( 1 >(l;f)P 13 ] 

+ Tr 3 V(l, 3)[g(2,3; f)/ { 1 ) (l; 0(1 - P 23 ) 
-P 13 P 13 g(l,2;f)/W(3)P 13 (l + P 13 P 23 )] 

= Tr 3 V s (l,3)/ ( 1 ) (3)g(l,2; f) 

+ Tr 3 V(l,3)£(2,3;0/ <1) (l;0(l - P 23 ) 
-P J3 g(2,3;0/ (1) (l)(l + P 13 P 23 ). (58) 

The first term on the right side gives the 
Hartree-Fock single-particle energy shift: 

V h/ (l;t) = Tr 3 V s (l,3)f™(3), 

^(1,3) s V(1,3)'F(1,3). 

The second term can be transformed by writing 
the factor (1 + P 13 P 23 ) = (1 ~ P 23 ) + (1 + P 13 )P 23 
= (1 — P 23 ) + P 23 (l + P 12 ). Recall that an overall 
factor of 2 ) has been canceled out in writing 
(56). Since (1 + P 12 )c5 !! tl / 2) = 0, the term P 23 (l + 
P 12 ) vanishes in this context and Eq. (58) simplifies 
to 

Tr 3 V(l / 3)[f(2 / 3;0/ (1) (l;0 

+ g(l,2; 0/ ( 1 ) (3)](l — P 13 — P 23 ) 

= V A/ (l,2;0f(l,2;0 

+ Tr 3 V s (l,3)g(2, 3; 0/ (1) (l; 0(1 - P 23 ). 

(59) 

The second term gives the polarization effects with 
exchange. The exact equation for g( 1,2; t) now 
takes the final form of (19). 
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ABSTRACT: Using a molecular cluster with 71 atoms, we studied the electronic and 
structural properties of nitrogen vacancy, boron antisite, and oxygen in cubic boron 
nitride through an ab initio Hartree-Fock calculation. We found that all defects introduce 
a deep state above the middle-energy gap. These defects present a C 3v local symmetry. In 
the case of nitrogen vacancy, the possibility of the F-center formation is discussed. All the 
calculations were performed using the program code GAMESS. © 1997 John Wiley & Sons, 
Inc. Int J Quant Chem 65: 941-946, 1997 


Introduction 

B oron nitrides exist in both high-density 
(tetrahedral diamondlike) and low-density 
(graphitelike) forms or modifications. Of the high- 
density forms, cubic boron (c-BN) is unique among 
the group III nitrides in that its zinc-blende struc¬ 
ture is the most stable and best characterized form. 
Then, c-BN, the lightest of III-V semiconductors, 
is a large band-gap material (-6.4 eV) [1] which 
has several similarities compared with diamond, 
such as crystal structure, wide energy band gap, 
extreme hardness, high thermal conductivity, high 
melting temperature, and low reactivity [2]. But 
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while diamond can be readily doped only with the 
p-type, c-BN can be doped p- and rc-types when 
suitable impurities are added [3]. Although it could 
be an important material for devices of high tem¬ 
perature and high pressure, with many applica¬ 
tions in industries in the future, the quality of the 
samples remains as a major problem. If we com¬ 
pare it with other III-V materials, c-BN has a very 
high concentration of native defects. 

In this article, we report a theoretical calculation 
of the behavior of native defects in c-BN. The 
electronic and structural properties of N-vacancy 
(Vac [N p, boron antisite (B [N] ), and oxygen substi¬ 
tutional (0 [N] ) are calculated. Although several 
calculations are available for pure materials [4], 
there are only a few works concerning the theoreti¬ 
cal study of the defects in c-BN [5,6]. Particularly, 
an interesting work was performed by Gubanov et 
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al. [6] using LAPW-ASA-TB for vacancies. How¬ 
ever, in that calculation, the authors did not in¬ 
clude relaxation or asymmetric distortions, basi¬ 
cally because they used the muffin-tin approach 
for the potential. These effects, as we will show, 
are important to interpret the experimental results. 
As far as we know, this is the first work to con¬ 
sider distortions induced by the defects in the 
calculation. 

Theoretical Procedure 

We simulated the pure crystal through the 
molecular cluster with 71 atoms 

(nb 4 n 12 b 12 n 6 h 36 ) 3+ 

initially in Td symmetry, as seen in Figure 1. The 
hydrogen atoms are introduced to saturate the 
dangling bonds at the cluster surface, as usual 
[7,8]. The net charge of 3 4- at the cluster is neces¬ 
sary to get a neutral charge condition as in a 
crystal. Having defined the molecular cluster, the 
total energies are calculated using the all-electron 
Hartree-Fock method in the usual linear combina¬ 
tion of atomic orbitals procedure. The atomic basis 



FIGURE 1 . Molecular cluster with 71 atoms 
(NB 4 N 12 B 12 N 6 H 36 ) 3+ , initially in Td symmetry, with the 
hydrogen atoms introduced to saturate the dangling 
bonds at the surface. 


adopted here is the minimal split-valence 3-21G 
basis set [9], which consists of 6s3p (3 s) primitive 
Gaussian functions contracted in 3s 2 p (2 s) basis 
functions on each B and N (H) atoms. The size of 
the set thus ranges from the smallest 242 (for the 
Vac [N] ) to 247 (for 0 [N] and B [N] ) contracted 
Gaussian-type functions. The calculations were 
performed using the program code GAMESS [10]. 
Before calculating the distortions on the geometric 
structure, when defects substitute the N central 
atom in the cluster, we first optimized the defect 
nearest-neighbor bond distances with respect to 
the vibrational mode. This is necessary to avoid 
spurious forces in the defect cluster . As we know 
from the literature [8], this cluster size is appropri¬ 
ate to study single-impurity distortions. 

For the pure c-BN, the calculated eigenvalue 
separation between the highest occupied molecu¬ 
lar orbital (f 2 -HOMO) and the lowest unoccupied 
molecular orbital (^-LUMO) obtained is 8.07 eV. 
A better result for the band gap is obtained using 
the Mulliken correction [11]: A E(HOMO — 
LUMO) = &IUMO ^ homo / "F 2 K, where / and 
K are the Coulomb and exchange integrals, respec¬ 
tively. Using this correction, the result for the 
energy gap drops to 6.53 eV, in excellent agree¬ 
ment with the experimental results [1]. 


Results and Discussion 

We show schematically in Figure 2(a), (b), and 
(c) the electronic energy levels in the gap region 
for the B [N] , Vac [N] , and 0 [N] , respectively, using 
a defect molecule model. On the left side is always 
the pure system with an a\t\ configuration. Our 
procedure to estimate the energy levels in the gap 
was to compare the pure cluster and the defect 
clusters. The electronic energy levels showed in 
Figure 2 were obtained as differences between the 
calculated Hartree-Fock eigenvalues. Fortunately, 
in all defects studied here, the density of states at 
the top of the valence band practically does not 
change. In this sense, the top of the valence, used 
as a reference for the calculated electronic levels, is 
fairly well defined. 

As one can see in the middle of Figure 2(a), the 
electronic consequence of B [N] , before structural 
minimization, is to introduce a f 2 -level occupied 
with four electrons in the gap region. This configu¬ 
ration presents a degenerated ground state and, 
consequently, is favorable for a Jahn-Teller distor- 
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FIGURE 2. Schematic representation of the energy levels in the gap region of c-BN for (a) boron antisite, (b) nitrogen 
vacancy, and (c) oxygen substitutional. 


tion. The optimized geometric structure has a C 3v 
symmetry, where the t 2 orbital is split in a twofold 
degenerate fully occupied e orbital and an a x 
empty orbital, as shown on the right side of Figure 
2(a). The fully occupied e-level, with a ligand 
character ( 77 -type), is located 4.81 eV above the top 
of valence band, resulting in a deep donor level. In 
Figure 3(a) and (b), we show the total charge 
densities in the (110) plane for the c-BN and c- 
BN:B [N j systems, respectively. From Figure 3(b), 
we see that the B antisite atom makes bonds with 
the B atoms in the cr v plane of the C 3v symmetry. 


Concerning the structural properties for the B [N] 
defect, our results show that the antisite B atom is 
found in an off-center position. We first performed 
a breathing mode total energy minimization which 
leads to a small inward relaxation of 5% in the 
bond length. Starting from this relaxed configura¬ 
tion, we calculated the total energy changes of the 
B [JV] atom along the (111) and (100) directions, as 
shown in Figure 4. For the distortion in the (111) 
direction, toward an intersticial position in a C 3v 
configuration, we find a stable position with an 
energy gain of 1.34 eV and a displacement of the B 



FIGURE 3. Contour plot of the total curves for displacements of B antisite in (110) plane for (a) pure c-BN and (b) B 
antisite. 
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atom of 0.28 A. In addition, the distortion in the 
(110) direction gives a metastable configuration 
with C 2v symmetry, where the energy gain is 
0.65 eV and the displacement of the antisite B 
atom is 0.16 A. 

With respect to the Vac [N] , the calculated 
breathing mode relaxation of the first neighbor 
boron atoms gives an outward displacement of 
0.31 A for these atoms. A subsequent C 3v mode 
distortion was observed with one boron atom, B v 
following a (111) direction and the three others, 
B 2 , B 3 , and B 4 , moving closer to form a triangle, as 
seen in Figure 5. 

As shown in the middle of Figure 2(b), the 
removal of an N atom, Vac [N] , results in a strong 
perturbation in the electronic configuration of the 
system with two orbitals, an a x fully occupied in 
the gap region and a t 2 singly occupied resonant 
in the conduction band, in agreement with 
Gubanov et al. [6]. After the total energy mini¬ 
mization, a C 3v configuration is obtained. The t 2 
orbital splits in an a x plus an e orbital, as shown 
on the right side of Figure 2(b). In the gap region, 
there are two levels with a x character (1 a\ 2a\) 
located at 1.76 and 4.90 eV above the top of 
valence band, respectively. 

In Figure 6, the wave functions for the two 
highest occupied orbitals (la 1 2a 1 ) of the neutral 


B2 



FIGURE 5. Vac [W] the C 3v mode distortion, with one 
boron (B1) following a (111) direction and the three 
others (B2, B3, and B4) moving closer to form a triangle. 

vacancy are presented. In the figure, the (a) and (c) 
planes contain the Bj-Vac^ centers, while the (b) 
and (d) planes the B 2 -Vac-B 4 ones. The fully occu¬ 
pied 1 a-y defect orbital in the gap region has a 
7 r-character with a charge density shared among 
the three equivalent (B 2 , B 3 , and B 4 ) boron atoms 
near the vacancy. This trapped charge resembles 
the F centers that arise in anion vacancies in 
alkali-halide crystals [12]. Particularly, the forma¬ 
tion of F centers in hexagonal BN (h-BN), due to 
nitrogen vacancy, was proposed by Zunger and 
Katzir using an extended Huckel method [13]. The 



FIGURE 4. Calculated total energy curves for displacement of B antisite in (100) ( C 2v ) and (111) (C 3v ) directions 
(see text). 
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FIGURE 6. Contour plot of the (a,b) 1 a x and (c,d) 2a 1 
orbitals. The (a) and (c) planes contain the B r Vac-B 3 
centers, and the (b) and (d) planes, the B 2 -Vac-B 4 
centers (see text). 


singly occupied 2 a x orbital above the middle gap 
has an electronic distribution strongly localized on 
the B x boron with a 77-character. Although there 
are some similarities between the highest occupied 
orbitals in c-BN and h-BN, like the 77 -character and 
charge-confinement, a very important difference in 
the energetic ordering of the electronic levels is 
remarkable. Differently from h-BN, the c-BN could 
present an F center only for the doubly ionized 
nitrogen vacancy (Vac + + ). The essential role played 
by carbon doping for the formation of F centers in 
h-BN [14] could also be seen in the case of c-BN, at 
least for high concentrations of these impurities 
when a recombination occurs between the electron 
localized at the 2 a x orbital and the hole introduced 
by the carbon atoms. It is in agreement with exper¬ 
imental EPR results by Fanciulli and Moustakas 
[15]. 

The electronic energy levels for the relaxed 
(right) and unrelaxed (left) configurations at T d 
symmetry for the 0 [N] system are shown schemat¬ 
ically in Figure 2(c). We obtained a singly occu¬ 
pied a x level with antibonding character that comes 
from the conduction band 4.70 eV above the top of 
the valence band. Also, a hyper-deep a x state be¬ 
low the top of valence band and a resonant p-type 
state in the valence band are obtained. By this 
analysis, we see that the substitutional oxygen 
impurity introduces an a x antibonding state in the 
gap region. 

We obtained the energy minimum when the 
first neighbors relax — 0.16 A outward (10% in 
the bond length). But if we look carefully at the 


total energy surface, we observe that the system 
distorts from the T d symmetry toward a C 3v con¬ 
figuration, with the oxygen and the boron atoms 
moving oppositely in the (111) direction. The oxy- 

o 

gen and boron displacements are 0.03 and 0.18 A, 
respectively. 


Final Remarks 

In conclusion, we have discussed the role played 
by the native defects in c-BN using an ab initio 
Hartree-Fock method within the cluster approach. 
For the neutral isolated boron antisite, a 
Jahn-Teller distortion in the (111) direction is 
observed. For the neutral N vacancy, a strong 
relaxation is found: highly localized a x orbital in 
the vacancy region (F-center precursor) and a t 2 
orbital resonant in the conduction band appear 
before the lattice relaxation. The relaxation (+ dis- 
toration) splits the t 2 orbital, leading to a half- 
occupied a x level, close to the conduction band. 
Our calculations enable us to suggest that the 
strong EPR signal seen in c-BN [15] could be 
provided by the trapping of two electrons by 
acceptor levels introduced by carbon doping. For 
the neutral 0 [N] , we found that an a x orbital, 
half-occupied, is introduced above the middle of 
the valence band after lattice relaxation. 
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ABSTRACT: Magnetism, conductivity, and superconductivity of molecular materials 
are discussed on the basis of extended Hubbard-Peierls Hamiltonians, which involve 
four parameters; transfer integral (T), electron-lattice interaction (W), and on-site and 
intersite Coulomb interactions (U, V ). The four-component spinor is introduced to define 
order parameters which characterize four different electronic states (charge density wave, 
spin density wave, singlet and triplet superconductivities) in a unified fashion. Several 
intermediate regions in these parameters are interesting and important from current 
theoretical reasons. Many experimental results clearly show the importance of 
intermediate correlation regimes for active controls of these states by chemical 
modifications and by external fields such as high pressure. Several model Hamiltonians 
such as CT and t-J models are derived for elucidation of potential electronic properties 
of molecule-based materials. Recent computational results based on the t-J model are 
utilized to rationalize our / model for spin-mediated superconductivity. Possibilities of 
magnetic conductors, charge- and/or spin-mediated superconductors, and photoinduced 
superconductors instead of the charge-mediated Little model are examined in the 
intermediate region of several metal-insulator transitions. Possible candidates are also 
proposed on the basis of various circumstantial experimental results. Implications of the 
calculated results are finally discussed in relation to the recent development of the 
intersection area among conducting, magnetic, and optical molecular materials. © 1997 
John Wiley & Sons, Inc. Int J Quant Chem 65: 947-964, 1997 
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Introduction 

I n the past decade, magnetism and electron 
correlation have attracted much interest in re¬ 
lation to high-T c superconductivity in copper ox¬ 
ides [1]. Many experimental and theoretical efforts 
have been made for the elucidation of possible 
interplay between p- and d-electrons of transition- 
metal oxides in the intermediate or strong elec¬ 
tron-correlation regime [2]. In previous articles 
[3-5], we examined model Hamiltonians for cop¬ 
per oxides to elucidate two key points: (1) How 
many active orbitals are necessary for modelings 
of electronic structures of copper oxides? and (2) 
What kind of model Hamiltonians are derived on 
the basis of the strength of electron correlations? 
Concerning the first question, one- to six-band 
models (N = 1-6) were found to be plausible for 
copper oxides depending on the purposes and 
problems under investigation. For the latter ques¬ 
tion, there were at least four different classes: (1) 
the t-J model, (2) the spin-fluctuation model, (3) 
the spin- and charge-fluctuation model, which is 
referred to as an electron correlation (EC) model 
[5], and (4) the charge-fluctuation or CT model. 
Many charge- and/or spin-mediated models are 
feasible. For example, models (1) and (2) provide a 
simple equation to estimate the transition tempera¬ 
ture (T c ) for spin-mediated superconductivity 
[3-5]: 

T c = ~~CJ ah , (1) 

where C and } ab are, respectively, a phenomeno¬ 
logical constant (C = 0.1) and the effective ex¬ 
change integral for the superexchange-coupled 
transition-metal ions: M(II)0 2 ”M(II) (1) (M = Cu, 
Ni, etc). Our ab initio computations [4] showed 
that the J ab value for the Cu(II)0 2 Cu(II) unit (la) 
is over 1000 cm” 1 , and, therefore, T c is higher than 
100 K in copper oxide superconductors [1, 2]. 
However, they indicated that copper oxides belong 
to the intermediate regime (see circle 4 in Fig. 5), 
and, therefore, charge and phonon degrees of free¬ 
dom also contribute to the high-T c superconductiv¬ 
ity, although only the spin degrees of freedom is 
explicitly considered in Eq. (1). 

A BRIEF REVIEW OF OUR PREVIOUS WORK 

During the past decade, we have been inter¬ 
ested in the design and investigation of possible 


organic, organometallic, and inorganic analogs to 
copper oxides on the basis of an assumption that 
these analogs should have low CT excitation ener¬ 
gies and large \J ab \ values before doping, although 
many physicists have focused attention on the 
investigation of possible microscopic mechanisms 
of the high-T c superconductivity [2]. Previously 
[3-5], the possible mechanisms have been exam¬ 
ined from the viewpoint of charge and spin de¬ 
grees of freedom, and several model Hamiltonians 
such as the Hubbard model [6] have been dis¬ 
cussed in relation to magnetism, conductivity, and 
superconductivity. Since undoped copper oxide 
was known to be an antiferromagnetic insulator, 
molecular magnetic compounds thus attracted our 
special interest [3-5, 7]. Therefore, we may have 
the first hypothesis for molecular design 1: 

(HI) The first step to molecular design is to find 
possible isoelectronic units to the Cu(II)0 2 ~ Cu(Il) (la) 
system. 

From the above hypothesis, we have first been 
interested in 1,3-dipolar compounds [8] RYR (R = 
CH 2 , CH, etc; Y = O, S, etc. (2) because of their 
isoelectronic structures with 1 as illustrated in 
Figure 1. Figure 2 illustrates the orbital energy 
diagrams for 1 and 2 to clarify their interrelation¬ 
ship. Isoelectronic d~ir systems were also de¬ 
signed by replacing the inorganic anion X in metal 
oxides with an organic 77 -ligand [9], as illustrated 
in Figure 2, and isoelectronic 77-radical (R) sys¬ 
tems were theoretically synthesized by substitut¬ 
ing divalent magnetic ions with radical groups (R) 
under the assumption that the p-d, ir-d, and 77- R 
energy gaps should be small so as to reproduce the 
large antiferromagnetic (negative) exchange inte¬ 
gral of la. To confirm the assumption, the com¬ 
plete active space (CAS) three-orbital four-electron 
{4,3} configuration interaction (Cl) calculations [10] 
were carried out as illustrated in Figure 3. Since 
the direct d-d ( R-R ) interaction is small because 
of no metal-metal orbital overlap, the p~d, ir-d, 
and 7T-R orbital interactions play crucial roles in 
the configuration mixing. The CAS Cl calculations, 
i.e., exact diagonalization of the extended Hubbard 
model [6], revealed a characteristic feature for 
the Cu(II)0 2 Cu(II) unit (la): low ligand-metal 
charge-transfer (LMCT) excitation energy (AE CT ) 
and strong mixing between the metal biradical 
(MBR) and ligand-metal charge-transfer (LMCT) 
configurations because of the large p-d orbital 
overlap integral and small p-d charge-transfer en¬ 
ergy (A pd ) as illustrated in Figure 4. The strong 
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M,-X-M 2 

1 

a: Mi = M2 = Cu(TI) 
b: Mi = M2 = Ni(II) 
X= 0 2 \ S 2 ‘, F 1 ' 
etc. 


R-Y-R 

2 


a: R=CH2, Y = NH, O, S 


b: R=CH, Y = NH, O, S 
etc. 




4 

a: Mi = M2=Cu(II) 


FIGURE 1. The component units for construction 
correlations. 



I 5 : 

■ 1 

1 1 

• 1 

’ a: Mi = Fe(III), M2 = Cu(II)' 
b: Mi =Fe(III), M2=Cr(III) 

chain, ladder, and sheet with intermediate or strong electron 


mixing between MBR and LMCT also resulted in 
the cases of several organometallic systems, e.g., 
the binuclear iron-sulfur system Fe(II)S 2 ~Fe(III) 
(3) [11] and the transition-metal tetrathiolate sys¬ 
tem (4) in Figure 1. In fact, the |} ab \ value has been 
estimated from experiment to be over 800 cm -1 
for the binuclear copper-tetrathiolate complex (4a) 
[12, 13], and this value is comparable to the 
observed } ab value (-1000 cm -1 ) for copper 
oxide. Therefore, we have tried to synthesize one¬ 
dimensional (ID) and 2D complexes (5) using the 
tetrathiolate ligand (4) as shown in Figure 1. The 
intrachain ] ah values observed by the magnetic 
measurements were indeed very large; for exam¬ 
ple, they were 100 and -63 cm -1 , respectively, for 
5a {Ml - Fe(III) and M2 = Cu(II)} and 5b {Ml = 
Fe(III) and M2 = Cr(III)}. However, the intercol¬ 
umn interaction for 4 could not be controlled so as 


to obtain proper crystals since appropriate ligands 
(L 2 ) have not been discovered yet [12, 14]. 

The CAS Cl calculations have shown that elec¬ 
tronic properties of excited states of the p-rf, ir-d, 
and 7 t-R systems in Figure 4 are sensitive to the 
interaction parameters [10] and that CT excited 
states such as LMCT are generated by low-energy 
photoexcitation or even by thermal excitation in 
special cases. This, in turn, indicates that photoex¬ 
citations may induce dramatic changes of elec¬ 
tronic, magnetic, and optical properties of these 
ip-d, 1 r-d, and tt-R interaction systems under con¬ 
sideration. In the past decades, we have thor¬ 
oughly investigated dimer and clusters of typical 
model systems to elucidate the nature of these 
chemical bonds. The calculated results showed that 
active controls of charge, spin, and photon proper¬ 
ties in the systems are feasible because of strong 
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FIGURE 2. Schematic illustrations of the orbital energy 
levels of isoelectronic p-d , rr-d, p-R , and tt-R systems, 
which are schematically explained by A, B, C, and D, 
respectively. 

coupling of three degrees of freedom and nonlin¬ 
ear responses to external fields as has been empha¬ 
sized in previous articles [15, 16]. 

PRESENT PURPOSE AND INTEREST 

As a continuation of this previous work, we 
summarize here and extend our basic ideas for 
molecular design and fully examine theoretical 
models for conducting, magnetic, and optical phe¬ 
nomena in molecular materials. Our previous / 
model [4], which was presented before the pro¬ 
posal of the so-called t-J model, is extended on 
the basis of recent numerical results based on the 


latter model. Many experimental results are also 
examined systematically as circumstantial evi¬ 
dence from our theoretical viewpoints presented 
previously [3, 5]. Several theoretical models or 
hypotheses will be derived from these investi¬ 
gations for active controls of charge and/or 
spin-mediated and photoinduced properties of 
molecular materials. Probably, such a semiempiri- 
cal approach would be permitted as a chemist's 
approach at the current stage of both theoretical 
and experimental studies on molecule-based mate¬ 
rials in the intermediate and strong correlation 
regime. This article contributes to the recent devel¬ 
opment of the intersection area among conducting, 
magnetic, and optical molecular materials. 

Theoretical Backgrounds 

MODEL HAMILTONIANS FOR EXTENDED 
SYSTEMS 

Several microscopic theoretical models [5, 17] 
are first considered in order to elucidate the nature 
of itinerant ferromagnetism via electron correla¬ 
tion, spin-mediated high-T c superconductivity, 
fractional quantum Hall effect, etc., which are our 
main concerns in the field of the solid-state quan¬ 
tum chemistry [3, 18]. From the viewpoint of the 
nature of chemical bonds, conductive, magnetic, 
optical, and other properties of molecular solids 
under investigation are governed by three impor¬ 
tant microscopic variables: (1) the transfer integral 






(D) (E) (F) 


FIGURE 3. Schematic illustrations of six configurations constructed of three active orbitals and four active electrons 
{4, 3}. Notations are given in the text. 
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MMCT(R) 

MMCT(L) 

DCT 


MMCT~ 
MMCT + 


DCT 


MLCT(R) 

MLCT(L) 


MDR 


Apd 


- MLCT" 

MLCT + 

AEct 

- G 


before Cl after Cf 

(A) (B) 

FIGURE 4. The energy levels of the six states 
(A) before and (B) after full Cl calculation of the 
Hubbard model. 


(T) between d (or radical) and p (or 77) orbitals, 

(2) electron-phonon (lattice) interaction (W), and 

(3) electron-electron interaction [10], although 
other factors such as the chemical potential and 
crystal structure and disorder are also important 
for molecular materials. T describes the itinerant 
character of electron or hole, namely, the charge or 
electron transfer in molecule-based solids. The last 
term is usually divided into on-site (li) and inter¬ 
site (V) repulsion integrals. Therefore, the proper¬ 
ties (P) of 7r-radical ( R ) and 77-rf interaction sys¬ 
tems are, in principle, described by these variables: 
P = functions (T, W, If, V) as illustrated in Figure 
5, and the extended Hubbard-Peierls Hamiltonian 
involving these parameters (T,W,U,V) should 
have a general applicability to theoretical treat¬ 
ments of possible new molecule-based materials 
examined later. This Hamiltonian is often reduced 
to several simple model Hamiltonians such as the 
t-J model [5], for which effective variables are 
derived from the four parameters in Figure 5. 

There have been many theoretical studies in the 
extreme cases illustrated in Figure 5: (i) large T- 
limit (delocalized limit), large Lf-limit (localized 
limit), and large W-limit (Jahn-Teller limit). The 
behaviors of molecule-based materials in these 
limits are rather simple, and the nature of them is 
understood in many cases. On the other hand, 
several intermediate cases shown in circles 1-4 in 
Figure 5 are not well known yet because of diffi¬ 
culty of theoretical treatments; it is noteworthy 
that conventional perturbation approaches break 


(Electron lattice) 



FIGURE 5. Variations of electronic, magnetic, and 
optical properties with four microscopic molecular 
parameters: transfer integral (T), electron-phonon 
coupling (IV), and on-site (U) and intersite (V) 
electron-repulsion integrals. Circles (1-4) denote the 
intermediate cases (see text). 


down [5], However, the characteristics of 
molecule-based materials are often regarded as the 
intermediate strengths of four parameters in Fig¬ 
ure 5 [3, 4, 18]. This is, in turn, one of the reasons 
why the molecular materials of our concern are 
important and interesting from a theoretical view¬ 
point [5, 18]. Thus, we have the second hypo¬ 
thesis 2: 

(H2) New phenomena and science may be expected from 
thorough and careful investigations of low-dimensional 
molecular systems in the intermediate regime in Fig¬ 
ure 5. 


ORDER PARAMETERS FOR MAGNETIC AND 
SUPERCONDUCTING STATES 

The magnetism and superconductivity are typi¬ 
cal macroscopic quantum phenomena which can 
be described by characteristic order parameters. 
For the purpose, field theoretical formulations are 
briefly summarized here [17, 19, 20]. The four- 
component spinor ^(r) is introduced to character¬ 
ize possible electronic phases for molecular mater¬ 
ials—(1) charge density wave (CDW), (2) spin 
density wave (SDW), (3) singlet superconductivity 
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(SSC), and (4) triplet superconductivity (TSC): 


¥(r) = 


(r) 

<M r) 

- (r) 


( 2 ) 


^ + (r) = (r)zFj (r) - i/z T (r)). 

i//J (r)( *A tY (r)) is the fermion creation (annihilation) 
field operator with spin a at point r, and it satis¬ 
fies the anticommutation relations. The subscripts 
t and i mean up and down spins, respectively. 
The electronic phase is characterized by its order 
parameter, which is given by the product of spinor 
fields; A J / (r) A I r+ (r'X The product is composed of 16 
components and it is factorized into the following 
forms as 


F(r, r') 

/,/,+ 


1 

2 


</'i(r)i/' i (r') - ^+(r)</r T (r') 

- (/rf (r) Ip l (r') ij/t (r) i/> T (r') 

/ 

C'(r,r') + S' z (r, r') S',(r,r') - zS'/r,r') 
S'(r,r') + iS'( r,r') C'(r,r') - S'(r,r') 


(5) 


and 

D(r,r') 

/ t/z T (r)^f a (r') - i/> t (r)i/» T (r')' 
i/^ (r)^ (r') - tff l (r)i/z T (r') 


1 

2 


A(r, r') + A z (r, r') A x (r,r') 
A^Cr, r') + zA y (r, r') A(r, r') 


zA y (r, r') 
A 7 (r, r') 


( 6 ) 


'Hr^Cr') = 


[ F(r,r') 
\D + (r,r') 


D(r,rV 
F(r, r'), 


(3) 


where each block component is given by 


F( r, r') 

^ (r)i//| (r') i/f T (r)^| (r') ' 

yw^(r') 

1 f C(r,r') + S.(r,r') S 2 (r,r') 

2 ^ S x (r, r') + z'S ;/ (r, r') C(r,r') 


zS y (r, r') 
S 2 (r, r') 


\ 


I 


(4) 


They can be also rewritten by the field product 
elements summarized in Table I. The number- 
density field operator C(r, r')(C'(r, r')) is the anti¬ 
symmetric spinor and means an order parameter 
for CDW. The spin-density field operators, S x ( r, r'), 
S (r,rO, S 2 (r / r'XS' JC (r / r , ) / and so on, are the sym¬ 
metric spinors and correspond to operators for 
SDW. The order parameters for SSC and TSC are 
generated from the off-diagonal field operator 
D(r, r') [19]. Thus, superconductivity^is regarded 
as an off-diagonal long-range order. A(r, r') is the 
antisymmetric spinor and generates the order pa¬ 
rameter for SSC. A v (r, r'), A y (r, r'), and A 2 (r, r') are 
the symmetric spinors and yield order parameters 
for TSC. The SSC state, whose order parameter 
does not depend on the Fermi momentum, is 
isotropic; it is called the s-wave. However, the SSC 


TABLE I_ 

Order parameters for CDW, SDW, SSC, and TSC. 

C(r,r') = i/r T (r)^|(r') + ^(r)^(r') 

S x (r,r') = i/^(r)^ T + (r') + (r)^(r') 

S y (r,r') = ~\{^ i (r)ipf(r') - (r')} 

S z (r,r f ) = i// r (r)t// T + (r0 - ifj i (r)^ + (C) 

C(r,r f ) = ^ A (r)^ A (r') + ^(r)^ T (r') 

S r x (r,r f ) = -i l/ + (r)ilf l ( r f ) - <Aj + (r)<A t (/*') 


Sy{r,r f ) = i{ipi(r)il/ i (r') - i// i + (r)i// r (r')> 
S z (r,r') = ip'l(r)il/ i (r r ) - i/> T + (r)^ t (r') 

A(r, r f ) = i// T (r r ) - ^ A (r)^ t (r # ) 

A x (r, r') = ^ A (r f ) - (r)^ T (r') 

A y (r,r') = -/{tfr A (r)^ A (r') + <// T (r)^ T (r')> 
A z (r, r') = <A T (r)i// i (r') + (r)i// T (/*') 
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state often depends upon the direction of the mo¬ 
mentum whose magnitude equals the Fermi mo¬ 
mentum. The SSC state which exhibits dependence 
on direction is an anisotropic SSC state such as the 
d-wave. On the other hand, we must consider both 
orbital and spin degrees of freedom to characterize 
the TSC state. The order parameters of spin S and 
angular momentum L are composed of 2S + 1 
spin elements times 2L 4- 1 orbital elements. 
Therefore, TSC have many possibilities such as the 
p- wave, showing many interesting characteristics. 
These SSC and TSC states can be characterized by 
the group-theoretical methods [21]. 

MODIFIED PHASE DIAGRAMS 

FOR t-J MODELS 

As a typical example for explanation of long- 
range orders discussed above, let us consider phase 
diagrams for doped copper oxides. Previous ab 
initio UHF [4] and CASSCF [22] calculations indi¬ 
cated that extra holes are introduced mainly into 
the oxygen site for copper oxides [2], being consis¬ 
tent with the so-called t-J model derived from the 
d-p two- or three-band Hubbard model of the 
species [23, 24]. Therefore, a reasonable under¬ 
standing of our J model [4] can be obtained from 
exact and numerical results obtained by recent 
exact diagonalizations [25-30] of t-J models for 
copper oxides. The ID t-J model was solved ex¬ 
actly in the special case of J = 2t [25], and its 
numerical solution was also obtained to depict the 
phase diagram [26]. Tire t-J models for the 2D 
lattice were also solved numerically to describe the 
high-T c superconductivity for copper oxides 
[27-30], All these calculations indicated that the 
large / value is responsible for the formation of a 
Cooper pair, supporting the / model in Eq. (1). 
Here, the phase diagrams obtained by them are 
utilized for the extension of our model to repro¬ 
duce experimental results [2]. 

The t-J model Hamiltonian can be defined as 

H(t -J) = -t £ (1 - a /(r (1 - n jr _ a ) 

i, j , <r 

+ /D S,-S ; , (7a) 

i'i 

where t denotes the hopping integral for the sin¬ 
glet pair between the Cu- and O-hole spins [23], 
and the spin operator S, and number density are 
defined by S,- = k a ia a ap a ip an d n i = ^( 7 a ta a ia' re “ 


spectively. The phase diagram for the t-J model 
was obtained using the slave-boson method [25], 
where a ia is defined by 

= fur b i' ■ ( 8 ) 

f i(7 is a fermion operator that carries the spin and 
is a boson operator creating a vacancy. The 
constraint of no double occupancy at the site i is 
expressed by 

Lftcrfiv + K b i = I' W 

(T 

Then, the t-J model Hamiltonian can be rewritten 
as 

Hit - J) = -t E b,ft r f ja b? + /Ls, -S, 

i, j, <J i, j 

- E A,'( £/^/;„ + K b i - l)/ (7b) 

where the Lagrange multiplier A, represents the 
constraint to exclude the double occupancy at each 
site. The singlet spin-pairing parameter (S z/ ) is 
given by 

s,, = (fi 1 f, i -fnfj t ). ( 10 ) 

The 2D t-J model was solved under the mean- 
field approximation [27-30] and two different 
transition temperatures, T(SG) and T(BE), were 
obtained: T(SG) is the transition temperature for 
the pairing of the spinons below which the fermion 
response function shows the Meissner effect, while 
T(BE) is the Bose-Einstein condensation tempera¬ 
ture of the boson. T(SG) is a decreasing function of 
the hole concentration x and becomes zero when 
tx max = /, where x max is the maximum concentra¬ 
tion for the appearance of superconductivity. On 
the other hand, T(BE) is an increasing function 
of x, e.g., T(BE) = 477fx = 477 Jx (x = (fc+fy)). 
The four different phases were defined by using 
these two characteristic temperatures, T(SG) and 
T(BE)—I(FL): the Fermi liquid phase where (b t ) 
= 0; II(SG): the spin gap (SG) phase where <S /; ) =£ 
0 but (b^ = 0; III(SSC): the singlet superconduct¬ 
ing phase with the A- wave symmetry where <S f -> 
^ 0 and (bi) # 0; and IV(EM): the exotic metal 
phase where = 0 and (b { ) = 0. The spin gap 
(SG) appears in the spin excitation spectrum in the 
phase II(SG). This has been confirmed by an NMR 
experiment [31]. The superconducting order pa- 
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rameter [27] in Eq. (6) is given by 


by the quadratic form of x: 


^ ij 1 i i a jO ^ (11a) 

where is defined by 

A ij = j dr dr’<pf (r)A(r,r')<Py(r'). (lib) 


cpi denotes the site wave function. This is not zero 
in the phase III(SSC), accompanied by the appear¬ 
ance of the mass of the gauge field (the so-called 
Anderson-Higgs mechanism) [19, 20]; the boson 
(holon) and fermion (spinon) interact through the 
gauge field as in the case of QCD theory [17]. 
Although the slave-boson model explains, indeed, 
the appearance of a spin gap [31] in the under¬ 
doped copper oxide, its behavior near the thresh¬ 
old x = 0 is not clear, although the experiment 
clearly shows the existence of an antiferromagnetic 
(AF) phase [2]. 

Therefore, our / model in Eq. (1) is extended so 
as to reproduce experimental results [2] on the 
basis of the Ginzburg-Landau (GL) model which 
has been developed for the RVB state described by 
the t-J model [32]. From the mean-field results 
[32], two order parameters, 4> G lf aR d <^ GLB , corre¬ 
sponding to fermion pairing and Bose condensa¬ 
tion, respectively, were introduced to define the 
free energy: 


^glx t 4>glx / a ' A ] 


H. 


cX 


r 


l 2e \ 

rsj 


1 V — zma — i — nA\<f> GLX 


8 


+ 2sgn(T T(X))\<f> GLX \ + I^glx 


( 12 ) 


where and H cX denote the coherence length 
and critical field, and m = 2 and n = 1 for the 
fermion (X = F) and m = 1 and n = 0 for the 
boson (X = B). a and A denote the Chern-Simons 
gauge field [33] and vector potential, respectively. 
Flere, we modify the GL model so as to approxi¬ 
mately describe the experimental results [2] in 
combination with our / model, which was derived 
from the ab initio molecular orbital results for 
transition-metal oxides [34]. The three transition 
temperatures, T(AF), T(SG), and T(BE), are as¬ 
sumed to be proportional to the hole concentra¬ 
tion, while the transition temperature T(SSC) for 
singlet superconductivity is approximately given 


T(AF) = c(AF)J(x min — x) (13a) 
T(BE) = c(BE)J(x - x min ) (13b) 
T(SG) = c(SG)/(x max - x)(x > x min ) and 

U3cj 

T(SG) = c(SG)/x(x < x min ) 

T(SSC) = c(SSC)/(x - x min )(x max - x). (13d) 

c(X) are the constants in Eq. (1), and the existing 
regions of these phases are defined as follows: 
antiferromagnetic phase (AF), 0 < x < x min ; singlet 
superconductivity phase (SSC), x min < x < x max ; 
and Fermi liquid phase, x max < x (fx max = /). Fig- 
ure 6 illustrates the phase diagram for five differ¬ 
ent (AF, SG, SSC, FL, EM) phases. The maximum 
temperature for SSC is suppressed in our model, 
in conformity with the gauge field fluctuation, etc. 

The SSC in the Fermi liquid phase I can be 
described by the spin fluctuation model [35, 36]. 
The transition temperature for SSC is expressed by 
the conventional BCS formula [35-37] 

T (SSC) = r Q f m (x max - x)exp( — 1/A), (14) 

where the cutoff energy is given by the character¬ 
istic energy T Q of spin fluctuation (SF) and is 
the magnetic coherence length [37]. The SF model 
at least works well in the overdoped region I in 
Figure 6, although its behavior in the underdoped 
region II is not clear yet. If the true order parame¬ 
ter for the high-T c superconductivity is discovered 
in the future, the coherence length, penetration 


T 



FIGURE 6. Phase diagram for the Fermi liquid I (FL), 
spin gap II (SG), singlet superconductive III (SSC), exotic 
metallic IV (EM), and antiferromagnetic V (AF) phases for 
copper oxide superconductors by the modified 
slave-boson model. 
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depth, and transition temperature can be obtained 
by the same formalism in Eq. (12). The previous / 
and present modified GL models can be utilized 
for molecular design of spin-mediated supercon¬ 
ductors, although they are phenomenological. In 
fact, several experimental groups are now conduct¬ 
ing synthetic efforts for them as discussed below. 

ACTIVE CONTROLS OF PROPERTIES 
BY EXTERNAL FIELDS 

The phase diagram in Figure 6 clearly shows 
that electronic phases of copper oxides are variable 
with hole concentration (x). This, in turn, indicates 
that active control of electronic, conductive, mag¬ 
netic, and optical properties by chemical modifica¬ 
tions and external fields are feasible for potential 
applications of molecule-based materials to ad¬ 
vanced technologies. Concerning these applica¬ 
tions, there are several experimental and theoreti¬ 
cal reasons why the intermediate regions in Fig¬ 
ures 5 and 6 are so interesting: 

1. Electronic structures in this regime are often 
sensitive to external variables (doping of hole 
or electron, pressure, temperature, external 
electronic field, photoexcitation, etc.), and the 
variations with such parameters are nonlin¬ 
ear in many cases. 

2. Controls of electronic, magnetic, optical, and 
other properties with the above variables are 
feasible in appropriate experimental condi¬ 
tions because of the strong coupling of charge, 
spin, and phonon degrees of freedom. 

3. Mean-field theories often break down be¬ 
cause of strong fluctuations, and, therefore, 
new theories such as gauge field theories in 
quantum chromodynamics (QCD) [17, 19, 20] 
would be useful even in the field of solid-state 
quantum chemistry. 

Thus, the switching of conductive, magnetic, and 
optical properties by the external fields is a key 
issue for molecule-based materials in the interme¬ 
diate regime [3, 18]. Figure 7 illustrates the bista¬ 
bility expected in several natural and artificial sys¬ 
tems [14, 18]. Although the bistability is indeed 
one of the guiding principles for the construction 
of electronic devices, it is equally applicable to the 
design of new functional materials responsible for 
active controls [3]. It would be one of the chemist's 
strategies to obtain molecule-based materials with 


Electronic 

properties 


FIGURE 7. Controls of electronic, magnetic, and optical 
properties of molecule-based materials by external 
parameters (see text). Switchings by the external fields 
are feasible in the intermediate regime. 

such novel properties, although powerful metal 
magnets and high-T c copper oxides have already 
been discovered. Probably, molecule-based func¬ 
tional systems have common characteristic fea¬ 
tures with biological systems [18], 


Possible Strategies for Charge- and/ 
or Spin-mediated Superconductors 

GENERATIONS OF NEW ELECTRONIC 
PHASES BY CARRIER DOPING 

The discovery of high-T c copper oxide [1] sug- 
gested the idea that doping in antiferro- or ferro¬ 
magnetic systems, more generally molecular 
charge-transfer (CT) insulators, may provide sev¬ 
eral exotic electronic phases as illustrated in Fig¬ 
ure 8 [3-5]. The antiferromagnetic insulators may 
be converted upon doping into six different phases: 
(1) ferromagnetic metal or insulator; (2) spin glass; 
(3) paramagnetic metal; (4) antiferromagnetic 
metal; (5) ferrimagnetic metal or insulator; and (6) 
charge- or spin-mediated superconductor. Recent 
field theoretical approaches can describe these 
electronic phases in a systematic manner as shown 
in Eqs. (2)-(6). Unfortunately, to our knowledge, 
theoretical determinations of possible phase dia¬ 
grams for molecular materials [17, 22] are difficult 
tasks except for a special case such as the Nagaoka 
theorem [38], because the perturbation theory 
breaks down for molecular materials under inves¬ 
tigation. However, accumulating experimental re¬ 
sults suggest that carrier doping into intermediate 
or strong correlation systems exhibit several inter¬ 
esting phenomena in Figure 8 and other quantum 
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Mott-Hubbard 
and CT Insulator 


carrier 

doping 








ferromagnetic metal (FRM) 
or insulator (FRI) 

spin-glass (SG) 


paramagnetic metal (PMM) 


antiferromagnetic metal (AFM) 


ferrimagnetic insulator (FRII) 
or metal (FRIM) 

charge- or spin-mediated 
superconductor (SC) 


FIGURE 8. Possible phases obtained by carrier doping 
into Mott-Hubbard and CT insulators. 


spin phenomena [17] expected for the low-dimen¬ 
sional systems in Figures 5 and 6. Therefore, a 
chemist's approach from the theoretical side would 
search isoelectronic series to copper oxides as em¬ 
ployed previously [3], while that of the experimen¬ 
tal side is to synthesize new isoelectronic com¬ 
pounds followed by physical observations [14b]. 
Collaborations from both sides would open the 
new intersection field in molecular electronic ma¬ 
terials. For the theoretical side, the following hy¬ 
pothesis 3 is assumed as [3-5] 

(H3) Dopmg of hole or electron into Mott-Hubbard or 
CT insulators may provide exotic electronic materials 
summarized in Figure 8 . 


CHARGE- AND / OR SPIN-MEDIATED 
SUPERCONDUCTOR 

Although many theoretical studies of doped 
copper oxides were initiated just after the discov¬ 
ery of the high-T c superconductivity [1], theoreti¬ 
cal attempts to examine possible molecular analogs 
to them were scarce in 1986 [3-5, 7] although such 
attempts are now popular in chemistry and mate¬ 
rial science. As mentioned previously [5], we have 
therefore started our theoretical attempts to search 
possible models for charge- and/or spin-mediated 
superconductors. For the purpose, we assumed a 
working hypothesis (H4): 

(H4) Charge - and/or spin-mediated superconductors 
may appear in the intermediate region of metal- 
insulator transitions as illustrated in Figure 9. 


T ( A ) 




FIGURE 9. Four different approaches to obtain high-T c 
superconductors in the intermediate regime of the 
metal-insulator transitions: (A) charge-mediated 
superconductor; (B) spin-mediated superconductor. 


Figure 9 illustrates possible approaches based 
on the hypothesis H4. Concerning charge-medi¬ 
ated superconductivity, the introduction of charge 
fluctuation from the metallic side was already ex¬ 
amined by Little [39] in the case of conductive 
polymers and related species in Figure 9(A). On 
the other hand, the possible role of CT excitation 
from the nonmagnetic insulator side in the inter¬ 
mediate region of metal-insulator transitions is 
not elucidated yet. Therefore, this moot point 
should be examined later. The spin-fluctuation 
model is also conceivable from the metallic side in 
the intermediate regime of antiferro (or ferro) mag¬ 
netic insulator-to-metal transitions as illustrated in 
Figure 9(B). On the other hand, the t-J model or 
RVB model can be used to formulate possible 
spin-mediated superconductivities from the mag¬ 
netic insulator side. In the later sections, we there¬ 
fore examine possible candidates from these theo¬ 
retical viewpoints. 
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Models for Exotic Spin-mediated 
Conductors and Superconductors 

ORGANIC AND ORGANOMETALLIC RVB 
SYSTEMS 

First, let us consider the spin-mediated super¬ 
conductor in relation to copper oxides. The intro¬ 
duction of electron carriers into magnetic insula¬ 
tors is one of the possible ways to obtain 
spin-mediated superconductors as shown in Fig¬ 
ure 9(B). The RVB state with spin frustration is 
expected to be a possible precursor for the pur¬ 
pose, although it has been interesting from the 
mechanism of free-radical reactions. For example, 
the Heisenberg model for organic radicals and 
radical clusters permit general spin structures such 
as triangular and cone ones [40, 41]. hi fact, these 
general spin structures as well as spin-frustrated 
configurations appear even in the three hydrogen 
D 3h system and four hydrogen T d system which 
are models of transition structures in exchange-for¬ 
bidden radical reactions. We have thoroughly in¬ 


vestigated these systems to elucidate group-theo¬ 
retical interrelationships between several theoreti¬ 
cal models [40]. 

After the discovery of the high-T^ superconduc¬ 
tivity [1], triangular spin systems have attracted 
great interest because of the Andersons's resonat¬ 
ing valence bond (RVB) theory [42] for magnetism 
and superconductivity. Many field theoretical 
models [17] have been presented for the high-T c 
superconductivity in conformity with Anderson's 
proposal. On the other hand, we considered possi¬ 
ble organic [10] and organometallic [11] spin-frus¬ 
tration systems such as the triangular ladder (IV) 
and Kagome ladder (V) in relation to RVB model 
for copper oxides as illustrated in Figure 10. Both 
through-bond and intermolecular approaches are 
possible to construct such spin networks or topol¬ 
ogy. The former approach enables us to propose 
several organic [8] and organometallic [11] candi¬ 
dates composed of the bond units in Figure 4. We 
also tried to synthesize low-dimensional systems 
composed of 4Fe—4S clusters with spin frustra¬ 
tions, but amorphous oligomers resulted to pre¬ 
vent further experimental treatments [43]. Instead 
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FIGURE 10. 1 -D, ladder, and 2-D spin systems with and without holes (or electrons) which are attractive from 
magnetism and superconductivity. 
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of the former (through-bond) approach, the latter 
(through-space) approach is rather simple since 
many radicals and ion-radicals can be used for 
component units. For example, three-center units 
(2), nitronyl nitroxide (NN) derivatives, doped TTF 
and BEDT-TTF molecules, etc., would be building 
blocks for molecular ladder compounds in Figure 
10 . 

Awaga et al. [44] discovered the organic Kagome 
system composed of p-MPNN, which does not 
exhibit the magnetic ordering even at low temper¬ 
ature because of strong spin frustration. But intro¬ 
duction of the carrier into this system seems diffi¬ 
cult. Enoki [45] also found the triangular lattice 
composed of the BEDT-TTF plus Br complex. Their 
preliminary results for the complex showed pecu¬ 
liar magnetic behaviors. Apparently, much more 
experimental effort is desirable to open a new field 
in this direction. Then, we have the following 
hypothesis 5: 

(H5) Exotic electronic phenomena may appear upon 
doping of organic and organometallic resonating va¬ 
lence bond (RVB) systems. 

/V-BAND MODELS FOR ORGANIC 

FERROMAGNETS, FERROMAGNETIC 
CONDUCTORS, AND 
p-WAVE SUPERCONDUCTORS 

The ferromagnetism for electron correlated sys¬ 
tems with N-tuple degenerate orbitals is a classic 
problem in magnetism because of the potential 
exchange interaction; the parameters arising from 
the Coulomb interaction (U, 1/) in Figure 5 should 
be considered [46, 47]. If an electron carrier is 
doped into the systems, the interaction between 
the conduction electron and the localized spin is 
realized, showing common features for other sys¬ 
tems examined in this article. Recently, revisited 
interest for degenerated N-band systems has arisen 
from several discoveries. The orbital degeneracy 
(freedom) such as the three degenerate t 2g orbitals 
of C 60 (7) plays an important role for orbital ferro¬ 
magnetism [46]. The Hund rule for the orthogonal 
t 2g orbitals favors the high spin state of the dian¬ 
ion and trianion of 7, but the ground state of the 
dianion is a singlet, showing the important role of 
an electron correlation effect as in the case of 
square planar cyclobutadiene. Thus, the interplay 
or competition among ferromagnetic exchange 


from orbital degeneracy, electron correlation (li), 
and the Jahn-Teller distortion (W) is one of funda¬ 
mental problems in molecular magnets with N- 
band degeneracy. The TMAE*C 60 CT complex ex¬ 
hibits complex magnetic behaviors because of these 
three factors; many spectroscopic experiments are 
not conclusive and are not settled yet [46]. 

The backbone and pendant-type polycarbene 
polymers investigated by the Ito-Takui [48] and 
Iwamura [49] groups are particularly interesting 
since the polycarbene species are regarded as the 
orthogonal two-band (<r and i r) systems. Theoreti¬ 
cal calculations [7, 50] were carried out to eluci¬ 
date possible spin states of these species after 
doping and it was concluded that the doping of 
carriers into these polymers would become a model 
for the ferromagnetic conductor and the molecular 
triplet superconductor. Recently, the Shida and 
Ito-Takui groups initiated doping experiments into 
polycarbenes [51]. 

Recently, Maeno et al. [52] discovered the su¬ 
perconductivity for non-copper oxides: Sr 2 Ru0 4 . A 
theoretical possibility of triplet superconductivity 
(TSC) in Table I was pointed out for Sr 2 Ru0 4 
[53]. According to the band calculation [54], the 
dsi^-pTr interaction is essential in this com¬ 
pound in contrast to the dy(a)-pa interaction in 
copper oxide. In our / model in Eq. (1), the T c for 
triplet superconductivity should be rather low be¬ 
cause the ferromagnetic spin fluctuation is weak in 
Sr 2 Ru0 4 . If Sr 2 Ru0 4 is really the triplet supercon¬ 
ductor, it supports the isoelectronic analogies in 
Figure 2. Several recent findings for the inorganic 
systems may provide the following hypothesis 6 
for molecular materials: 

(H6) Carrier-doped molecule-based magnets with Ri¬ 
band degeneracy are interesting for ferromagnetism, 
magnetic conductivity, and triplet superconductivity. 


Models for Charge-mediated 
Superconductors 

The transition temperature for spin-mediated 
superconductors are estimated [5] to be 150 K even 
at the upper limit, since the largest effective ex¬ 
change integrals are in the order 1000 cm -1 . There¬ 
fore, we may change the strategy to obtain the 
room-temperature superconductor. Several ap¬ 
proaches [55] from the metal side in Figure 9(A) 
were performed. Probably, new approaches differ- 
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ent from them are necessary for the purpose. Here, 
let us consider the CT model from the insulator 
side in Figure 9(A) instead of charge fluctuation 
(exciton) model by Little [39]. 


ALKALI-, ALKALI-EARTH-, AND 

HYDROGEN-TERNARY COMPLEXES 

The chemical synthesis of high-T c superconduc¬ 
tors without copper ion is one of a chemist's 
dreams. According to the working hypothesis H5, 
let us consider possible model systems in the inter¬ 
mediate region of metal-insulator transitions [56, 
57]. Building blocks composed of hydrogen (H) 
and alkali earth atoms are regarded as the Is- and 
2 s orbital pairs having the isoelectronic configura¬ 
tions to those in Figure 2. The alkali (A)- and 
alkali-earth (B)-atom combinations also provide the 
same electronic configurations. These simple com¬ 
binations are interesting because high pressures 
can be used to obtain solids. The ternary com- 
plexes, HBH, HBA, and ABA, are isoelectronic to 
the three-center four-electron systems depicted in 
Figures 3 and 4. On the other hand, HHH, HAH, 
AAH, and AAA components are formally isoelec¬ 
tronic to the one-extra hole-doped three-electron 
states of the three-center systems in Figure 4. The 
CT excitations (A) can be controlled so as to couple 
the Cooper (2 e ) pair in the intermediate regime of 
the metal-insulator transitions. Therefore, three- 
component solids expressed by (H) t (A) (B) 2 would 
be particularly interesting from the CT mechanism 
of the high-T c superconductivity in Figure 9(A). 
The partial addition of alkali and alkali-earth atoms 
into metallic hydrogens at high pressure would be 
one of the charge-fluctuation approach in Figure 
9(A). On the other hand, let us consider a model 
system of the CT mechanism of the partial addi¬ 
tion of hydrogen atoms into an insulator, e.g., 
which is consisted of LiH and BeH 2 , at high pres¬ 
sure as shown in Figure 11. 


A FORMULATION OF OUR CT MODEL 

To clarify the difference between Little [39] and 
our models, we describe here the model Hamilto¬ 
nians explicitly. For the underdoped state, a model 
Hamiltonian is written as 
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FIGURE 11. A model system of the CT mechanism of 
the partial addition of hydrogen atom into an insulator 
which consists of LiH and BeH2 and some possible 
transition states. 
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In Eq. (15a), the first and second terms are kinetic 
and electron interaction parts, respectively. H e _ l 
means electron-lattice (positive charge) interac¬ 
tion. Hj is the Coulomb interaction between lat¬ 
tices (positive charges). H e _ p and H l _ p indicate 
electron-phonon and lattice (positive charge)- 
phonon interactions, respectively. a^ a {a ra ) is the 
creation (annihilation) operator for an electron at 
the r-th site with a spin. is the creation opera¬ 
tor for a state (positive charge) at the z-th site, 
which is shown in Figure 11(b) and (d). is the 
creation operator for a state, which is different 
from the above state, as shown in Figure 11(c). The 
operators are satisfied with the anticommutation 
relation. c p/ which is a boson, is the phonon cre¬ 
ation operator. 

In particular, in the underdoped system as 
shown in Figure 12, the above Hamiltonian is 
approximated as 

h = E - v E E 

r, s a r ,s,t,u (T, p 

(16) 


where 


the following: 

T c = C A, (18) 

where A is the CT excitation energy and C is a 
constant. We expect that T c increases above 300 K 
(see Table II), because the CT excitation energy 
equals to about 1-3 eV. 

On the other hand, for the overdoped state, 
another model Hamiltonian is considered: 

H = H e + H h + H e _ h + H d , (19) 
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The second term in Eq. (15a) and of Eq. (15c) 
are negligible. Therefore, the transition tempera¬ 
ture for this approximation may be estimated as 
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FIGURE 12. Underdoped model system constructed 
of alkali- and alkali-earth metals. 


where a+ (r (a r(T ) is the creation (annihilation) opera¬ 
tor for an electron at the r-th site, the beryllium 
site, with a spin. iC is the creation operator for a 
positive charge (hole) at the z-th site with cr spin, 
where the label i is the lithium site. The operators 
are satisfied with the anticommutation relation. 
The first and second terms in Eq. (19a) mean the 
kinetic and electron interaction parts, respectively. 
In Eq. (19b), the first and second are the kinetic 
and Coulomb interactions for the hole. H e _ h is the 
Coulomb interaction between the electron and hole. 
H d indicates that an electron (negative charge) and 
a hole vanish. This means that the charged state 
becomes neutral as shown in Figure 11(a). This 


TABLE II_ 

Three mediums and T c for superconductors. 


Medium 

Maximum T c (K) 

Phonon 

30 

Spin 

100 

Charge 

300 
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model will reduce to a metal model, which may be 
a metallic hydrogen model, at the limit of a large 
amount of a doping hydrogen atom. 

DESIGN OF NONMETAL TERNARY SYSTEMS 

The magnitude of the CT excitation energy (A rf ) 
is one of the possible parameters to elucidate pos¬ 
sible organic systems which are isoelectronic to 
copper oxides. Previously [5], we already exam¬ 
ined the CT excitation energies between nonmetal 
second and third periodic elements. The A value 
for the sulfur (S) and nitrogen (N) pair, namely, 
transition energy from SN to S + N“, is only 0.1 eV, 
and therefore the (SN)* polymer corresponds to 
the overdoped copper oxide [2], which is a normal 
metal without spin fluctuation. The normal BCS- 
type superconductivity is expected for the species, 
in accord with the experiment. On the other hand, 
the CT energies (A pp ) for the CS, SiS, PS, and CN 
pairs are 0.6-1.0 eV, compatible with the undoped 
copper oxide [58]. Therefore the ID polymers (I), 
ladders (II), and sheet (III) consisted of these pairs 
in Figure 10 could be CT semiconductors/or insu¬ 
lators. The introduction of holes or electrons into 
these systems by doping may induce the metal- 
insulator transitions as in the case of copper ox¬ 
ides. For example, the synthesis of ternary solids, 
(X)*(C) y (N) z , (X)*(Si) y (S) z , and so on (X = halogen 
atoms, etc.) would be interesting targets since the 


solid-state synthesis under the high pressure was 
developed recently [59]. Then, we may have the 
following hypothesis H7: 

(H7) Charge-transfer models for the high-T c supercon¬ 
ductivity in Figure 9(A) are conceivable in the ternary 
solids such as (H) x (A) y (B) z and (X) x (C) y (N) z . 


Models for Photo-induced 
Molecular Materials 

PHOTOINDIICED ORGANIC AND 
ORGANOMETALLIC CT FERRO- 
AND FERRIMAGNETS 

Photo-induced phase transitions have been con¬ 
sidered as an advantage of molecule-based materi¬ 
als. From this viewpoint, the donor-acceptor com¬ 
plexes would be interesting targets for magnetic 
modification [16]. Possibilities of organic charge- 
transfer (CT) ferro- and/or ferrimagnets were ex¬ 
amined based on the ab initio computations of 
donor-acceptor (D-A) complexes with radical 
groups [16]. The intra- and intermolecular effective 
exchange couplings between ion radicals were ex¬ 
amined theoretically as illustrated in Figure 5 in 
[16]. Possibilities of photo-induced or photo- 
switched organic magnets were also proposed as 
illustrated in Figure 13 [16]. To this end, spin 
sources were introduced into compounds exhibit- 


potential energy 



(A) 


Potential energy 



FIGURE 13. Photo-induced or photo-switched organic magnetic materials. 
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ing the neutral-ionic transitions by photoexcita¬ 
tion: The TTF-chloranil system having a radical 
substituent is illustrated as an example in [18]. 
Similarly, magnetic oligomers such as polyphen¬ 
ylene vinylene) [60], etc., can be used as a donor or 
acceptor for photo-induced CT magnets [18]. 

Very recently, Hashimoto et al. [61] demon¬ 
strated that the magnetic properties of Prussian 
blue analogs (MXM 7 , X - CN 1 ") are controllable 
by the electrochemical, photochemical, and other 
techniques, namely, magnetic phase transitions 
were induced by the external perturbations. Their 
results lend support our previous attempt, e.g., see 
[15]. The success of the photo-induced magnetism 
[61] in the Prussian blue analogs may support 
possibilities of photo-induced organic magnets 
as well. 

The key issues in these p-d, ir-d, and 7t-R 
systems are small metal-ligand CT (MLCT) and 
metal-metal CT (MMCT) excitations in Figure 13. 
Thus, we may have the following hypothesis 8: 

(H8) Photo-induced molecular magnetism and tunable 
magnets by external fields are expected in p-d, ir-d, 
and tt-R systems with small MLCT and MMCT ener¬ 
gies in Figure 3. 


POSSIBILITIES OF PHOTO-INDUCED 

SUPERCONDUCTORS 

It is significant to consider possibilities of the 
photo-induced superconductors in an intermediate 
regime for the metal-insulator transition. Let us 
consider a model system as shown in Figure 14(a). 
The phase of the ground state, which is described 
as region I as shown in Figure 14(b), is an insula¬ 
tor. By irradiating the photon, electrons transit to 
the excited states which are in region II or III as 
shown in Figure 14(b). Region III, which is a con¬ 
tinuous level region, becomes the metal phase, 
and, on the other hand, region II, which has a very 
small energy gap between levels, will be chaotic. 
We point out that analogously to the above discus¬ 
sions of superconductivity in the intermediate re¬ 
gion of metal-insulator transitions the photo- 
induced superconductive phase may be found in 
region II but for solving the mechanism of the 
coupling Cooper pair. Thus, we may have the 
following hypothesis 9. 

(H9) Photo-induced superconductivity by external fields 
are also expected in a semi-continuous level band in 
Figure 14. 



(b) 


Region II Region I 

FIGURE 14. A model of photo-induced superconductor 
and three regions of excited levels: (a) a lattice model of 
an insulator; (b) energy levels. Discrete energy levels are 
in region I. Regions II and III describe semicontinuous 
and continuous levels, respectively. 



Discussions and Concluding Remarks 

The discovery of high-T c superconductors [1] 
enable us to investigate possible organic and 
organometallic analogs to copper oxides. Previ¬ 
ously [3-5], we thoroughly examined some possi¬ 
ble model Hamiltonians for copper oxides. N-band 
(from two to six bands) models for copper oxides 
were examined by exact diagonalization of the 
cluster models. From these computational results, 
CT insulators with strong superexchange interac¬ 
tions were proposed as possible precursors for 
high~T c superconductors and exotic molecular 
magnets. Some typical examples described by the 
p-d, a—d, and tt-R Hamiltonians were also dis¬ 
cussed from both charge and spin fluctuation 
mechanisms for the high-T c superconductivity. 
Judging from the cutoff energies, the maximum T c 
would be 30, 100, and 300 K for phonon-, spin-, 
and charge-mediated superconductors as shown in 
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Table II. During the past 7 years, we have discov¬ 
ered and investigated many experimental conclu¬ 
sions on the basis of the working hypotheses pre¬ 
sented previously [3-5]. In this article, we have 
tried to explain these newly obtained experimental 
results from theoretical viewpoints. The J model 
[5] was extended so as to reproduce many experi¬ 
mental results [2]. As cited in many articles in 
the references, some theoretical computations 
were carried out to examine guiding principles for 
constructions of functional materials and active 
controls of various properties of molecule-based 
materials. Since theoretical predictions of crystal 
structures for complex systems discussed here 
were difficult tasks, numerical calculations of real¬ 
istic models were performed assuming possible 
lattices in Figure 10. These computational results 
and available experiments cited in this article en¬ 
able us to propose several working hypotheses 
(H1)-(H9). For the purpose, a theoretical formula¬ 
tion for the CT model was presented. More de¬ 
tailed theoretical formulations for these hypothe¬ 
ses will be given elsewhere. 
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ABSTRACT: We present potential energy surfaces for two four-atom systems, HOCO 
and Ar-HCO. In the former system a previous, reduced dimensionality 2 degree-of- 
freedom model of the OH + CO -> H + C0 2 reaction is extended by inclusion of the 
nonreactive CO stretch. In the latter system, extensive ab initio calculations are done for 
19 HCO geometries and 120-150 Ar-HCO planar configurations, for a fixed HCO 
geometry. In both systems a Shepard-type representation of the global potential is 
presented in the former system in 3 degrees of freedom, and in the latter system in the 
full 6 degrees of freedom. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 965-973, 1997 


Introduction 

I t is perhaps too obvious to state the importance 
of potential energy surfaces for dynamic stud™ 
ies of molecular interactions. Not only are poten¬ 
tials essential, but in many instances they are more 
important than the choice of dynamical approach, 
i.e., classical or quantum. During the past 20 years, 
and especially in the past decade, a number of ab 
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initio potential energy surfaces have been devel¬ 
oped for triatomic systems. It is fairly routine now 
to perform high-quality ab initio calculations at 
sufficiently many nuclear geometries, of the order 
of 1000 (corresponding to roughly 10 points per 
degree of freedom), to determine a global potential 
for triatomics. The potential is typically obtained 
by representing the full potential by a functional 
form with a number of linear and nonlinear pa¬ 
rameters which are determined by a least-squares 
procedure. 

This highly successful approach for triatomic 
systems has obvious and severe limitations for 
larger systems. For example, for a four-atom sys- 
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tem the grid of geometries corresponding to the 
modest number of 10 points per degree of freedom 
would result in 10 6 geometries at which electronic 
energies would be calculated. This is clearly pro¬ 
hibitive for current ab initio methods. Even if one 
thinks about bypassing a potential function and is 
willing to attempt electronic structure calculations 
"on-the-fly," i.e., exactly at grid points where they 
are needed by the dynamics, the problems are at 
best very severe for classical dynamics and again 
prohibitive for quantum dynamics. 

Thus, other strategies are needed, and have 
been developed, for polyatomic systems (where 
henceforth "polyatomic" refers to systems of four 
or more atoms). In this study we describe novel 
applications of a Shepard-type interpolation meth¬ 
od to two quite different systems that illustrate 
different objectives. One is a reduced dimensional¬ 
ity model of the OH + CO -» H + C0 2 reaction, 
and the other is a nonreactive four-atom system, 
Ar-HCO. 

In this type of interpolation a data base of 
localized and widely distributed (in configuration 
space) representations of the potential energy sur¬ 
face is used to obtain the surface globally, by 
means of switching functions. This approach ap¬ 
pears to have been used first to obtain a global 
representation of the HCO potential [1]. In that 
approach ab initio calculations of the electronic 
energy were done for 30 widely different geome¬ 
tries. Sufficient calculations were done at each ge¬ 
ometry to obtain a local fourth-order force field, 
and the collection of force fields was then used to 
interpolate via switching functions to obtain the 
full potential globally. (That representation was 
then used to obtain an efficient three-dimensional 
spline representation of the suface.) More recently, 
this general approach was adopted for a study of 
the OHH 2 potential surface [2]. In this case, Hes¬ 
sians were obtained at several hundred geometries 
and used as a database to obtain a potential cen¬ 
tered around the minimum energy path. 

A potentially serious problem with a database 
of force fields in a Shepard-type interpolation is 
with extrapolation. That is, if one evaluates the 
potential at points in configuration space that are 
distant from any of the force fields, the result will 
be the extrapolated value of a force field, which 
may contain spurious oscillations or singularities. 
(By definition, force fields are local representations 
of the potential, and their behavior beyond their 
radius of convergence may be quite unphysical.) In 


the present work we will specifically address this 
problem and show how it can be overcome. 

In the next section we describe the new applica¬ 
tion of Shepard-type interpolation to a reduced 
dimensionality model of the reaction OH + CO —» 
H + C0 2 . Following that we present a new 6 de- 
gree-of-freedom potential surface for Ar-HCO, 
based on high-quality ab initio electronic energies. 
A summary and conclusions are given at the end. 


Reduced Dimensionality Potential for 
OH + CO -> H + C0 2 

The goal here is to construct a reduced dimen¬ 
sionality, 3 degree-of-freedom (3-DOF) potential 
energy surface for the OH + CO —> H + C0 2 re¬ 
action suitable for subsequent quantum calcula¬ 
tions. The coordinates we consider are the OH 
internuclear distance, r OH , the CO internuclear 
distance, r co , and R, the distance between the 
centers of mass of the OH and CO fragments. All 
previous quantum calculations of the reaction dy¬ 
namics separated the nonreactive CO bond from 
the dynamics [3-9], and our goal is to insert that 
mode back into the dynamics, although still within 
a reduced dimensionality framework. Our motiva¬ 
tion for doing this is twofold. First, resonances 
(due to complex HOCO formation), which domi¬ 
nate the reaction dynamics, in which the nonreac¬ 
tive CO bond is temporarily excited were missing 
in the previous calculations. Second, in these calcu¬ 
lations the description of the vibrational motion of 
C0 2 is highly approximate because, as is well 
known, the two CO bonds in C0 2 mix strongly to 
form normal modes of vibration. 

The first quantum studies of this reaction (in 
which the nonreactive CO mode was uncoupled) 
were done using a 2-DOF model in which the 
potential was given in terms of the coordinates 
r OH and R [3]. In that model the full 6-DOF poten¬ 
tial [10] was minimized for fixed values of r OH 
and R, and with r co adiabatically separated so 
that the zero-point motion corresponding to the 
nonreactive CO bond was added to the minimized 
2-DOF potential. 

Initially, our idea was to incorporate the CO 
bond by minimizing the same 6-DOF with respect 
to the 3 degrees of freedom, R, r OH , and r CO' 
however, we encountered isolated regions in con¬ 
figuration space where the minimization proce- 
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dure gave unstable results for extended values of 
r co* To overcome this problem, we adopted the 
following strategy. In regions in R and r OH where 
the potential behaves smoothly with respect to the 
variation of the CO distance we minimized the 
6-DOF potential for a given R and r OH , and for 
r co = 2.2 a.u. (its value at the HOCO minimum). 
We then varied r co in the range 1.8-3.2 a.u and fit 
the resulting potential to a Morse functional form, 
i.e.. 


Vmorse('co) = P,(l.- 6~ >W) , (1) 


where i refers to a specific point with coordinates 
(R ? , fo H )- We did 106 such fits at points shown in 
Figure 1. These points are centered around the 
reaction path for OH + CO -> H + C0 2 , but avoid 
the regions were the potential did not give satis¬ 
factory results. Then we represented the CO-stretch 
at any value of R and r OH by the following Shep¬ 
ard-like interpolation: 


^i( r co/ r OH' R) ~ 


£ * iv i (r OH , R ) V^orse (r co ) 


£j'Wy(rQ H , R) 


( 2 ) 


where w t is a weight given by 

jjq. — a i$i UoH ' ^0 


(3) 


with 


We varied the range parameter a i over a wide 
range [subject to the condition that Eq. (2) reduces 
to within a few percent of V^ orse (r co ) at the ith 
point], and found that our fit is insensitive to its 
exact value. This robustness of the interpolation is 
due at least in part to the fact that the Morse 
potentials do not vary greatly over the grid of 
points in R-r OH space. 

We should note that our objective is not to 
reproduce the potential at selected points. Clearly, 
by using a Morse to fit the potential and then 
using the Shepard-like switching among the Morse 
fits, the final result is in a sense twice removed 
from the original data. In general, the errors in the 
Morse fits are much greater than the errors intro¬ 
duced by switching method. With this in mind we 
examined the precision of Eq. (2) by comparing it 
to the actual potential in the CO stretch at a value 
of R and r OH not among the fits. The result is 
shown in Figure 2 for one example. As seen, the 
interpolation does provide a realistic representa¬ 
tion of the CO stretch potential. 

The final 3-DOF potential was then constructed 
by combining the previous 2-DOF potential [3] 
with Vi(r QO , r OH , R). However, since that 2-DOF 
potential contained the local zero-point energy of 
the CO, we first had to subtract that energy using 
our Vi(r co , r OH , R) potential, and then the new 
3-DOF potential is given by 

V( r cof ^oh / R) ~ R) T V 1 (r CO/ r OH , R), 

( 5 ) 


S i (R / ^qr) — ^(t "oh 7oh) T (R -R 1 ') • (4) 


where V' ld is the previous 2-DOF-dimensional po¬ 
tential of Schatz and Dyck [3] minus the local CO 
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FIGURE 1. Points where the CO stretch potential was 
fit to a Morse function. 



FIGURE 2. Comparison of the actual CO potential 
(circles) at r OH = 2.0 a.u. and R = 4.8 a.u. and the CO 
potential from the Shepard interpolation. 
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zero-point energy. Figure 3 shows a 2-DOF con¬ 
tour plot of the new surface at r co = 2.5 a.u. The 
potential is smooth and shows a single well corre¬ 
sponding to the cis-HOCO minimum as well as a 
small barrier in r OH coordinate which is consistent 
with the 2-DOF surface of Schatz and Dyck. Figure 
4 shows a 2-DOF contour plot at fixed R = 3.73 
a.u (the value of R at the cis-HOCO minimum). 
The minimum around r co = 2.2 a.u. is very dis¬ 
tinct and corresponds to the stable cis-HOCO con¬ 
figuration. 

Finally, we note that the zero-point energy of 
the CO vibration extracted from the 2-DOF effec¬ 
tive potential of Schatz and Dyck [3] is signifi¬ 
cantly lower than the experimental results for the 
asymptotic OH + CO channel, roughly 900 cm -1 
compared to 1100 cm -1 experimentally. We have 
calculated the zero-point energy of CO vibration 
using the new 3-DOF potential of Eq. (5), and it is 
in within 50 cm" 1 of the experimental value. 

The present application of a Shepard-like inter¬ 
polation to generate a 3-DOF potential from a 
2-DOF one has potentially wide applicability to 
polyatomic systems in which there are many non¬ 
reactive degrees of freedom. If the potential in 
these degrees of freedom can be represented as 
sums of functions such as Morse functions, then 
perhaps a relatively small number of determina¬ 
tions of these functions would suffice to obtain a 
realistic global representation based on the Shep¬ 
ard-like interpolation in these degrees of freedom. 



r co = 2.5 a.u. Contour values are in electron volts. 



r (CO), a.u. 


FIGURE 4. Equipotential contour plot of the new 
potential for R - 3.73 a.u. Contour values are in electron 
volts. 


Next we consider another, quite different appli¬ 
cation of a Shepard-like interpolation to the Ar- 
HCO system in all 6 degrees of freedom. 


Six Degree-of-Freedom Interaction 
Potential Surface for Ar-HCO 

As noted in the Introduction it is impossible to 
do ab initio calculations in a straightforward six¬ 
dimensional scan for a four-atom potential surface, 
so some other strategy is needed to construct a 
reasonable potential surface. In the present case, 
the system of interest is Ar-HCO, and the potential 
will be used in scattering study of collisional vi¬ 
bration energy transfer and collision-induced dis¬ 
sociation /recombination of HCO. Thus, the poten¬ 
tial surface needs to include a fairly wide variation 
in the HCO geometry. At the very least, it must 
sample the geometry around the HCO minimum, 
and the most important energetic pathway to dis¬ 
sociation. In addition, since the interaction poten¬ 
tial is of the van der Waals type, many geometries 
of the Ar atom relative to a given HCO geometry 
must be considered. 

With the above requirements in mind, we 
adopted the following strategy to construct a rea¬ 
sonable six degree-of-freedom potential. For a 
given HCO geometry we calculated the electronic 
interaction energy for 120-150 Ar geometries in 
the HCO plane. Those energies were then fit to a 
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functional form, and then the various fits, corre¬ 
sponding to 19 HCO geometries, were used in a 
Shepard-type interpolation to obtain a representa¬ 
tion for the global potential. As above, the objec¬ 
tive is not to reproduce the calculated ab initio 
points. The Shepard-type interpolation is done on 
the fits to points, and in general the errors in these 
fits are much greater than the errors introduced by 
the interpolation of the fits. 


Ab initio Calculations 

The basis set and the level of correlation and 
tests of both have been described in detail in Ref. 
[11]. Briefly, single reference, configuration interac¬ 
tion (CISD) calculations, using double-zeta and 
triple-zeta basis sets, and counterpoise correction 
were done using the code MOLPRO [12], The 
coordinates we used are body fixed, spherical co¬ 
ordinates with the origin at the center of mass of 
CO, with the z axis along CO, and H in the xz 
plane, as shown in Figure 5. The CO distance is 
denoted by r, the position of H to the center of 
mass of CO is (R, y), and in HCO plane the 
position of Ar is (Q, 6). 

For each HCO geometry, 6 varied from 0 to 
180° in equal intervals of 18° for </> = 0 and also for 
(f) = 180°. For each 8, 6 to 7 Q's were selected 
according to the shape of the potential cuts. Thus, 
ab initio calculations were done at between 120 
and 150 points per HCO geometry. The 19 HCO 
geometries considered are given in Table I. 



TABLE I_ 

HCO geometries chosen for the 
ab initio calculation. 


Geom. No. 

R (bohr) 

r (bohr) 

y (degree) 

1 

3.043 

2.288 

34.773 

2 

4.213 

2.153 

47.261 

3 

5.669 

2.288 

34.773 

4 

3.043 

2.288 

64.773 

5 

3.043 

2.288 

49.773 

6 

3.043 

2.288 

19.773 

7 

3.043 

2.488 

34.773 

8 

3.043 

2.088 

34.773 

9 

3.043 

3.088 

34.773 

10 

7.559 

2.143 

34.773 

11 

4.213 

2.153 

57.261 

12 

4.213 

2.153 

37.261 

13 

4.213 

2.303 

47.261 

14 

4.213 

2.003 

47.261 

15 

3.590 

2.173 

40.000 

16 

3.590 

2.173 

55.000 

17 

3.590 

2.173 

25.000 

18 

3.590 

2.373 

40.000 

19 

3.590 

1.973 

40.000 


In order to make some global comparisons be¬ 
tween the ab initio electronic energies and the fits 
to them, described below, we fit several sets of 
Ar-HCO energies very precisely by a somewhat 
complex functional form described earlier [11]. Po¬ 
lar contour plots of the interaction potential (based 
on these precise fits) are given in Figures 6-8 for 
the HCO minimum (geometry 1), the H—CO tran¬ 
sition state (geometry 2), and nearly dissociated H 
—CO (geometry 3), respectively. As seen, there is 
a van der Waals minimum at a cis T-shaped geom¬ 
etry that is nearly independent of the H atom 
position. 


Fits of the Planar Ar-HCO 
Interaction Potentials 

A major factor in considering a functional form 
to fit the ab initio energies for a fixed HCO geome¬ 
try is the subsequent use of the fits in a Shepard- 
type representation of the global surface. As men¬ 
tioned in the Introduction, a danger in using this 
type of representation is possible unphysical be¬ 
havior of the potential in regions away from any of 
the fits. Thus, it is important to choose a physically 
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K(bohr) 

FIGURE 6. Equipotential polar contour plot of ab initio 
interaction potential of Ar with equilibrium HCO. Contour 
values are in cm -1 . 


reasonable functional form for the fits, even if this 
entails some sacrifice in the precision of the fit. 
With this in mind we chose to represent the Ar- 
HCO interaction potential as the following sum- 
of-pairs form for a fixed HCO geometry, (R n , r n , %) 
and for planar Ar-HCO configurations: 



K(bohr) 

FIGURE 7. Same as Fig. 6, but for with HCO at 
transition state geometry. 


Im 

JZ 

a 



FIGURE 8. Same as Fig. 6, but for Ar with HCO at a 
nearly dissociated geometry. 

where 

V^ x (Rm-k) = C\%exp(-Ci%R Ar . x ) 

C^x[ 1 - exp - (i^Ar-x/2) 8 ' 



The superscript corresponds to the nth HCO ge¬ 
ometry. As seen, there are two linear and one 
nonlinear parameters C^\, C£ n ^, per two- 
body potential for a total of nine parameters. They 
were determined by nonlinear least square fitting 
to the ab initio energies separately for each HCO 
geomtery. 

The average fit error ranges from 60 to 150 
cm -1 , depending on the HCO geometry, and over 
a range of interaction energies that varies from 
roughly —100 to 7000 cm" 1 . The fitting error scales 
roughly with the value of the interaction energy. 
Polar contour plots of the fitted interaction poten¬ 
tials for three HCO geometries, corresponding to 
the ab initio ones shown in Figures 6-8, are shown 
in Figures 9-11. Comparing these, one sees that 
the fitted potentials are reasonable representations 
of the ab initio ones. Further examination of the 
fits are shown in Figure 12 for the cis planar 
Ar-HCO geometry, and for an out-of-plane config¬ 
uration, for HCO at its equilibrium geometry. As 
seen, the fit for the in-plane geometry is quite 
good, and the one for the out-of-plane one is not as 
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flr-HCO 



K(bohr) 

FIGURE 9. Same as Fig. 6 but for the fitted sum-of-pairs 
potential. 


flr-(H-CO) 



K(bohr) 

FIGURE 10. Same as Fig. 7 but for the fitted 
sum-of-pairs potential. 


good. This is not surprising given that the fit was 
made to the ab initio energies only for the in-plane 
Ar-HCO configuration. 

Global Surface 

The global 6 -DOF potential surface is given by 
the following Shepard-type representation: 

-^Ar-H / -^Ar-C ' ^Ar-O ' ^ ' T / 7 ) 

= H ^n^^Ar-H' ^Ar-C / ^Ar-o) lYh W n' (8) 

n / n 

where the weight function is, as above, a simple 
Gaussian 

w n = exp -a r (R- R n ) 2 - a 2 (r - r n f 

-a 3 (y-y„) 2 ]. (9) 

The three range parameters were chosen so that 
the global potential is smooth and close to the fit 
when the HCO geometry is close to nth geometry. 
After some trial and error we determined that the 
values, 04 = 2.0 bohrs -2 , a 2 = 25.0 bohrs -2 , a 3 = 
17.5 rads 2 , satisfied the above criteria well. (These 
orders of magnitude of the range parameters were 
mainly determined by the spacing in fits in the 
corresponding coordinates.) 


In order to test the global potential surface, a 
number of comparisons of the surface were made 
with additional ab initio calculations. In Figure 13 
we compare the global surface with ab initio ener¬ 
gies for two planar Ar-HCO configurations, as R 
is varied over a large range, with r and y at their 
values for HCO at equilibrium. As seen, the agree¬ 
ment is reasonable and physically correct, i.e., the 


Rr-(H—CO) 



-15 -10 -5 0 5 10 15 

H(bohr) 


FIGURE 11 . Same as Fig. 8 but for the fitted 
sum-of-pairs potential. 
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Q (bohr) 

FIGURE 12. Comparison of the interpolated potential 
(curve) with ab initio (circles) energies as R is varied for 
r co = 2.15 bohrs and y = 40°, for the two indicated Ar 
geometries. 



FIGURE 13. Comparison of the interpolated potential 
(curve) with ab initio (circles) energies for HCO at 
equilibrium for in-plane and out-of-plane Ar-HCO 
geometries. The in-plane interpolated potential is the 
sum-of-two-body fit to the ab initio energies. 


interaction is more repulsive in the cis Ar-HCO 
geometry (0 = 0) than in the trans one (<f> = 180°). 
Another test is for the out-of-plane interaction. 
This is shown in Figure 14 for HCO at equilibrium, 
and two values of 0 for Q = 5.67 a.u. As seen, the 
behavior of the global surface is quite reasonable. 

This potential surface is available upon request 
to JMB. 


Summary and Conclusions 

Shepard-type interpolation has been used in two 
four-atom systems. In the first, a 3 degree-of-free- 
dom reduced dimensionality potential was de¬ 
scribed for the OH 4 - CO -> H + C0 2 reaction. 
The surface was built up from an existing 2 de¬ 


gree-of-freedom potential by the addition of the 
nonreactive CO stretch. We minimized the 6 de- 
gree-of-freedom potential for fixed values of r OH 
and R and then fit the CO dependence of the 
potential to a Morse function. The CO dependence 
of the potential at an arbitrary value of R and r OH 
was determined by Shepard interpolation. This 
surface is currently being used to explore the role 
of this stretch in HOCO resonances [13]. 

The second four-atom system considered was 
Ar-HCO. In this application, high-quality ab initio 
electronic energies were calculated for 19 HCO 
geometries and 120-150 planar Ar-HCO geome¬ 
tries for each fixed HCO geometry. The energies 
were fit using a physically motivated sum of two- 
body potentials. The global surface was given again 
by a Shepard interpolation in the HCO coordi- 
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FIGURE 14. Comparison of the interpolated potential 
(curve) with ab initio (circles) energies for HCO at 
equilibrium as a function of d>, and for the two Ar-HCO 
geometries indicated. 

nates. Several tests of the interpolated surface were 
presented and demonstrated that the global sur¬ 
face is a good representation of the ab initio ener¬ 
gies. This potential is currently being used in 
quantum calculations of energy transfer, dissocia¬ 
tion, and recombination in Ar-HCO collisions [14]. 

In both examples above, we chose physically 
motivated representations of fits to the energetics, 
for HOCO to an existing 6 degree-of-freedom po¬ 


tential, and for Ar-HCO to ab initio calculations. 
These are in general better behaved than fits that 
have been used in previous Shepard-type interpo¬ 
lations; however, at the expense of not giving very 
precise fits to the original potential points. 
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ABSTRACT: Ionic materials can be conveniently described by quantum mechanical 
cluster models. In these models the electrostatic field of the crystal surrounding is often 
taken into account by embedding the cluster in an array of point charges (PCs). However, 
anions at the cluster boundary can be strongly polarized by neighboring positive PCs. 
These artifacts can be significantly reduced if a model potential description is employed 
for the nearest-neighbor cations of the surrounding. We have implemented a scheme of 
total ion model potentials (TIMP) in the density functional code LCGTO-DF. In the 
present work we compare a number of electronic properties of MgO(OOl) models, like 
core-level splittings, electron affinity, and electrostatic potential, computed with the help 
of TIMP embedding to results obtained for the same clusters surrounded by PCs only. 
The cluster models range from MgOf “ to Mg 25 0 25 . Adsorption of a CO molecule on 
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Introduction 

C hemical substances in the solid state can of¬ 
ten be efficiently described by quantum me¬ 
chanical cluster models. Systems with strongly 
ionic bonds, like alkali halides and alkaline-earth 
oxides, the properties of which are dominated by 
long-range Coulomb interaction, are also favorably 
studied by cluster models. A simple but widely 
used method to account for electrostatic effects of 
the crystal surrounding is to embed a model clus¬ 
ter in an array of point charges (PCs). This ap¬ 
proach also allows one to reproduce the electro¬ 
static potential outside the cluster in an accurate, 
yet economic fashion. This is of particular impor¬ 
tance for studies of adsorption complexes on 
strongly ionic substrates which are commonly 
formed mainly due to electrostatic interactions. 
However, a disadvantage of the point charge em¬ 
bedding is that PCs on the cluster boundaries can 
significantly distort the electron density of ions at 
the cluster edges. Therefore, the accuracy of the 
calculations depends on the distance between the 
"chemically interesting" central region of the clus¬ 
ter and its boundaries. In order to shift the bound¬ 
aries far enough one may be forced to employ 
extended, i.e., rather expensive cluster models. 
Clearly, anions with their diffuse and easily polar¬ 
izable electronic shells will be distorted much 
stronger by neighboring positive PCs at the cluster 
boundary than the more compact and less polariz¬ 
able cations by negative PC neighbors. Thus, eval¬ 
uation and subsequent rectification of spurious 
anion-PC effects on calculated observables repre¬ 
sent key problems in cluster modeling of ionic 
metal oxides. 

The outlined artifacts can be diminished by 
substituting model potential (or pseudopotential) 
cations for positive PCs closest to cluster anions. In 
this way, it is possible to restore the repulsive 
contribution of electron shells of the anions and 
cations which is missed by an embedding in a 
simple array of PCs. Less efficient from the com¬ 
putational viewpoint is the "brute force" alterna¬ 
tive whereby all positive PCs near the cluster 
borders are replaced by all-electron cations. 

The first applications of pseudopotentials in 
cluster models [1, 2] were devoted to the descrip¬ 
tion of impurities in alkali halide crystals where 
Hartree-Fock (HF) based model potentials by 


Huzinaga et al. [3] and complete-cation normcon- 
serving pseudopotentials [4] were used. A similar 
technique was found valuable to describe bulk 
properties of alkaline-earth oxides [5, 6]. Recently, 
the general procedure on how to perform calcula¬ 
tions on embedded clusters under the assumption 
of a "frozen" environment and the orthogonality 
requirements for the wave function of the cluster 
and the environment have been discussed in detail 
[7, 8]. 

CO molecules adsorbed on MgO have fre¬ 
quently been considered as a prototype system for 
representing metal oxide surface complexes [9]. 
The properties of CO/MgO cluster models embed¬ 
ded in an array of PCs have been studied exten¬ 
sively, at both the HF and post-HF levels [10, 11] 
as well as by density functional (DF) methods 
[12-14]. There is now general agreement that CO 
binds to the MgO surface mainly due to electro¬ 
static interaction. In the case of PC embedding, the 
calculated MgO-CO binding energy varies no¬ 
tably with the size of the model cluster [14]. Also, 
the electrostatic potential above in MgO surface in 
the region of the adsorbate has been found to 
depend strongly on shape and size of the model 
cluster; this effect was traced to the polarization of 
anions at the cluster borders [15]. Recently pub¬ 
lished cluster model investigations of MgO based 
on ab initio model potentials are free of this defi¬ 
ciency [8, 16, 17]. However, no such model poten¬ 
tial study has been reported so far within the 
framework of DF methods. 

The present work describes an implementation 
of the pseudopotential (PP) technique in the 
LCGTO-DF package (next section). This method 
has been applied to improve cluster models of 
MgO by using total ion model potentials (TIMP) to 
substitute positive PCs at the cluster boundaries. 
The models and calculational details are outlined 
in the third section. The present study aims at 
testing the TIMP technique within a DF implemen¬ 
tation rather than at carrying out a comprehensive 
investigation on a physical or chemical problem of 
an ionic material. Therefore, MgO has been chosen 
as a substrate which has been well studied theoret¬ 
ically and experimentally. The calculational results 
presented demonstrate how significantly features 
of the cluster models can change due to TIMP 
improved boundary conditions. One-electron 
properties of model clusters and the electrostatic 
potential near an ideal MgO(OOl) surface have 
been calculated with the help of TIMP and PC 
embedded models. Adsorption of CO molecules 
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on Mg 2+ cations on the regular MgCXOOl) surface 
and on three-coordinated defect sites has also been 
considered; Results of DF TIMP calculations will 
be compared to the corresponding ab initio data 
[17]. 

OUTLINE OF THE COMPUTATIONAL 
TECHNIQUE: TOTAL ION 
MODEL POTENTIALS 

TIMP embedding aims at overcoming the limi¬ 
tations of a simple PC embedding scheme for 
cluster models of ionic substrates by approxi¬ 
mately accounting for orthogonality effects of clus¬ 
ter electronic states with respect to states of ions in 
the environment. PC embedding accounts only for 
the electrostatic field of ions which, at large dis¬ 
tances, can be to good accuracy treated as the field 
of point charges. However, if one does not take the 
finite size of the cations into account, the electron 
density of the cluster will spread to neighboring 
positive point charges. The "frozen" electron den¬ 
sity of these cations is included in the TIMP 
description, its Pauli repulsion counteracts the at¬ 
tractive potential of the cation charge within the 
effective ionic radius. For the practical realization 
of TIMPs used here, the PP approach of replacing 
the core electrons in calculations involving heavier 
atoms is modified. All electrons of the model ions 
are included in the pseudopotential and no basis 
functions are placed on these centers. In the fol¬ 
lowing, a brief review of angular specified norm- 
conserving PPs [4, 18, 19] used in this work is 
given to introduce their application for construct¬ 
ing TIMPs. 

In the all-electron linear combination of Gauss- 
ian-type orbitals DF (LCGTO-DF) method [20] the 
nonrelativistic Kohn-Sham equation can be writ¬ 
ten as 

{-^V 2 + V ext + V c [ p ] + V xc [ p , Vp]}t/f, = erfi, 

( 1 ) 

where V ext is the nuclear potential, V c is the Hartree 
(classical Coulomb) potential of the electron den¬ 
sity p, and V xc is the exchange-correlation (XC) 
potential. The one-particle Kohn-Sham (KS) or¬ 
bitals 4/ t are expanded in a basis of Gaussian-type 
functions Xk> 

+i-I.c ktXk , (2) 

k 

and these basis functions are set up as sym¬ 


metrized contractions (linear combinations) of 
primitive functions 

xl = C l m (r,d,<p)e- a * r \ (3) 

where C l m ( r, 0, <p ) is a real solid harmonic (a har- 
monic polynomial). 

In a density functional PP calculation, the inter¬ 
action of electrons 

v„ = V c [ p] + l/ xc [ p] (4) 

is replaced by the interaction of valence electrons 
and a pseudopotential V , which accounts for the 
core electrons, 

v/ e = V c [ pj + VJ Pv ] + V p (r), (5) 

where p v denotes the valence electron density. The 
pseudopotential in a molecular calculation is set 
up as a superposition of individual atomic contri¬ 
butions V p A , 

' V p (t) = Y,V p A (r-R A ). (6) 

A 

According to the angular-momentum-specified 
normconserving PP approach [14, 18], an atomic 
pseudopotential can be written as 

L— 1 / 

V P A = V L r+ LlV, p - V L n E \lm){lm\. (7) 

1 = 0 m= — l 

Here L is chosen so that L — 1 equals the maxi¬ 
mum angular momentum of the atomic one-elec¬ 
tron orbitals which belong to the frozen core. \lm) 
denotes the spherical harmonic Y lm . In a DF for¬ 
malism the first term of Eq. (7), VI , is the same for 
all electrons with / > L; it originates from the 
Coulomb and exchange-correlation interaction of 
the valence electrons with the core shells. We note 
that this term is a local one in contrast to the 
angular momentum specified terms which contain 
the projection operators \ These latter terms 

account for the orthogonality of the valence or¬ 
bitals with l < L to the core orbitals and contain 
the proper PPs Vf , l < L. 

The PPs are determined by solving the atomic 
radial equations 

/ 1 d 2 1(1 + 1 ) Z' 

+ ~ 17 2 7 + v .[ Pul 

+ v„[ p, + p !\ +VyW(r) 

= erffir), (8) 
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where if/fir) is the radial part of a nodeless pseudo 
valence orbitals, constructed from the exact va¬ 
lence orbital following Ref. [18]. e l denotes the 
atomic one-electron eigenvalue of a valence orbital 
of angular momentum, /, and Z' is the nuclear 
charge decreased by the charge of the core elec¬ 
trons. Thus, Vj p is constructed to reproduce the 
exact valence one-electron energies for electrons in 
the field of a shielded nuclear charge, but interact¬ 
ing with the valence electron density. The frozen 
core electron density p/ is included in the XC 
potential to account for the nonlinear character of 
this quantity [18]. 

To apply the formalism sketched above to 
TIMPs, all electrons of a model center are included 
in the PP and L - 1 is chosen equal to the highest 
angular momentum of the occupied orbitals. To 
determine V/, pseudo orbitals of unoccupied lev¬ 
els are employed for each angular momentum 1 
which is present in the occupied orbital manifold. 

In the LCGTO-DF implementation, the poten¬ 
tials Vj p are generated in a numerical atomic DF 
calculation and are used in tabular form. The core 
density p c is determined as the difference of the 
total density from an all-electron atomic calcula¬ 
tion and the density due to the pseudo valence 
orbitals. To describe the Coulomb part of the local 
term VI, it is represented by a set of s-type Gauss¬ 
ian functions. The coefficients as well as the Gauss¬ 
ian exponents are determined by means of a non¬ 
linear least-squares fit according to the gradient 
extremum search method [21]. 

To evaluate matrix elements ( Xk> l^/ p l X\) °f the 
nonlocal terms of V p basis functions Xk [Eq. (3)] 
centered on atom B are projected onto spherical 
harmonics Y lm centered on the PP center A [22]. 
The projection of a simple Gaussian function is 
given by 

[c" a(r - b)2 ]/ w =47rY lm (n)e-° (r2+b2) i l (2arb), (9) 

where r = (r, fl); vector b denotes the position of 
center B in the coordinate system of a PP center 
A, and i l is a modified spherical Bessel function 
[23]. The real solid harmonic C l m can be easily 
represented as a linear combination of complex 
solid harmonics [24] 



f 477 
^ 21 + 1 



( 10 ) 


The projection onto complex solid harmonics cen¬ 
tered at A is given by [24] 

[ Qvr ( r — b>] 

_ ( -\l-i / 47r [L + M||L-M| 

1 ; V 21 + 1\ l + m j\ l - m ) 

Xr l C^Z' m (r), 1<L (11) 

where | ^ j is a binomial coefficient. The resulting 

products of spherical harmonics centered at the 
same atom [Eqs. (9) and (11) substituted in Eq. (3)] 
are coupled according to [24] 


r'% ini] (n)r% m 2 (n) 

= Y,r h+,1 ~ L r L Y LM (kl)G l ^' l * m > (12) 

LM 


with GiW' lini2 being a Gaunt coefficient. 

To determine forces in a calculation employing 
PPs or TIMPs displacement gradients of the matrix 
elements of V p with respect to atomic centers are 
required; they yield contributions which parallel 
those of the nuclear potential. The differentiation 
of the projected basis functions with respect to 
positions of all-electron centers is straightforward 
and analytical expressions have been derived. Par¬ 
tial derivatives with respect to the coordinates of 
pseudo centers can be obtained by exploring the 
translational invariance of the matrix elements. 

Although it does not constitute a special prob¬ 
lem to construct TIMPs for anions employing the 
DF representation of the exchange-correlation op¬ 
erator [6], in this work we use only TIMPs for 
Mg 2+ cations and represent the oxygen anions by 
PCs. There are two reasons for this decision: (i) the 
deficiency of conventional models with embed¬ 
ding of the all-electron (AE) cluster into a PC array 
has been attributed mainly to the distortion of 
anions at the cluster borders by positive PCs at¬ 
tracting electron density [16], while the effect of 
negative PCs on relatively compact AE cation 
neighbors is of subordinate importance; (ii) since 
free O 2- anions do not exist, additional approxi¬ 
mations, like application of a Watson sphere [6] or 
embedding into a bulk crystal [25], are required 
for generating such a TIMP. We have generated a 
PP for the neutral Mg atom (3s 2 3p° electron con¬ 
figuration). Thus, the frozen density of the Mg 2+ 
TIMP corresponds to the shells Is 2 , 2s 2 , and 2p 6 
of the neutral Mg atom. 
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Computational Details 

In the present study AE cluster models of regu¬ 
lar surface sits of MgCHOOl) range from MgO^ - to 
Mg 2 5 0 2 5 ; smaller models are all subsystems of the 
latter cluster. The Mg-O distance in all models 
was taken from the experimental MgO bulk geom¬ 
etry (2.104 A). To simulate the Madelung field of 
the crystal environment, the clusters were embed¬ 
ded in PC arrays of dimension 17 X 17 X 2 which 
allows us to consider both Mg and O adsorption 
sites within the same model. If the PC array is 
extended to a thicker slab (17 X 17 X 6), only neg¬ 
ligible alterations of calculated adsorption proper¬ 
ties are observed. For these PCs the charge value 
for an ideally ionic crystal, ±2.0 a.u., was as¬ 
sumed. A more strict approach would involve 
computing the lattice potential through an explicit 
Ewald summation over the infinite crystal [26-28]. 
Fortunately, the Madelung field can be very satis¬ 
factorily approximated by the field of a finite set of 
PCs [29]. In case of low-coordinated defect sites 
(e.g., three-coordinated corner sites of MgO) the 
PC representation of the crystal field is actually the 
only reasonable choice. 

In the new, more sophisticated model variant 
all cations that are either nearest or next-nearest 
neighbors of the outer AE cluster atoms (at the 
distance of 2.104 and 2.976 A, respectively) were 
described by TIMPs. Cations situated at larger 
distances were still modeled by PCs. Calculations 
of the Mg 9 0 9 cluster with AE Mg 2+ cations lo¬ 
cated in the positions previously occupied by 
TIMPs in the corresponding model (the cluster 
formula then is Mg 9 0 9 Mg^ + ) were also carried 
out for comparison. To model three-coordinated 
Mg 2+ sites of MgO, a cubic array 10 X 10 X 10 
point charges was used. The clusters Mg 4 0 4 and 
MgOg” at the corner of this cube were treated as 
AE parts which were surrounded by TIMPs ac¬ 
cording to the strategy just outlined for models of 
regular sites. 

Orbital basis sets and contraction coefficients, 
(15s 10 pid) -» [6s5pld] for Mg (the numbers of 
primitive Gaussians are given in parenthesis, the 
numbers of contracted basis functions in brackets) 
and (13sSpld) —> [6s5pld] for O, have been taken 
from our previous study of transition-metal atom 
adsorption on a Mg(100) surface [30]. For the atoms 
of the CO adsorbate molecule, (9s5p2d) -> 
[5s4p2d] basis sets have been employed [14]. The 


orbital bases were contracted using atomic eigen¬ 
vectors obtained in the local density approxima¬ 
tion (LDA). 

To optimize the position of the CO molecule 
over the MgO(OOl) surface, calculations for differ¬ 
ent adsorption heights were carried out, keeping 
the C-O distance fixed at the value optimized for 
the free molecule: The equilibrium height was then 
determined by fitting these energies to a polyno¬ 
mial. Then, to find the harmonic frequency of the 
C-O vibration and the equilibrium C-O bond 
length, the position of the CO center of mass was 
fixed and five points for different C-O distances 
were calculated (A q = 0.0; ±0.05; ±0.1 Bohr, 
where is the displacement from equilibrium 
bond length of free CO) and fitted by polynomials. 
To test the implementation of the analytical energy 
gradients for the pseudopotential technique, the 
geometry optimization and the frequency calcula¬ 
tions were repeated with analytical gradients. Both 
methods gave the same results within 0.001 A and 
1-2 cm -1 for geometrical parameters and har¬ 
monic frqeuencies, respectively. 

The auxiliary fitting basis sets used in the 
LCGTO-DF method to represent the electron 
charge density and the exchange-correlation po¬ 
tential during the self-consistency cycles in single¬ 
point calculations were constructed from the or¬ 
bital exponents in a standard fashion [20]. For the 
"post-SCF" (self-consistent field) evaluation of the 
total energy, as well as during the self-consistency 
cycles of geometry optimiation and frequency cal¬ 
culations employing analytical energy gradients, 
the exchange-correlation term was calculated us¬ 
ing an accurate numerical integration procedure 
[31]. 

The local density approximation to the ex¬ 
change-correlation functional as suggested by 
Vosko, Wilk, and Nusair (VWN) [32] was used to 
obtain one-electron energies of the cluster models 
presented below. To calculate electron affinities 
and ionization potentials, the total energy was 
determined evaluating the XC contribution with 
the help of the generalized gradient approximation 
(GGA) to the exchange energy as suggested by 
Becke [33] and to the correlation energy as sug¬ 
gested by Lee, Yang, and Parr (BLYP) [34], em¬ 
ploying the self-consistent LDA density. For the 
automatic geometry optimizations, the total en¬ 
ergy and the analytical energy gradients were cal¬ 
culated within the self-consistently employed 
GGA; two variants were applied, BLYP and the 
Becke-Perdew form (BP, Becke exchange [33], 
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Perdew correlation [35]). The standard counter¬ 
poise technique [36] was applied to correct binding 
energies for the basis set superposition error 
(BSSE). 


Results and Discussion 

TIMP VERSUS PC EMBEDDING: 

ONE-ELECTRON PROPERTIES OF 
CLUSTER MODELS 

As mentioned in the introduction, the oxygen 
anions in cluster models of MgO are strongly po¬ 
larized by neighboring PCs. In Figure 1 we display 
charge density plots for TIMP [Fig. 1(a)] and PC 
embedded [Fig. 1(b)] Mg 9 0 9 clusters along with 
the corresponding charge difference plot [Fig. 1(c)]. 
A density decrease in the TIMP model with re¬ 
spect to the PC case is marked by dashed lines. 
One can clearly see that all oxygen anions in the 
TIMP embedded model exhibit a regular spherical 
shape, while in the PC embedded model the an¬ 
ions at the cluster border are of ellipsoid-like shape 
where the charge density extends toward neigh¬ 
boring positive PCs. The charge difference plot 
[Fig. 1(c)] clearly identifies a nonzero electron den¬ 
sity around the nearest-neighbor positive point 
charges in the PC embedded model. This charge 
density is transferred from the oxygen anion due 
to an artificially strong attraction of the naked 
positive charge which is not balanced by repulsion 
from core electrons around the cation. 

Let us now consider how this distorted electron 
density affects one-electron properties of the mod¬ 
els. In the PC embedded models the one-electron 
levels lie lower than those in the models with 
TIMP embedding (Table I) because the attractive 
potential due to the surrounding positive PCs is 
overestimated. Both core and valence levels are 
affected. The band gap of 3.8-4.4 eV as measured 
by the highest occupied molecular orbital-lowest 
unoccupied molecular orbital (HOMO-LUMO) 
splitting calculated for the models with TIMP em¬ 
bedding is in reasonable agreement with the result 
(3.0 eV at the T point) of an Xa band structure 
calculation on a two-layer MgO slab [37]. How¬ 
ever, for well-understood reasons [38], the present 
LDA value is significantly smaller than the experi¬ 
mental value of 7.8 eV [39]. The energies of the 
LUMO (-1.2 or -1.5 eV) and HOMO (-5.4 to 
— 5.6 eV; Table I) are also quite close to the corre¬ 
sponding values at the T point of a two-layer slab 





[ 100 ] 

FIGURE 1 . Electron density contour maps for the 
cluster model Mg 9 0 9 in the plane containing the fourfold 
symmetry axis and the O — Mg — O unit: (a) TIMP 
embedding; (b) PC embedding; (c) density difference 
Ap = p(Mg 9 0 9 ,TIMP) - p(Mg 9 0 9 , PC). The contour 
values in panels (a) and (b) are 0.64, 0.16, 0.045, 0.01, 
and 0.0025, an additional contour line of 0.0008 a.u. is 
used in panel (c); dashed lines indicate negative values. 


band structure (LUMO -1.5 eV, HOMO -4.5 eV 
[37]). The shift of the cluster LUMO to low ener¬ 
gies in PC embedded models can be rationalized 
by the finding that this orbital is mainly localized 
on the most perturbed border atoms. The large 
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TABLE I_ 

Properties of PC, TIMP, and Mg 2+ all-electron embedded cluster models of the MgO(OOI) surface. 3 


Type of embedding 




PC 



TIMP 


Mg 2+ AE 

Property 

Mg 5 o 5 

M99O9 

M925O25 

Mg 5 0 5 

M 99 O 9 

Mg 25 o 25 

Mg 9 0 9 Mg? 6 2+ 

e(LUMO) 

-8.78 

-6.82 

-7.40 

-1.17 

-1.15 

-1.54 

-1.46 

e(HOMO) 

-9.02 

-9.69 

-9.51 

-5.55 

-5.43 

-5.37 

-5.68 

A e (HOMO-LUMO) 

0.24 

2.87 

2.11 

4.38 

4.28 

3.83 

4.22 

EA 

6.54 

5.05 

6.00 

-0.47 

-0.33 

0.18 

1.62 

IP 

11.33 

11.42 

10.73 

7.77 

6.93 

6.44 

7.31 

01s levels 

6 (central atom) 

-506.86 

-507.99 

-507.64 

-503.42 

-503.48 

-503.59 

-503.70 

e(edge atoms) 

-508.76 

-508.46 

-507.76 

-503.37 

-503.48 

-503.58 

-503.72 

e (corner atoms) 

— 

-508.75 

-507.84 

— 

-503.44 

-503.56 

-53.72 

Ae 

1.90 

0.76 

0.20 

0.05 

0.04 

0.03 

0.02 

Mg Is levels 

e (central atom) 

-1252.12 

-1253.12 

-1252.35 

-1247.96 

-1248.11 

-1248.18 

-1248.34 

e(edge atoms) 

-1250.04 

-1252.06 

-1252.54 

-1247.92 

-1248.04 

-1248.16 

-1248.27 

e (corner atoms) 

— 

-1250.89 

1252.71 

— 

-1248.02 

-1248.16 

-1248.22 

Ae 

2.08 

2.23 

0.36 

0.04 

0.09 

0.02 

0.12 


a HOMO and LUMO one-electron energies (e), electron affinities (EA), ionization potentials (IP), Is core level energies (e), and their 
spread Ae (for the largest cluster restricted to the subsystem Mg g O g ). All energies in eV. 


splitting of 1 s core levels in PC embedded models, 
especially in the smaller clusters Mg 5 O s (about 1.9 
and 2.1 eV; Table I) and Mg 9 0 9 (about 0.8 and 2.2 
eV), clearly demonstrates the inequivalence of 
atoms of the same kind located at different cluster 
positions. TIMP embedding reduces this level 
splitting to less than 0.1 eV (Table I). Ideally, ions 
of one sort belonging to the same layer of a regular 
MgO(OOl) surface should have identical core level 
energies. 

As a consequence of the low-lying LUMO, clus¬ 
ter models with simple PC embedding manifest an 
artificially enhanced electron affinity (EA), e.g., 
5.05 eV for Mg 9 0 9 . Here, EA is determined as the 
difference of calculated total energies of the neu¬ 
tral embedded cluster and its anion, EA = E (neutral) 
- E(anion.)• The EA values for two additional elec¬ 
trons calculated at the BLYP level for the PC 
embedded clusters Mg 5 O s and Mg 9 0 9 are 8.9 and 
7.1 eV, respectively. This implies that the border 
oxygen anions exhibit a marked oxidative power. 
We note in passing that no significant electron 
charge transfer has been found in studies on small 
adsorbates (e.g., on transition metal atoms [30]) 
deposited above the central O 2- ion of PC embed¬ 
ded clusters. Nevertheless, using PC embedded 
models for investigations of larger adsorbed 
species, like metal clusters, can lead to erroneous 
results if no special care is taken to keep the 


cluster borders far enough from the adsorption 
site. The EA overestimation is rectified in TIMP 
embedded cluster models (Table I). 

The calculated ionization potentials (IPs) of PC 
embedded models of about 11 eV are higher by 2 
eV than the experimental value of 9.1 eV [39], 
while the IPs in TIMP models of about 7 eV are 
lower than experiment by the same amount (Table 
I). It is unclear at present to which extent the 
inaccuracy of the calculated IPs is caused by the 
DF method itself or by limitations of the cluster 
models employed. Note that increasing the cluster 
" thickness" by additional PC layers does not lead 
to any significant changes. 

We conclude this comparison of PC and TIMP 
embedded cluster models by noting that the TIMP 
embedding procedure very satisfactorily repro¬ 
duces all cluster properties discussed here as 
judged by the results for the AE correspond¬ 
ing cluster models (cf. TIMP Mg 9 0 9 and AE 
Mg 9 0 9 Mgi6 + ; Table I). 

ELECTROSTATIC POTENTIAL ABOVE 

THE SURFACE 

It is generally accepted that the electrostatic 
potential plays a key role in interactions of adsor¬ 
bates with ionic substrates. Two recent theoretical 
investigations are devoted to the study of the 
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electrostatic potential above the MgCXOOl) surface 
in different cluster models. Ferrari and Pacchioni 
[15] carried out HF calculations on PC embedded 
models of five-coordinated Mg 2+ and O 2- sites at 
a regular surface and on three-coordinated O 2- 
corner sites. They found that in PC embedded 
models the electrostatic potential above regular 
sites depends significantly on the cluster size and 
is rather different from that obtained in periodic 
slab HF calculations. In particular, in PC embed¬ 
ded models the electrostatic potential above a 
Mg 2+ site is much stronger and decays slower 
with the increasing distance from the surface than 
in the periodic model. At typical adsorbates dis¬ 
tances above O 2 sites the potential is even of the 
wrong sign. Nygren et al. [8] studied models of 
regular surface Mg 2+ and O 2 ” sites with the help 
of both PC and TIMP embedding approaches us¬ 
ing the HF method. They found that values of the 
electrostatic potential obtained in TIMP models of 
various size are rather similar and above Mg 2 + 
sites the electrostatic potential is close to that one 
generated by a single Mg 2+ cation surrounded by 
an infinite PC array. 

Here we consider the electrostatic potential only 
above Mg 2+ sites of regular (001) surface of MgO. 
Figure 2 shows the dependence of the electrostatic 
potential felt by a positive unit charge at variable 


height above the central Mg 2+ cation of different 
cluster models. The point charge array of the size 
used in the cluster embedding, with all ions repre¬ 
sented by PCs of charge ± 2.0 a.u., has been taken 
as a reference. The resulting electrostatic potential 
(Fig. 2, solid line) is quite similar to that obtained 
from HF calculations on a slab model [15] or a 
Mg 2+ cation surrounded by an infinite array of 
PCs [8]. Since magnesium oxide is a highly ionic 
system with a small covalent contribution and 
nearly complete charge separation, a good quan¬ 
tum chemical model should mimic to a significant 
extent the electrostatics of the ideal PC representa¬ 
tion. 

The PC embedded models Mg 5 0 5 and Mg 9 0 9 
(diamonds and squares, respectively) yield an 
electrostatic potential much larger than that of a 
PC array. Also the electrostatic potential of these 
two cluster models extends to significantly larger 
distances, featuring a rather slow decay which is 
quite different from that of the reference system. 

At variance with the results of PC embedded 
models, the TIMP strategy yields electrostatic po¬ 
tential curves that are very similar to each other 
and to the reference curve of the ideal PC system, 
but are somewhat lower in value than the latter. 
At distances typical for the location of adsorbed 
molecules (CO in Fig. 2) the TIMP curves are even 



FIGURE 2. Electrostatic potential felt by a positive charge at distance r above the central Mg z+ cation in different 
cluster models of MgO(001). Typical values for the calculated positions of the nuclei of adsorbed CO are indicated by 
arrows. The results for different models are marked as follows: 17 x 17 x 2 PC array (solid line). MgOf ~ TIMP (a), 
Mg 9 0 9 TIMP (O), Mg 9 0 9 Mg?f + (x), Mg s O s PC (O), Mg 9 0 9 PC (□). 


982 


VOL. 65, NO. 5 







BOUNDARY CONDITIONS FOR MgO CLUSTERS 


below zero. This implies that the corresponding 
model potential for electrons is slightly more re¬ 
pulsive, although the absolute values of the elec¬ 
trostatic potential are very small. The electrostatic 
potential curves for Mg 5 O s and Mg 9 0 9 TIMP em¬ 
bedded models agree to graphical accuracy; there¬ 
fore, only the curve for the latter model is shown 
in Figure 2 (circles). The lower electrostatic poten¬ 
tial compared to the reference system can be at¬ 
tributed to a stronger screening of the central Mg 2+ 
cation by the electron density of very diffuse shells 
of neighboring O 2- anions. The electrostatic po¬ 
tential produced by the model Mg 9 0 9 Mg^ + 
(crosses) where all 16 TIMPs are substituted by AE 
Mg 2+ cations is slightly higher than the reference 
potential due to a small charge transfer from the 
Mg 9 0 9 subsystem into valence orbitals of the 16 
cations. Not unexpectedly, all electrostatic poten¬ 
tials calculated at the DF level are in good agree¬ 
ment with the electrostatic potentials obtained for 
similar models with the help of the F1F method [8, 
15]. 

CO ADSORPTION ON REGULAR 

SURFACE SITES 

The interaction of an MgO substrate with an 
adsorbate like CO provides a good test of the 
TIMP DF models. The adsorption of CO on the 
regular MgO(100) surface has been frequently 
studied theoretically (see, e.g., [9] and references 
therein). Most of these cluster studies employed 
the PC embedding approach. The TIMP technique 
was applied only recently in modified coupled 
pair functional (MCPF) calculations where the 
electron correlation was treated with high accuracy 
[16, 17]. 

One of the most important characteristics of the 
MgO-CO interaction derived from quantum 
chemical calculations is the adsorption energy. In a 
previous DF study of CO adsorption on MgO(001) 
[14] based on the BLYP functional we calculated 
binding energy values of 0.20, 0.21, 0.11, and 0.09 
eV for the PC embedded clusters Mg 5 O s , Mg 9 0 9 , 
Mg 2 i0 2 i, and Mg 25 0 25 , respectively. (Note that 
PCs of ±1.8 a.u. were used.) Based on the analysis 
of the electrostatic potential in the previous sec¬ 
tion, one expects a weaker adsorption interaction 
in TIMP models. Recently, Nygren and Pettersson 
computed the energy of CO interaction with a 
TIMP embedded cluster Mg 5 0 5 to 0.08 eV at the 
MCPF level [17]. In our present DF TIMP calcula- 



FIGURE 3. Sketch of the cluster model Mg 9 0 9 -C0. 

tions the interaction of CO with the MgO(001) 
surface (Fig. 3) turned out to be extremely weak. 
Although we were able to locate a potential energy 
minimum at the BLYP level, the binding energy 
was found to be negative after the BSSE correction 
was taken into account (Table II). This finding is in 
line with results of previous DF studies where the 
BLYP functional was found to underestimate lig¬ 
and-metal binding energies [31]. The BP ex¬ 
change-correlation functional yields slightly short¬ 
er equilibrium values for the Mg-C distance and 

TABLE II_ 

Observables for CO adsorption on Mg cations of a 
regular MgO(OOI) surface calculated for clusters 
embedded using the TIMP technique or AE 
Mg + 2 cations. 3 



h e (Mg 

-C) 

D e 


Cluster model 

BLYP 

BP 

BLYP 

BP 

TIMP embedding 
MgO® - 

2.57 

2.46 

-0.016 

0.021 

Mg 5 0 5 

2.53 

2.49 

-0.011 

0.044 

Mg 9 0 9 

Mg 2+ AE embedding 

2.56 

2.49 

-0.003 

0.054 

Mg 9 0 9 Mgf + 

2.56 

2.49 b 

0.010 

0.066 


a Surface~CO distance h e ( Mg-C) (in A) and binding energy 
D e (in eV), corrected for the BSSE, both for the BLYP and 
the BP exchange-correlation energy functionals. 
b Height as optimized for the TIMP embedded cluster model 
Mg 9 0 9 at BP level. 
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binding energy values of 0.02-0.05 eV for the vari¬ 
ous cluster models. These larger values are compa¬ 
rable to the result of the MCPF study, but are still 
very small, about 1 kcal/mol. Apparently, an ac¬ 
curate description of the CO-MgO interaction at 
cationic sites of an ideal MgO(OOl) surface requires 
the proper evaluation of weak interaction contri¬ 
butions, such as dipole-dipole and van der Waals 
interactions. 

This notable weakening of the interaction of CO 
with a MgO substrate can be attributed to two 
features of the TIMP models. First, the electrostatic 
potential above Mg 2+ at a distance of about 3 A 
where CO molecule is situated is very small; it 
almost vanishes near the oxygen nucleus (see the 
previous section). As a consequence, the attraction 
of the CO molecule to the surface due to the 
adsorbate polarization is less than in PC embed¬ 
ded models. Second, in the TIMP models the O 2 " 
substrate anions are more extended in the direc¬ 
tion of the adsorbate. This leads to a stronger 
repulsion between the CO molecule and the closed 
shells of the substrate anions. Thus, electrostatic 
attraction and Pauli repulsion cancel each other, 
resulting in an almost vanishing adsorption en¬ 
ergy. Note that the change of the electrostatic po¬ 
tential is also a consequence of the modified shape 
of the oxygen anions which in the TIMP models 
screen the Mg 2+ cations much better. 

It is worth mentioning that the most recent 
experimental estimations of the adsorption energy 
of CO on MgO(OOl) are above 0.4 eV [40, 41] while 
the most accurate quantum chemical investiga¬ 
tions [8] including the present study furnish val¬ 
ues lower than 0.1 eV. This clear disagreement has 
been not resolved although a plausible explanation 
has been given [8]. Our results allows us to com¬ 
ment on this discrepancy in the next section. 

CO ADSORPTION ON 

THREE-COORDINATED SITES 

Finally, we apply the TIMP technique to surface 
defects where the interaction of a CO molecule is 
expected to be stronger. Advantages and disad¬ 
vantages of the various types of embedding mod¬ 
els will be more evident. Also, low-coordinated 
defect sites of MgO are of primary interest for 
chemical applications such as catalysis. We con¬ 
sider here only models of the three-coordinated 
Mg site as it is the most characteristic one. Evi¬ 
dently, regular surface sites and three-coordinated 
corner sites are two limiting cases, and the results 


for the four-coordinated cationic site are expected 
to fall between these two limits. 

The results for CO adsorption at a three-coordi¬ 
nated corner Mg cationic site are given in Table III. 
On the unrelaxed site (where the substrate struc¬ 
ture is kept frozen at bulk termination), the bind¬ 
ing energy for CO adsorption is calculated to 
0.47-0.48 eV for the BLYP exchange-correlation 
functional. The fact that the minimal cluster model 
MgOJl" and the stoichiometric cluster Mg 4 0 4 yield 
almost the same results is indicative for the quality 
of TIMP embedding. The BLYP binding energy 
value for the unrelaxed corner site is essentially 
identical to that computed by Nygren and Petters- 
son with the MCPF method [17]. It is interesting 
that the CO vibrational frequency shift of 26-27 
cm -1 is significantly smaller than the value of 56 
cm" 1 obtained by Nygren and Pettersson [17]. 

For the cluster model MgO*” we also consid¬ 
ered the relaxation of the top magnesium atom. It 
moves down toward the O anions by 0.32 A, 
when the positions of all the other atoms are kept 
fixed. This is in line with the HF result, 0.26 A, 
obtained with PC embedding [42]. At a relaxed 
corner site, the energy of CO adsorption is calcu¬ 
lated to 0.37 eV, about 0.1 eV smaller than at the 
unrelaxed site. Despite this weaker interaction en¬ 
ergy, a larger C-O frequency shift of 39-41 cm" 1 
is obtained. For this model, both BLYP and BP 
energy functionals yield very similar results (Table 
III). 

TABLE III_ 

Observables for CO adsorption on 
three-coordinated Mg cations of MgO calculated 
for clusters embedded using the TIMP techniques. 3 


Cluster model 

h e (Mg-C) 

r e (C-O) 

D e 

0) e 

Aw e 

Un relaxed 
MgOg - , BLYP 

2.31 

1.1290 

0.469 

2168 

26 

Mg 4 0 4 , BLYP 

2.31 

1.1288 

0.482 

2169 

27 

Relaxed 
MgOg", BLYP 

2.36 

1.1287 

0.366 

2183 

41 

MgO|“, BP 

2.36 b 

1.1284 

0.376 

2192 

39 

CO molecule 
BLYP 

. 

1.1345 

_ 

2142 

_ 

BP 

— 

1.1340 

— 

2153 

— 


a Mg-CO distance h e (Mg-C) and C-0 distance r e (C-O) 
(in A), binding energy D e (in eV, corrected for BSSE), 
harmonic frequency o) e of the C-O vibration and the corre¬ 
sponding adsorption-induced frequency shift Aw e (in 
cm “ 1 ). 

b Height as optimized at BLYP level. 
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Our data for the relaxed three-coordinated site 
are very similar to experimental data for CO ad¬ 
sorption on MgO films grown over a Mo(100) 
surface [41]. In this latter work, an infrared (IR) 
absorption peak at 2178 cm -1 (35 cm” 1 blue- 
shifted with respect to gas-phase CO) is observed 
under ultrahigh vacuum conditions at a tempera¬ 
ture of 90 K. The adsorption energy has been 
determined from isothermal adsorption and tem¬ 
perature-programmed desorption to 0.43 and 0.46 
eV, respectively [41]. It has been argued [41] that 
the supported MgO film used in the experimental 
study exhibits a low concentration of surface de¬ 
fects, but there is no direct information from ex¬ 
periment as to which type of sites are being probed 
by CO adsorbates. The present DF results support 
the proposition [17] that only CO adsorption at 
low-coordinated defect sites of MgO(001), but not 
at regular five-coordinated Mg sites, yields an ad¬ 
sorption energy exceeding 0.4 eV as measured by 
He et al. [41]. 


Summary 

We have presented a DF implementation of the 
TIMP embedding technique which opens opportu¬ 
nities for cluster model studies of ionic materials 
with improved boundary conditions. Comparative 
calculations of PC and TIMP embedded cluster 
models showed that the TIMP approach success¬ 
fully repairs the main deficiency of PC embedded 
models, namely the artificial polarization of oxy¬ 
gen anions by positive PCs at the cluster border. 
TIMP cluster models yield substantially improved 
one-electron energies. The splitting of core levels 
of Mg and O atoms in various cluster positions is 
reduced to less than 0.1 eV. HOMO and LUMO 
energies come substantially closer to the corre¬ 
sponding values calculated within periodic DF slab 
models; the HOMO-LUMO gap increases by 1-2 
eV. As a consequence of the low-lying LUMO, 
cluster models with simple PC embedding feature 
an exaggerated electron affinity; this shortcoming 
is also rectified in TIMP embedded cluster models. 
For TIMP cluster models, the electrostatic potential 
above the central Mg 2+ cations in the adsorbate 
region is much weaker than in models with simple 
PC embedding, resulting in significantly reduced 
adsorbate-substrate interactions. In these novel 
TIMP models the adsorption energy of CO on a 
regular MgO(OOl) surface is calculated to be only 


about 0.05 eV at BP level. For the CO adsorption at 
the partially relaxed three-coordinate corner site 
the calculated binding energy is 0.38 eV and the 
CO vibrational frequency shift is 39 cm” 1 . 
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ABSTRACT: Transition-metal (TM)-main-group multiply bonded complexes have 
been heavily studied, due to their usefulness in a variety of applications, e.g., catalysis. 
The large majority of multiply bonded complexes are early TM systems, but late metal 
complexes are becoming more common and have been shown to be potent catalysts. An 
investigation of ligand and substituent effects for representative late TM multiply 
bonded complexes with heavy (Pt-phosphinidenes) and light (Ru-alkylidenes) elements 
is presented. Fundamental differences in the response of the metal-ligand multiple bond 
are observed for light vs. heavy main-group elements for these late TM systems. The 
results are, in many respects, diametrically opposed to those seen for their early TM 
congeners. © 1997 John Wiley & Sons, Inc. Int J Quant Chem 65: 987-996, 1997 


Introduction 

T ransition-metal (TM)-main-group (MG) mul¬ 
tiply bonded complexes have been heavily 
studied since the discovery of metal carbenes in 
1964 [1], Multiply bonded complexes participate in 
a wide variety of important reactions, such as 
methane activation by imidos (L n M = NR) [2] and 
alkane oxidation by oxos (L„M = O) [3]. The ma¬ 
jority of TM = MG complexes involve MG ele¬ 
ments from the first row and TMs from the early 
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to middle portions of the transition series. Analogs 
with heavier MG elements are becoming more 
common [4]. Likewise, multiply bonded com¬ 
plexes of later TMs (TM late = MG) are also being 
more heavily studied, due in large part to their 
intermediacy in different catalytic processes [5]. 

The seeming instability of TM late = MG com¬ 
plexes has been explained to arise from the repul¬ 
sion between the rf-orbital electron density on the 
late TM and MG orbitals that could be used in 
7r-bonding [6]. As described by Mayer for oxos, 
this leads to the occupation of orbitals with 7r-anti- 
bonding character and, hence, weakening and 
polarization of the metal-element 7r-bond [6]. In 
valence bond terms, such a situation suggests a 
balance between resonance structures such as A 
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and B. The qualitative model proposed by Mayer 
should apply: 

L„M + —E“ —R L„M=E—R 

/ 

A B 

whether the MG is light or heavy, but the anti¬ 
bonding interaction may exacerbate the already 
weaker 77-bonding power observed in heavier MG 
elements. 

In the majority of cases, the multiple bond is the 
"active site" which defines the reactivity of a 
TM=MG complex. The reactivity of TM=MG 
complexes of early TMs, particularly for light MG 
elements, is often limited by the thermodynamic 
and kinetic stability of the multiple bond. While 
this is advantageous in terms of experimental iso¬ 
lation of such species, it is an obvious limitation in 
terms of utilizing TM=MG complexes for further 
reactions. Furthermore, our computations on mul¬ 
tiply bonded complexes of early TMs suggest that 
the strength of the multiple bond engenders a 
degree of insensitivity to changes in the ligands 
and substituents, making it difficult to substan¬ 
tially tailor their chemistry through synthetic mod¬ 
ification [7]. Conversely, given the instability and 
metastability of multiple bonds involving later 
TMs, it may be possible to harness them for syn¬ 
thetic and catalytic benefit if one could assess the 
sensitivity of TM late = MG complexes to their 
chemical environment. As it is often difficult to 
obtain direct, detailed experimental information 
on TM late = MG complexes, they are ideal candi¬ 
dates for computational research. 

In the present research, we chose two 
TM late =MG systems for study: One involves a 
lighter MG element (Ru-alkylidenes) and the other 
a heavier MG element (Pt-phosphinidenes). Our 
goal is to more fully investigate the bonding in 
TM late = MG complexes using quantum calcula¬ 
tions. An investigation of the electronic and molec¬ 
ular structure of platinum-phosphinidene and 
ruthenium-alkylidenes complexes is presented. 
Platinum complexes studied are of the form 
(PX 3 ) 2 Pt = PR, where (PX 3 ) 2 is two monodentate 
or a single bidentate phosphine and R is an aryl or 
alkyl group. These systems are models of those 
recently proposed by Glueck et al. as intermediates 
in metal-catalyzed olefin hydrophosphination [5a]. 
The d 8 (X 3 P) 2 Pt° moiety is isolobal to methylene 
and other isovalent groups. This suggests some 
multiple bonding for the PtP bond. On the other 


hand, the d s configuration of (X 3 P) 2 Pt° will result 
in occupation of orbitals which are PtP 7r-antibond- 
ing. It was a goal of this research to determine if 
electronic effects can stabilize a Pt-phosphinidene. 
All known phosphinidenes contain very bulky lig¬ 
ands to prevent oligomerization [8]. This research 
used small models that allow electronic properties 
to be studied, but kept the models computation¬ 
ally feasible. 

In contrast to the instability of platinum phos¬ 
phinidenes, ruthenium alkylidenes have been syn¬ 
thesized, isolated, and structurally characterized. 
Alkylidenes of Ru have shown high activity in 
ring-opening metathesis polymerization (ROMP), 
ring-closing metathesis (RCM), and acyclic diene 
metathesis polymerization (ADMET) [9]. Grubbs 
et al. showed that Ru complexes are very tolerant 
of functional substrates and catalyze the ROMP of 
both high- and low-strain cyclic olefins [9a,b]. 
Ruthenium systems have also been studied as wa¬ 
ter-soluble catalysts, being stable in protic media 
[5, 10]. Grubbs et al. increased the metathesis ac¬ 
tivity of Ru catalysts by replacing the triphenyl- 
phosphine ligands with better cr-donating 
alkylphosphines [9b], so, clearly, the electronics of 
the metal are an important parameter to be fine- 
tuned. It has been suggested that the active species 
in the catalytic process have a ruthenium- 
carbon multiple bond [9a]. An investigation of 
ligand and substituent effects on the Ru = C bond 
is presented. Square pyramidal complexes of the 
form Ru(PX 3 ) 2 (Y) 2 ( = CZ 2 ) have been modeled 
(where X,Y,Z = H, Cl, F, or Me). As with plat¬ 
inum phosphinidenes, the substituent size has been 
kept small to keep the computational time reason¬ 
able. 


Computational Methods 

Calculations employed the sequential and paral¬ 
lel versions of the GAMESS quantum chemistry 
program [11]. The effective core potentials (ECPs) 
and valence basis sets of Stevens et al. [12] were 
used for all heavy atoms and the — 31G basis set 
for H. In the Stevens scheme, all ECPs for elements 
heavier than neon are derived from Dirac- 
Hartree-Fock calculations and thus implicitly in¬ 
clude Darwin and mass-velocity relativistic ef¬ 
fects; spin-orbit coupling is averaged out, using a 
/-weighted averaging scheme, in potential genera¬ 
tion. ECPs replace the innermost core orbitals for 
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platinum and all core orbitals for MG elements. 
Thus, the ns, np, nd, (n + l)s, and (n + l)p shells 
are treated explicitly for ruthenium and platinum; 
for the MG elements, ns and np are treated explic¬ 
itly. The Pt and Ru valence basis sets are qua¬ 
druple and triple zeta for the sp and d shells, 
respectively, while MG elements have a double- 
zeta-plus-polarization valence basis. 

Geometries were optimized using restricted 
Hartree-Fock (RHF) and wave functions for 
closed-shell singlets. Bond lengths and angles for 
TM complexes are typically predicted to within 
1-3% of experiment using this computational 
scheme, termed RHF/SBK(d), involving com¬ 
plexes in a variety of geometries, oxidation states, 
and metals from the entire transition series [13]. 


Results 

PLATINUM PHOSPHINIDENES 

The geometry of the Pt-phosphinidenes is 
shown in 1. All complexes were optimized under 
C s symmetry. Substituents are listed in Tables I-IV 
along with corresponding calculated data. The par¬ 
ent complex, where PX 3 = PH 3 and R = H, is a C s 
symmetry minimum with Pt=P = 2.196 A and 
Pt=P—H = 102.7°. When Pt=P—H is fixed 
linear (Pt = P—H = 180°), Pt=P decreases by 
0.03-2.166 A, supporting the idea that forcing 
phosphorus lone-pair donation to the metal will 
shorten the bond and strengthen the Pt=P bond. 


The linear structure is not a minimum, though. 
Experimental models are rare; Pt—P bonds in the 
phosphido dimers discussed by Glueck et al. [5a] 
are roughly 2.335 A (trans to Cl). Calculated Pt=P 

o 

bond lengths are several percent shorter ( ~ 2.2 A), 
consistent with some multiple bonding character* 
(Tables I-IV): 

x 3 p^ 

p t = p^-R 

X 3 P 

1 

Alkyl and Silyl Phosphinidenes 

The aliphatic and silyl derivatives modeled are 
listed in Table I. Upon changing the phosphorus 
substituent from H to Me, Et, *Pr, and *Bu, there is 
very little change in bond length (0.002 A), al¬ 
though the Pt = P—R angle increases (102.7°, 
111.8°, 112.2°, 113.8°, and 114.6°, respectively), due 
to the larger phosphinidene substituent. The geo¬ 
metries of the two *Pr conformations studied are 
shown in 2a and 2b; there is no difference in Pt=P 
bond lengths between 2a and 2b, and the 
Pt=P— r angles differ by only 1.4°. The con- 
former with methyl groups pointing down, 2b, is 
lower in energy by ~ 1 kcal mol” 1 than that of 2a 
at the RHF/SBKW) level of theory. Replacing the 
phosphinidene substituent in the parent complex 
with SiH 3 and CH 2 SiH 3 has little effect on the 

^Throughout the article, the Pt—phosphinidene bond will 
be indicated as Pt=P to distinguish it from the Pt-phosphine 
bonds. 


TABLE 1_ 

Alkyl and silyl phosphinidene derivatives. 3 


X 

R 

Pt=P (A) 

Pt—P— R (°) 

Q_ 

II 

Q_ 

o 

CD 

BO(Pt—R) 

Q (Pt) 

g(P) b 

H 

H 

2.196 

102.7 

1.810 

0.898 

-0.645 

0.061 

H 

H (linear) 

2.166 

180.0 

1.759 

0.937 

-0.442 

-0.327 

H 

Me 

2.198 

111.8 

1.781 

0.924 

-0.680 

0.129 

H 

Et 

2.197 

112.2 

1.811 

0.923 

-0.703 

0.153 

H 

'Pr (2a) 

2.197 

113.8 

1.778 

0.869 

-0.718 

0.157 

H 

'Pr (2b) 

2.197 

112.4 

1.792 

0.875 

-0.710 

0.148 

H 

f Bu 

2.197 

114.6 

1.822 

0.847 

-0.745 

0.184 

Me H 

2.200 

102.9 

1.828 

0.891 

-0.787 

-0.040 

dpm H 

2.192 

102.4 

1.734 

0.899 

-0.443 

- 0.053 

F 

H 

2.215 

101.3 

1.614 

0.903 

-0.665 

0.194 

H 

CH 2 SiH 3 

2.196 

111.9 

1.854 

0.850 

-0.680 

0.139 

H 

SiHg 

2.196 

101.9 

1.817 

0.976 

-0.715 

-0.003 


a See structure 1 for definitions of X and R. The bond order (BO) is calculated as 

determined from a Mulliken population analysis [15]. 

b The atomic charge on the phosphinidene phosphorus is q( P). 

described in 

[14]; atomic charges (q) are 
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TABLE II 


Halide phosphinidene derivatives. 3 


X 

R 

Pt=P(A) 

Pt—P—R (°) 

BO(Pt=P) 

BO(P—R) 

g(Pt) 

g(P) b 

H 

H 

2.196 

102.7 

1.810 

0.898 

-0.645 

0.061 

F 

H 

2.215 

101.3 

1.614 

0.903 

-0.665 

0.194 

H 

Me 

2.198 

111.8 

1.781 

0.924 

-0.680 

0.129 

H 

CH 2 F 

2.194 

109.2 

1.769 

0.939 

-0.751 

0.217 

H 

chf 2 

2.189 

107.8 

1.826 

0.904 

-0.618 

0.100 

H 

cf 3 

2.187 

106.6 

1.820 

0.875 

-0.645 

0.187 

H 

ch 2 ci 

2.193 

108.4 

1.781 

0.891 

-0.710 

0.233 

H 

CH 2 Br 

2.193 

107.9 

1.790 

0.892 

-0.703 

0.227 

H 

CH 2 I 

2.193 

107.7 

1.793 

0.870 

-0.700 

0.238 


a See footnote a to 

Table 1. 







b See footnote b to 

Table 1. 







TABLE III 


Aromatic phosphinidene derivatives. 3 

X 

R 

°2 

CL 

II 

CL 

Pt—P—R (°) 

Q_ 

II 

CL 

O 

CD 

BO(P—R) 

g(Pt) 

g(P) b 

H 

H 

2.196 

102.7 

1.810 

0.898 

-0.645 

0.061 

H 

C 6 H 5 

2.204 

115.1 

1.730 

1.066 

-0.697 

-0.012 

H 

C 6 H 5 (3) 

2.194 

108.3 

1.750 

0.823 

-0.647 

0.055 

H 

4-C 6 H 4 F 

2.193 

108.3 

1.759 

0.810 

-0.649 

0.059 

H 

2 ,6-C 6 H 3 F 2 

2.188 

107.3 

1.726 

0.718 

-0.619 

0.183 

H 

3,5-C 6 H 3 F 2 

2.191 

108.1 

1.756 

0.808 

-0.640 

0.065 

H 

2,4,6-C 6 H 2 F 3 

2.188 

107.4 

1.735 

0.705 

-0.619 

0.181 

H 

c 6 f 5 

2.186 

105.9 

1.737 

0.696 

-0.619 

0.183 

H 

Pyridine 

2.191 

107.9 

1.750 

0.819 

— 0.644 

0.082 

a See footnote a to Table 1. 
b See footnote b to Table 1. 


TABLE IV 


Cyano phosphinidene derivatives. 

X 

R 

Pt=P (A) 

Pt—P—R (°) 

BO(Pt=P) 

BO(P—R) 

g(Pt) 

g(P) b 

H 

H 

2.196 

102.7 

1.810 

0.898 

-0.645 

0.061 

H 

CH 2 —CN 

2.191 

109.2 

1.821 

0.873 

-0.687 

0.209 

H 

C(CN) 3 

2.178 

104.5 

1.871 

0.717 

-0.633 

0.299 

H 

c(ch 3 ) 2 cn 

2.191 

111.9 

1.829 

0.842 

- 0.728 

0.241 

dpm 

CH 2 —CN 

2.184 

109.3 

1.715 

0.879 

-0.456 

0.088 

a See footnote a to Table 1. 







b See footnote b to Table 1. 
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Pt = P bond length: 



When the phosphine ligands are changed from 
PH 3 to PMe 3 , the Pt = P bond length increases 
slightly from 2.196 to 2.200 A. The Pt = P—H 
angles are near identical: 102.7° in (PH 3 ) 2 Pt = PH 
and 102.9° in (PMe 3 ) 2 Pt = PH. There is an even 
larger increase (0.02 A) in the Pt = P bond length 
when the parent phosphines are replaced by the 
very good 7r-acceptors PF 3 . Finally, replacement of 
the two phosphine ligands with a single bidentate 
diphosphine (dpm, H 2 PCH 2 PH 2/ a model of 
dppm) shortens the Pt = P bond length from 2.196 
to 2.192 A. 

Haloalkyl Phosphinidenes 

The haloalkyl-substituted phosphinidenes mod¬ 
eled are listed in Table II. The Pt=P bond length 
remained constant at 2.193 A for the halomethyl- 
substituted derivatives (H 3 P) 2 Pt=P—CH 2 X (X 
= Cl, Br,I). The bond angles changed a few de¬ 
grees over the fluoromethyl series (106.6° [in 
(PH 3 ) 2 P t = PC F 3 ] -109.2° [in 

(PH 3 ) 2 Pt = PCH 2 F]). Upon replacing R = Me 
with R = CH 2 F, the bond length reduces from 
2.198 to 2.194 A and further decreases to 2.189 A 
when R = CHF 2 and to 2.187 A when R = CF 3 . 
This suggests that the Pt=P bond can be affected 
through the a framework, assuming the primary 
influence of the fluorine is by way of inductive 
(electronegativity) effects. 

Aryl Phosphinidenes 

Table III lists the aromatic substituents studied. 
Conformer 3 was assumed for all complexes stud¬ 
ied based on experimental models such as 
Cp 2 Mo=PMes* [8a]. A gradual decrease is seen 



as more fluorines are added, except for the 3,5-di- 
fluoro derivative, which increases slightly from the 
2,6-difluoro derivative (2.191 vs. 2.188 A when 
R = 2,6-C 6 H 3 F 2 ). The Pt=P Q bond length de¬ 
creases from 2.194 to 2.186 A after completely 
replacing the phenyl hydrogens with fluorines 
(R = C 6 H 5 vs. R = C 6 F 5 ). Also note that a 
para- F substituent has little effect as can be seen 
by comparing (PEt 3 ) 2 Pt=P — Ph with 
(PH 3 ) 2 P t = P — p-C 6 H 4 F or 
(PH 3 ) 2 P t = P-2,6-C 6 H 3 F 2 with 
(PH 3 ) 2 Pt = P-2,4,6-C 6 H 2 F 3 . The shorter Pt=P 
bond of (PH 3 ) 2 Pt=P—2,6-C 6 H 3 F 2 (2.188 A) vs. 
that in (PH 3 ) 2 Pt=P—3,5-C 6 H 3 F 2 (2.191 A) can 
be interpreted as pointing to the greater impor¬ 
tance of inductive (a) over conjugative (77) effects 
in terms of influencing the Pt-phosphinidene link¬ 
age (Table III): 
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Cyanoalkyl Phosphinidenes 

An interesting series of complexes are the cyano 
derivatives listed in Table IV. The bond angles are 
not correlated with bond length. The shortest bond 
length has the smallest angle. The largest angle 
[R = C(CH 3 ) 2 CN, Pt=P—R = 111.9°] is likely 
due to the larger substituent size rather than to 
any electronic effect. All cyano derivatives have a 
shorter Pt=P bond length than that of the parent 
complex (R = H, Pt=P = 2.196 A). The tricyano 
complex [R = C(CN) 3 ] has the smallest Pt—P bond 
length (2.178 A) of all the complexes modeled, 
supporting the above statement (Table IV). 

RUTHENIUM AUKYUIDENES 

The two idealized conformations of the Ru(IV) 
alkylidenes are shown in 4. All complexes were 
optimized under C 2v symmetry. Tables V and VI 
list the substituents along with the corresponding 


calculated data: 


PX, 

PX 3 „ 

1 3 ,y z 

1 ,Y ... 

Ru = C. 

Ru=C' s f 

Y^ 1 Z 

Y^ 1 ^ 

PX 3 

PX 3 


4a 4b 


( = CH — p-C 6 H 4 C l(PCy 3 ) 2 and 
RuC1 2 ( =CHCH—CPh 2 )(PCy 3 ) 2 , both character¬ 
ized by Grubbs et al. [9f], and cationic 
[ RuCl(=CHCH=CPh 2 )(CO)(P ‘ Pr 3 ) 2 ] + , charac¬ 
terized by Esteruelas et al. [16]. All three com¬ 
plexes correspond to conformer 4b. 
RuC1 2 ( = CH — p-C 6 H 4 Cl)(PCy 3 ) 2 has 
Ru=C = 1.839(3) A, Ru—Cl = 2.401(1), 2.395(1) 
A, and Ru —P = 2.397(1), 2.435(1) A; 

RuC1 2 (= CHCH = CPh 2 )(PCy 3 ) 2 has 
Ru = C = 1.851(21) A; and 

[RuCl(= CHCH = CPh 2 )(CO)(PiPr 3 ) 2 ] + has 
R u = C = 1.8 74(3) A and 

Ru—P = 2.4586(8), 2.4307(8) A. No comparison to 
Ru—Cl is made in the latter complex since CO is 
trans to chloride. The best model for these systems 
is X = Me, Y = Cl, Z = H (Me, Cl, Hj, 4b. The 
model has Ru—C = 1.871 A, Ru—Cl = 2.488 A, 
and Ru—P = 2.481 A (Table V). The Ru-al- 
kylidene bond length is less than 2% different 
from the X-ray structures, while Ru—P and 
Ru—Cl are both within 4% of experiment, so that 
all three metal-ligand bond lengths are in excel¬ 
lent agreement with experiment. 

Effect of Phosphine Substituent (X) 

Modification 


Several five-coordinate ruthenium alkylidenes The series of phosphine substituents studied 

have been isolated and characterized. The most (X = H, F, Cl, Me) were modeled for Y = Cl. Phos- 


similar 

to the 

model 

complexes 

are RuC1 2 

phine 

substituent modifications have a 

relatively 

TABLE V 









Important bond lengths In ruthenium alkylidenes. 3 




Ru—C 

Ru—P 

Ru—Y 


Ru—C 

Ru—P 

Ru—Y 

X 

Y 

Z 

(4a*) 

(4 a) 

(4a) 

A£rhf 

(4b) 

(4b) 

(4b) 

H 

H 

H 

1.853 

2.436 

1.708 

14.3 

1.889 

2.408 

1.743 

H 

F 

H 

1.837 

2.509 

2.000 

0.2 

1.873 

2.481 

2.047 

H 

Cl 

H 

1.879 

2.498 

2.440 

-2.7 

1.886 

2.472 

2.468 

H 

Me 

H 

1.841 

2.456 

2.169 

15.4 

1.879 

2.431 

2.237 

H 

Cl 

H 

1.879 

2.498 

2.440 

-2.7 

1.886 

2.472 

2.468 

F 

Cl 

H 

1.908 

2.443 

2.418 

-1.0 

1.917 

2.421 

2.438 

Cl 

Cl 

H 

1.893 

2.502 

2.417 

0.5 

1.914 

2.478 

2.441 

Me 

Cl 

H 

1.869 

2.503 

2.450 

-4.5 

1.871 

2.481 

2.488 

H 

Cl 

H 

1.879 

2.498 

2.440 

-2.7 

1.886 

2.472 

2.468 

H 

Cl 

F 

1.980 

2.500 

2.454 

1.5 

1.990 

2.486 

2.460 

H 

Cl 

Cl 

1.965 

2.516 

2.447 

-0.1 

1.984 

2.478 

2.470 

H 

Cl 

Me 

2.007 

2.528 

2.475 

-3.4 

1.983 

2.474 

2.495 

H 

F 

F 

1.954 

2.513 

2.034 

0.4 

1.976 

2.496 

2.042 

Me 

H 

H 

1.847 

2.433 

1.711 

12.2 

1.873 

2.415 

1.752 


a See structures 4a and 4b for alkylidene conformations. Bond lengths are in A, and A£ RHF is the difference between 4a and 4b. A 
negative number indicates conformer 4b is lower in energy than is conformer 4a. 
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small effect on the alkylidene bond length (< 0.04 
A) (Table V). However, from a structural point of 
view, the sensitivity of the Ru-alkylidenes to 
phosphine modification is greater than seen for 
Pt-phosphinidenes. 

When replacing phosphine hydrogens in the 
parent complex (H, H, H) with methyl groups 
(Me, H, H), the change in Ru = C, Ru—P, and Ru 
—Cl bond lengths is < 0.01 A for 4a and < 0.02 
A for 4b. The bond lengths in (H,Cl,H) rt are 
Ru=C = 1.879 A, Ru—P - 2.498 A, and Ru—Cl 

o 

= 2.440 A. Upon changing X to fluorine 
(F,C1, H) a , Ru = C increases by 0.029 to 1.908 A, 
Ru—P decreases by 0.055 to Q 2.443 A, and Ru—Cl 
decreases by 0.022 to 2.418 A. The decrease in Ru 
—P bond length is consistent with the increased 
s-character on the P lone pair because of the elec¬ 
tronegative fluorines [17]. When X is changed from 
F to Cl (Cl,Cl,H) fl , Ru=C and Ru—Cl remain 
about the same (1.893 and 2.417 A, respectively). 
The Ru—P bond length changes by 0.059 to 2.502 
A. When Cl is replaced with Me (Me, Cl, H) a , 
Ru—P is nearly unchanged (2.503 A), while 
Ru=C decreases by 0.024 to 1.869 A and Ru—Cl 
increases by 0.033 to 2.450 A. The geometric 
changes when the X substituent is modified in 
conformer 4b are nearly identical to those just 
discussed for conformer 4a. 

Analysis of the atomic charge calculated by a 
Mulliken population analysis (MPA) reveals that 


(Me, H, H) (both 4a and 4b) has the most negative 
charge on ruthenium. This is in support of Grubbs' 
finding [9b] that metathesis activity is increased by 
replacing the triphenylphosphine ligands with bet¬ 
ter cr-donating alkylphosphines, if the rate-de¬ 
termining step is breaking the C=C double bond. 
Increased electron density on the metal will in¬ 
crease backbonding into 7r* orbitals of the olefin 
and should facilitate subsequent [2^ + 2J cy- 
cloaddition. Further investigation of this point will 
require a more in-depth computational analysis of 
the mechanism of olefin metathesis for the Ru-al- 
kylidene catalysts. 

In all cases for (X, Cl, H), there is a small (< 5 
kcal mol -1 ) energetic difference between conform- 
ers 4a and 4b regardless of the phosphine sub¬ 
stituent. In three cases, conformer 4b is calculated 
to be slightly more stable than is 4a. For the 
(Cl, Cl, H) complex, the situation is reversed with 
(Cl,Cl,H) fl more stable than is (Cl, Cl, H) b by 0.5 
kcal mol -1 . The experimental catalysts have chloro 
ligands [9]. Hence, the computations are consistent 
with experimental observation of two Ru-al- 
kylidene conformers, i.e., the carbene is either or¬ 
thogonal or parallel to the P—Ru—P axis depend¬ 
ing on X [9b]. This point is addressed more below. 

For a given set of ligands and substituents, 
conformer 4b has a more negative/less positive 
charge on Ru and a more positive charge on C 
than have the corresponding atoms in 4a. In other 


TABLE VI 


Important charges on ruthenium alkylidenes. 3 


X—P 
subst. 

Y—Ru 
subst. 

z—c 

subst. 

g(Ru) 

(4a*) 

<7(0 

(4a) 

g(Ru) 

(4b) 

g( C) 

(4b) 

H 

H 

H 

-0.9262 

0.0227 

-1.0547 

0.0998 

H 

F 

H 

0.1691 

0.1463 

0.0335 

0.1831 

H 

Cl 

H 

-0.5921 

0.1888 

-0.7499 

0.2285 

H 

Me 

H 

-0.9382 

0.0194 

-1.1218 

0.1711 

H 

Cl 

H 

-0.5921 

0.1888 

-0.7499 

0.2285 

F 

Cl 

H 

-0.6751 

0.2063 

-0.8217 

0.2529 

Cl 

Cl 

H 

-0.5380 

0.1928 

-06856 

0.2534 

Me 

Cl 

H 

-0.7352 

0.1395 

-0.9179 

0.1916 

H 

Cl 

H 

-0.5921 

0.1888 

-0.7499 

0.2285 

H 

Cl 

F 

-0.5300 

0.6396 

-0.4893 

0.5369 

H 

Cl 

Cl 

-0.3974 

0.0496 

-1.0020 

-0.0680 

H 

Cl 

Me 

-0.5856 

0.0126 

-0.6912 

0.0181 

H 

F 

F 

0.2791 

0.5907 

0.2951 

0.4937 

Me 

H 

H 

-1.0698 

-0.0061 

-1.2218 

0.0743 

a See structures 4a and 4b for alkylidene conformations. The atomic charges (g) are 
analysis. 

determined from a Mulliken population 
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words, the rotating about the Ru = C bond leads 
to increased polarization of the Ru = C bond. As 
the polarity of a multiple bond is often an impor¬ 
tant determinant of its reactivity, and solution- 
phase data suggest that both conformers are acces¬ 
sible for experimental systems, this is an important 
result. 

Effect of Ligand (Y) Modification 

Inspection of the data in Table V makes it evi¬ 
dent that from a geometric point of view modifica¬ 
tion of the Ru ligands has a small effect. This is 
true for both conformers 4a and 4b. For example, 
the parent complex is (H, H, H) fl , with 
Ru=C = 1.853 A, Ru—P = 2.436 A, and Ru—H 
= 1.708 A. When Ru ligands (Y) in the parent 
complex are replaced with fluorine (H, F, H) n , 
Ru=C decreases by only 0.01 to 1.84 A and 
Ru—P increases by 0.07 to 2.51 A. Replacing Y 
with Cl (H, Cl, H) fl has an opposite effect to F on 
the Ru = C bond length, i.e., it increases by 0.03 to 
1.88 A. The Ru—P bond length is about the same 
in (H, F, H) fl and (H, Cl, H) fl at 2.50 A. Replacing 
the hydride ligand with a methyl ligand does not 
result in a large change in geometry. However, as 
before, changes are more substantial than seen for 
Pt-phosphinidenes. 

Despite the structural similarity of conformers 
4a and 4b with ligand modification, there are no¬ 
ticeable energetic differences, particularly for non¬ 
halide ligands. For halide ligands, conformers 4b 
and 4a are close in energy (AE RHF < 3 kcal mol -1 . 
Table VI). As experimental systems have Y = Cl, 
theoretical results are consistent with experimental 
observation of two rotamers differing in the orien¬ 
tation of the alkylidene moiety [9b]. On the other 
hand, there is a distinct preference (> 14 kcal 
mol -1 ) for 4a when strong <x-donors (H, Me) are 
coordinated to Ru. Halides are 7r-donors while 
phosphines are 7r-acceptors so that they should 
work to stabilize and destabilize, respectively, the 
d n orbitals involved in the multiple bond to the 
alkylidene C. Thus, it is unexpected that small 
rotational barriers are seen for Y = halide. The 
difference may be due, in part, to changes in 
P—Ru—P and Y—Ru—Y angles which accom¬ 
pany rotation about the Ru=C bond, as compres¬ 
sion of these angles away from the alkylidene 
(increasing Y—r u =c and P—Ru = C angles) 
will polarize the d w orbitals toward the alkylidene 
C, making for more effective 77 -bonding. Simple 
molecular mechanics calculations also suggest that 


the conformational energy difference about the 
Ru=C bond between halide and nonhalide lig¬ 
ands is partly steric in origin. 

Although the changes in geometry of the Ru-al- 
kylidenes as a function of Y are limited, analysis of 
calculated properties indicate that interesting 
changes are taking place in the Ru = C active site. 
First, it can be seen from the atomic charge calcu¬ 
lated by an MPA that replacement of the hydride 
ligands in (H, H, H) rt with either chlorine or fluo¬ 
rine makes the charge on the Ru more positive/less 
negative. Likewise, these same modifications make 
the carbon more positively charged. As the magni¬ 
tude of the shifts in atomic charge are larger on the 
Ru than on the C, replacement of electroneutral 
with electronegative Y decreases the polarization 
of the RuC double bond. Finally, for both alkyli¬ 
dene conformers, the MPA suggests that the elec- 
trophilicity of the Ru center can be substantially 
enhanced through ligand modification. 

Effect of Alkylidene Substituent (Z) 

Modification 

The alkylidene substituents have been modified 
in the same manner as above, using Z = H, F, Cl, 
and Me, with X = H and Y = Cl. When the alkyli¬ 
dene substituent in (H, Cl, H) fl is replaced by fluo¬ 
rine, (H, Cl, F) fl/ the Ru—P and Ru—Cl bond 
lengths are almost unchanged, but Ru = C is in¬ 
creased from 1.88 to 1.98 A, a difference of 0.10 A 
(5%). When F is replaced with Cl, (H, Cl, Cl) fl , 
again, the Ru—P and Ru—Cl bond lengths are 
roughly the same. The alkylidene bond length is 

o o 

decreased to 1.97 A, a difference of 0.02 A, al¬ 
though the Ru = C bond length is still much 
longer (0.09 A) than is (H, Cl, H) fl . The longest 
alkylidene bond length was obtained with methyl 
substituents on the carbon, (H, Cl, Me) rt ; the 
Ru = C bond length is increased to 2.01 A, a 
change of 0.13 A from (H, Cl, H) fi . As before, Ru—P 
and Ru—Cl show relatively small changes with 
modification of the alkylidene substituents. The 
Ru = C bond length in (H, Cl, Me) is the only case 
where conformer 4a is longer than 4b. The longer 
bond length is due to sterics between the phos¬ 
phine ligands and methyl substituents on the 
alkylidene carbon. Conformers 4a and 4b are very 
similar in energy for the small phosphines studied 
here. The bulky phosphines needed to prevent 
oligomerization (e.g., l Pr, Cy) in an experimental 
system will lead to a preference for an alkylidene 
orientation in which the alkylidene plane is or- 
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thogonal to P—Ru—P (4b), as seen in the solid- 
state structures of Ru-alkylidenes [9b] and in the 
lower energy of (H, CLMe)^ than of (H,C1, Me) fl 
(Table V). 

The large change in Ru = C bond length is in 
contrast to what is seen in early, high-valent 
alkylidenes [13,18], particularly for second- and 
third-row TMs that have more covalent bonding 
than their first-row congeners. In those cases, it is 
generally observed that metal-carbon double 
bonds are insensitive to modification of metal lig¬ 
ands and carbon substituents. For example, the 
Zr = C bond in H 2 Zr=CH 2 is 1.99 A [18], while 
that in H 2 Zr=CH 2 is only 0.04 A (2%) longer. 
Likewise, for a third-row TM, going from a meth- 
ylidene H 3 Ta = CH 2 (Ta = C = 1.90 A) to 
H 3 Ta = Cl 2 (Ta=C = 1.94 A) also results in a 
2 % increase in the metal-ligand multiple bond 
length [18]. In such cases, substituents such as 
chlorine generally transmit their influence through 
the 7r-system [13,18]. As indicated by atomic 
charges from an MPA, fluorine yields a substan¬ 
tially more electrophilic alkylidene carbon, sug¬ 
gesting that its influence is transmitted primarily 
through cr-withdrawing rather than 77 -donating 
ability. Interestingly, chlorine substituents led to a 
larger change in the charge on ruthenium than 
carbon: q Ru = - 0.59 and q c = 0.19 (H, Cl, H) fl ; and 
*?Ru = —0.40 and q c = 0.05 (H, Cl, Cl) fl . The less 
positive charge on the alkylidine carbon in 
(H, Cl, Cl) a indicates a dominance for the chlorine 
substituent of 77 over a effects. 


Discussion 

Analyses of computational results for Pt-phos- 
phinidenes and Ru-alkylidenes point to some in¬ 
teresting differences between these late TM, multi¬ 
ply bonded complexes and their more heavily 
studied, computationally and experimentally, early 
TM analogs. 

For Pt-phosphinidenes, the steric bulk of R has 
an effect on bond angle, as expected. The sub¬ 
stituent size, however, was kept small for compu¬ 
tational feasibility, so this was not vigorously pur¬ 
sued. However, the calculation on the (H 3 P) 2 
Pt=PH with a constrained linear Pt = P—H an¬ 
gle suggests that substituents that engender large 
Pt=P—R angles should serve to strengthen the 
Pt=P bond. The bidentate phosphine dpm 
(H 2 PCH 2 CH 2 ) also led to a shortening of the 


Pt=P bond length vs. bis(monodentate-phos- 
phine) analogs. 

Halogenated substituents (both aliphatic and 
aromatic) caused small decreases in the Pt=P 
bond length. Somewhat surprisingly, the bonding 
and structure of the Pt-phosphinidene moiety was 
more sensitive to substituent changes in alkylphos- 
phinidenes than in arylphosphinidenes. Analysis 
of the molecular orbitals for (H 3 P) 2 Pt = P—CH 3 
and (H 3 P) 2 Pt = P—C 6 H 5 suggests that this may 
be due to lack of delocalization of phenyl-based 
MOs to the Pt=P "active site," while in the 
(H 3 P) 2 Pt=P—CH 3 , there is some mixing of 
C—H a orbitals with Pt=P orbitals. These 
points, taken together, suggest that the Pt=P 
bond is primarily affected through the cr frame¬ 
work, a conclusion that is consistent with a polar¬ 
ized description of the Pt-phosphinidene bond, A. 

The largest decreases in Pt=P bond length 
were caused by cyano substituents. Hence, calcula¬ 
tions suggest that bulky phosphinidene sub¬ 
stituents (coupled with bulky bidentate phosphine 
ancillary ligands), combined with cyano phos¬ 
phinidene substituents, could be a means to ob¬ 
taining a terminal platinum phosphinidene with 
sufficient stability for structural characterization. 

Strong cr-donor ligands (e.g., hydride) led to 
large rotational barriers about the Ru=C bond. 
On the other hand, halide ligands show small 
differences between Ru-alkylidene conformers 
(4a, 4b). The latter observation is consistent with 
experiments [9b] that have been interpreted as 
pointing to two conformers in solution that differ 
in the orientation of the alkylidene moiety with 
respect to the other ligands. 

The changes in molecular and electronic struc¬ 
ture of Ru-alkylidenes as a function of metal lig¬ 
and and alkylidene substituent are larger than 
observed for Pt-phosphinidenes studied in this 
work and also those observed in research on 
alkylidenes involving high-valent, early TMs [13, 
18]. These observations are important in two re¬ 
spects: First, for early TM = MG complexes, there 
is greater chemical diversity with changes in 
chemical environment when MG is a heavy ele¬ 
ment (e.g., phosphorus) [19]. Second, for late TMs, 
the situation is reversed. Based on observations to 
date, our current working hypothesis is as follows: 
For TM early — MG light complexes, the strength of 
the metal-ligand 77 bond engenders a degree of 
insensitivity to chemical modification. For 
TM early =MG heavy complexes, the situation is re¬ 
versed due primarily to weak metal-ligand 77 
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bonding. The TM late = MG light complexes must 
have weaker tt bonding than observed for early 
metal congeners, making them akin, in a concep¬ 
tual sense, to TM early = MG heavy systems with re¬ 
spect to their response to ligand and substituent 
modification. Finally, for TM late = MG heavy com¬ 
plexes, the inherently weaker Tr-bonding of the 
heavier MG elements, coupled with the low-valent 
nature of the metals, conspire to yield a bonding 
situation where 7r-bonding is virtually nonexistent 
so that any effects on the metal-ligand multiple 
bond must be transmitted through the a frame¬ 
work which generally results in small changes. 

The foregoing hypotheses suggest that 
TM=MG complexes involving late TMs, as they 
are increasingly explored through theory and ex¬ 
periment, will yield new chemistry not observed 
in the more well-studied early, TM complexes. 
How these changes can be exploited for catalytic 
or synthetic benefit will require a more in-depth 
analysis of the reactivity of these systems. Such 
studies are now under way in our lab. 
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ABSTRACT: Values of several parameters appearing in the modified Bethe-Bloch 
theory must be known if one is to calculate the stopping power for a given 
projectile-target combination within the energy interval of applicability of the theory. In 
the course of the past several years, the author has established values for kapton and 
mylar target materials through fits of experimental data with the modified Bethe-Bloch 
theory. During the past year, several sets of measurements of polystyrene stopping 
powers for light projectiles have been studied so as to extract values of Bethe-Bloch 
parameters also for this target material. Results of these studies, including uncertainties 
in the parameter values that reflect the number and accuracy of the measurements 
analyzed, are summarized in this presentation. © 1997 John Wiley & Sons, Inc. Int J Quant 
Chem 65: 997-1003, 1997 


Introduction 

T he stopping power of matter for charged par¬ 
ticles has been studied for nearly a century 
now. Not only is this topic one of theoretical im¬ 
portance in several areas of physics, but it holds 
great significance in numerous areas of experimen¬ 
tal physics, i.e., it is often necessary to know with 
considerable accuracy the energy loss of a charged 
subatomic projectile during traversal of a known 
thickness of target material. If one is to avoid 
energy-loss measurements for every projectile- 
target combination encountered, a reliable means 


of calculation must be available. In this connection, 
the modified Bethe-Bloch theory of stopping 
power has proved very helpful over a broad range 
of projectile velocities, provided that values of 
several parameters of the theory are available [1-4]. 
In the case of a target that manifests aggregation 
effects, such as alloys or compounds, other compli¬ 
cations arise. In these cases, one must invoke 
Bragg's rule of (linear) additivity of stopping ef¬ 
fects [5]. A technique for calculating energy loss in 
this sort of target has recently been explained 
[6-9], 

In the past few years, three measurements of 
the stopping powers of polystyrene for light pro¬ 
jectiles have been reported [10-12], two of them 
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[10,12] covering lower energy intervals than, and 
one of them [11] overlapping considerably, that 
explored previously by the author [13,14]. The 
current study was undertaken with the goal of 
analyzing for the first time the recent measure¬ 
ments [10-12] in terms of the modified Bethe- 
Bloch theory and of comparing extracted values of 
parameters with those established earlier through 
experiment [13,14] and by means of theoretical 
calculations. In the case of the dominant 
Bethe-Bloch parameter, the mean excitation en¬ 
ergy (I), the previously determined experimental 
value was 71.1 + 2.0 eV [14], whereas a very reli¬ 
able calculated value was 68.7 eV [15]. Nearly a 
decade ago, the author similarly analyzed several 
extant sets of stopping power measurements for 
two other plastics, mylar and kapton, both of which 
are used widely in numerous areas of physics and 
the first of which has been extensively utilized in 
the U.S. space program [6,7]. The results of these 
investigations [6,7] are compared with those of the 
current study of polystyrene. Moreover, uncertain¬ 
ties in all three sets of extracted parameter values 
were calculated in the current study for the sake of 
both clarity and comparison. 


Theory 

The modified Bethe-Bloch formalism and the 
method employed in analyzing stopping power 
measurements for the purpose of extracting values 
of the various parameters of the formalism were 
described in previous studies [1—4,6-9,13,14]. The 
stopping power, S, of an elemental target of atomic 
weight A and atomic number Z for a projectile of 
atomic number z and velocity v — pc is given by 

0.30708 Z 

S.—S-L, (1) 


in units of keV cm 2 /mg. Here, L, which represents 
the (dimensionless) stopping number per target 
electron, contains three terms: 

L = Lq + £zL x + L 2 , (2) 

where the basic stopping number, L 0 , is given by 


L 


o 



2 mc 2 p 2 ^ 

1- P 2 i 


P 2 - In / - C/Z - 8/2 . (3) 


The symbol I in this expression signifies the target 
mean excitation energy, me 2 represents the rest 
mass energy of the electron, C is the sum of target 
shell corrections, and 8 stands for the density 
effect correction needed for highly relativistic pro¬ 
jectiles [16]. The mean excitation energy can be 
determined from a fit to accurate stopping power 
measurements. Shell corrections can be obtained 
with the method of Bichsel [17,18], where Walske 
K- and L-shell corrections [19,20] are utilized with 
appropriate scaling factors employed in the L-shell 
correction so as to obtain the M- and N-shell 
corrections: 

c = C k ( ft 2 ) + V L C L (H L fi 2 ) + V M C L (H M p 2 ) 

+ V N C L (H N f3 2 ). (4) 

In this expression, C K and C L represent the X- and 
L-shell corrections, respectively, and the V { and H i 
(with i — L, M, N) denote the scaling factors. 

The L x and L 2 terms in Eq. (2) represent two 
terms of higher order in the projectile atomic num¬ 
ber (z). The ( zL x ) term is known as the Barkas- 
effect correction term, where L x can be calculated 
from one of at least three existing formalisms 
[21-25]. A comparison of the methods [2] has 
shown that the first [21-23] clearly provides a 
superior fit to measurements over the broadest 
projectile energy interval. Thus, this formalism has 
been used in the stopping number calculated for 
analyses in the current study. In this case, 

F(b/x l/2 ) 

1 — --- (r) 

1 Z 1/2 x 3/2 ' v ' 

where F represents a function graphed in [21], 
x = (18 7&7)p 2 /Z, and b denotes the sole free 
parameter of the formalism, f appears in Eq. (2) as 
an adjustable amplitude of the Barkas-effect term 
in order to reflect a previous controversy about 
inclusion of close-collision contributions to the term 
[1,9]. The L 2 term, called the Bloch term [26], is 
given by 


My) = ^(i)-Re[iMi + n/)L (6) 

where ijj represents the digamma function [27] 
and y = za/p, with a signifying the fine struc¬ 
ture constant. 

The several parameters of the three stopping 
number terms in modified Bethe-Bloch theory, 
consisting of the mean excitation energy I, the 
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Barkas-effect parameters b and £, and the scaling 
parameters for the shell corrections V { and H if are 
all independent of projectile velocity. Whereas the 
Bloch term clearly depends on the projectile en¬ 
ergy and charge, there are no free parameters. 
Thus, a fit to stopping power measurements to 
establish values of some subset of the above 
parameters should yield parameter values that are 
valid over the projectile energy interval in which 
the modified Bethe-Bloch formula is valid. 

The original Bohr and Bethe theories applied 
strictly to pure monatomic targets in the gaseous 
state [28]. A target as simple as an element in the 
condensed state, or as a homonuclear diatomic 
molecule in the gaseous state, manifests bonding 
effects. Extension of Bethe theory to molecules and 
condensed matter [28] was outlined in a recent 
report [9]. Both physical-state effects and 
chemical-bonding effects are often grouped as ag¬ 
gregation effects. A target subject to aggregation 
effects will henceforth be referred to as a compos¬ 
ite target, obvious examples being compounds, 
alloys, and mixtures. A method of managing ag¬ 
gregation effects, initially advanced by Bragg [5], 
is to assume linear additivity of stopping effects of 
constituents in the composite target. This assump¬ 
tion, called "Bragg's rule" or the "additivity rule," 
is used as a first approximation in the investiga¬ 
tion of aggregation effects. In this vein, stopping 
cross sections, stopping powers, and various pa¬ 
rameters in the Bethe-Bloch formula are character¬ 
ized on the basis of departures from the additivity 
rule. The current study is based in part on continu¬ 
ing interest in the additivity problem. Calculation 
of average parameter values has been described 
previously [22,29,30]. The Bloch term [26] con¬ 
tains no target-dependent parameters, but the 
Barkas-effect correction term [21-23] requires av¬ 
eraging. Moreover, L 0 contains the mean excita¬ 
tion energy, I, and the shell corrections, C—both 
of which are target-dependent—that must be as¬ 
signed average values [29,30]. The general dearth 
of knowledge of the correct parameter values for 
target constituents poses an obstacle to implemen¬ 
tation of averaging procedures [9]. Consequently, 
the parameter selected for testing the additivity 
rule is the mean excitation energy, for which 
the calculated average (Bragg) value, 1 B , can be 
obtained [28,29] as 


In I B 


E j tijZj In I ; 
E; tijZj 


(7) 


where n ]t Z y , and l ]f respectively, represent the 
atomic concentration, atomic number, and mean 
excitation energy of the j th component of the com¬ 
posite material. I values extracted from measure¬ 
ments are expected to exceed the respective Bragg 
values in general, with the largest deviations oc¬ 
curring for strong chemical bonds in low-Z com¬ 
pounds, since in such cases, a large fraction of the 
total number of electrons participate in the bond¬ 
ing. For a given compound, the chemical binding 
effect should be greatest at low projectile veloci¬ 
ties, since the tightly bound inner-shell electrons 
contribute less to the stopping than do the outer, 
valence electrons [31]. 


Method of Analysis 

The basic procedure that is utilized in assigning 
values to some parameters and searching for val¬ 
ues of others through fits to experimental data 
[1-4,6-9,13,14,30] is based on the premise that 
the assigned values, which have been established 
through prior analyses, are essentially correct. 
Then, the number and accuracy of the remaining 
parameter values that can be ascertained are 
determined by the number and accuracy of the 
measurements. 

In the case of polystyrene, the average atomic 
number (Z) is 3.500, the average atomic weight 
(A) is 6.509, and the Bragg value [5] of mean 
excitation energy (I B ) based on current values of 
constituent mean excitation energies [18] and cal¬ 
culated in the usual manner [1,6-9] is 69.5 eV. The 
shell correction scaling parameters for the L shell, 
calculated according to the Bichsel rubric [17,18], 
are V L = 0.188 and H L = 1.00. (The parameter val¬ 
ues utilized for polystyrene, mylar, and kapton are 
displayed in Table I. Constituent mean excitation 
energies of 19.2 eV for hydrogen, 81 eV for carbon, 

TABLE I___ 

Average atomic number (Z), average atomic 
weight (A), logarithmic average mean excitation 
energy (/ B ), and shell correction scaling 
parameters assigned to polystyrene, mylar, 
and kapton. 


Target 

Z 

A 

l B (eV) 

7, 

H l 

Polystyrene 

3.50 

6.51 

69.5 

0.188 

1.00 

Mylar 

4.54 

8.74 

78.7 

0.318 

1.00 

Kapton 

5.03 

9.80 

79.6 

0.379 

1.00 


INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY 


999 




PORTER 


82 eV for nitrogen, and 106 eV for oxygen were 
taken from [18].) A typical method of analysis is to 
fix the values of shell correction scaling parame¬ 
ters and to search for best-fit values of the mean 
excitation energy (/), the Barkas-effect parameter 
(b), and the amplitude of the Barkas-effect term 
(£) in a three-parameter fit or to set the value of £ 
at unity and proceed to conduct a two-parameter 
fit. The best fit is established by minimizing the 
root-mean-square relative deviation of calculated 
from measured stopping powers, cr. This quantity 
is defined as 


a = 


l N 

- y 

T 


(S m -S t 


N : 


i -1 \ 


AS 


m I { 


1/2 


( 8 ) 


for measurements at N energies, where S m repre¬ 
sents the measured stopping power; S c the calcu¬ 
lated stopping power; and A S m , the statistical 
uncertainty in the measured value. Thus, a value 
of cr near unity indicates an acceptable level of 
agreement between theory and experiment. 


Analyses and Results (Protons) 

The point of departure in the present study was 
a reappraisal of the results of a previous analysis 
[14] by the author of measurements reported by 
the author and his collaborators [13]. The earlier 
analysis [14] was conducted prior to introducing 
an automated fitting routine into the computer 
code utilized [32,33], i.e., manual searches were 
carried out to obtain the values (I = 71.1 ± 1.8 eV 
and b = 1.90 ± 0.05 with a = 0.70) in the previous 
study [14]. The central values were checked with 
the automated search routine, with the resulting 
new values, I = 71.0 eV, b = 2.14, and cr = 0.69. 
Thus, it would appear that the lower b value first 
obtained was a consequence of finding a local 
minimum close to that located in the automated 
search. (The above fits were made with V L at a 
higher value [0.625] that had been selected for the 
first analysis [14].) When V L was reset at 0.188 for 
the current series of analyses, the results were that 
I = 71.0 eV, b = 2.26, and a = 0.69. (A three- 
parameter search for /, b, and £ proved fruitless, 
since the value of b became negative [ -0.312] and 
that of £ very small [0.09], with cr — 0.69; remain¬ 
ing searches were made in the two-parameter 
mode.) Clearly, the effect of selecting a smaller 
value of V L in the present study had little effect on 


the derived values of I and b: I remained the 
same, whereas b rose by 0.12 to 2.26 and cr re¬ 
mained constant. 

Analyses of more recent measurements [10,12] 
followed in an attempt to compare extracted 
parameter values with those obtained from the 
measurements by the author and collaborators with 
2.2-5.9 MeV protons [13]. The first set of new 
measurements analyzed was one with 0.02-0.4 
MeV protons [10]. A vital question to be answered 
at the outset was that of the lowest-energy data 
point that could be included without utilizing a 
charge-state correction. In other recent studies by 
the author, it was learned that proton energies 
extending down to 300 keV could be included 
without appreciable distortion of extracted param¬ 
eter values [4,34,35] and that a corresponding 
lower bound for alpha particles was 1.0 MeV 
[36,37]. In the present study, proton data points 
below 300 keV were initially excluded, then only 
those below 250 keV, and finally only those below 
200 keV, in an attempt to stretch the energy inter¬ 
val downward. The test was successful. Thus, the 
data for 0.2-0.4 MeV were included in the analy¬ 
sis, with resulting values of I = 71.9 eV, b = 1.20, 
and cr — 0.04. Another set of similar measure¬ 
ments [12], from which proton data points from 
0.2-0.4 MeV were selected, yielded I = 69.8 eV, 
b = 1.15, and a = 0.28. The data from [12] are 

i 

truly composite in nature. Measurements with H 
and 2 H projectiles at 40-3000 keV/u were re¬ 
ported. These data were combined with previously 
reported measurements [10,38] and supplemented 
by means of analyses with a fitting function [39] in 
order to extend the covered energy interval up¬ 
ward. The fitting function proposed earlier [39] is 
an expansion in energy per mass unit for the 
(reciprocal) stopping power: 

3 

s' 1 = £ (9) 

i= 0 

where units of keV/u for E will yield units of 
10“ 15 eV cm 2 /atom for s. The procedure used was 
first to establish y 1 values by obtaining the best 
overall fit to proton stopping powers from 10 to 
1000 keV in the Janni tables [40] and then to 
determine values of the through fits of the 
experimental data from [10,12,39], The resulting 
values of coefficients are purportedly valid for 
hydrogen projectile energies of 40-400 keV/u [12]. 
Actual measured stopping powers were obtained 
from one of the authors [41] in a third set of recent 
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TABLE II_ 

Extracted values of mean excitation energy (/) and 
Barkas-effect parameter (b) and associated error 
function (a) for each of six measurements of 


polystyrene stopping power for protons, including 
the projectile energy interval explored in each case. 


Ref. 

A E p (MeV) 

/ (eV) 

b 

cr 

[13] 

2.2 -5.9 

71.0 

2.26 

0.69 

[10] 

O 

1 

CVJ 

o 

71.9 

1.20 

0.04 

[12] 

0.2-0.4 

69.8 

1.15 

0.28 

[11] 

0.9-3.0 

62.2 

0.84 

0.43 

[42] 

0.20-0.34 

68.3 

1.12 

0.24 

[38] 

0.20-0.35 

64.8 

1.25 

0.49 


Measurements from [12] with and 2 H from 40 to 300 
keV/u were supplemented by the authors with results of 
other measurements [10,38] and of a fitting function [39] to 
characterize the data of an extensive compilation [40] so as 
to extend the energy interval upward. 


measurements [11], utilizing 0.9-3.0 MeV protons; 
analysis yielded I = 62.2 eV, b = 0.84, and a = 
0.43. The results of these fits of proton measure¬ 
ments are displayed in Table II, along with results 
from current fits of two older data sets [38,42]. 
These results will be discussed extensively below. 


Analyses and Results 
(Alpha Particles) 

Two of the references contained also stopping 
power measurements for alpha particles [11,12]. 
The data for 0.8-2.9 MeV projectiles [11] yielded 
I = 60.7 eV, b = 1.66, and cr = 1.77. (Only one of 
the nine data points featured a projectile energy 
below 1.0 MeV.) The measurements for 1.0-3.0 
MeV projectiles [12] yielded I = 62.5 eV, b = 1.48, 
and a = 0.75. Once again, it must be noted that 
the data from [12] are composite in nature. 
Measurements with 3 He and 4 He projectiles at 
25-90 keV/u were reported. These data were sup¬ 
plemented by previously reported measurements 
[11] in order to extend the covered energy interval 
upward. These data were fit with the function 
given by Eq. (9) to obtain new (He projectile) 
values of the coefficients, a if purportedly valid for 
the energy interval from 20 to 750 keV/u [12]. All 
these results, summarized in Table III, will be 
discussed further below. 


TABLE III_ 

Extracted values of mean excitation energy (/) and 
Barkas-effect parameter (b) and associated error 
function (cr) for each of two measurements of 
polystyrene stopping power for alpha particles, 
including the projectile energy interval explored 
in each case. 


Ref. 

AE a (MeV) 

/(eV) 

b 

a 

hi] 

0.8-2.9 

60.7 

1.66 

1.77 

[12] 

1.0-3.0 

62.5 

1.48 

0.75 


Measurements from [12] with 3 He and 4 He from 25 to 90 
keV/u were supplemented by results of other measure¬ 
ments [11] to extend the energy interval upward. 


Discussion of Results 

In the course of developing the Barkas-effect 
formalism utilized herein [21-23,43], it was rec¬ 
ommended by two of the authors [43] that, in 
general, b = 1.4 ± 0.1. Moreover, according to the 
"13% rule" [18], one might expect that the mean 
excitation energy for a solid compound of low 
atomic number will lie some 13% higher than the 
Bragg value; the Bragg value of I for polystyrene 
is I B = 69.5 eV, and 1.13I S = 78.5 eV. (Since con¬ 
stituent mean excitation energies have evolved 
over time, the I B values given herein may differ 
from those provided in cited earlier references.) 
All values of I extracted from experimental data 
lie well below the "13% rule" prediction, and the 
proton-based values, instead, provide 68.0 eV as a 
simple arithmetic average. A reliable calculated 
value of I is 68.7 eV [15]. Two values of I ex¬ 
tracted from stopping power measurements with 
70 MeV protons and two different samples of 
polystyrene, 67.55 and 69.0 eV, were reported re¬ 
cently [44]. Nearly all extracted values of the 
Barkas-effect parameter (b) are eminently reason¬ 
able, the exceptions being those of proton data 
from [11,13]. I values obtained from the proton 
measurements of [11,13] appear to be somewhat 
low and high, respectively. Only the alpha particle 
data of [11] provided a fit characterized by a 
(j-value greater than unity. 

The experiments were conducted with thin-film 
transmission techniques in [10,13,38,42] and with 
the Rutherford back-scattering method in [11, 12]. 
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TABLE IV_ 

Recommended values of mean excitation energy 
(/) and Barkas-effect parameter ( b ) for 
polystyrene, mylar, and kapton, with 
calculated uncertainties (A/ and A b). 


Target 

/ (eV) 

A/ (eV) 

b 

A b 

Polystyrene 

71.9 

6.5 

1.20 

0.36 

Mylar 

80.5 

7.0 

1.33 

0.60 

Kapton 

78.2 

8.1 

1.08 

0.87 


However, the complete data from [12] that were 
analyzed herein were composite in nature, as noted 
above. Hence, no systematic trends in extracted 
parameter values are discernible solely on the 
basis of the experimental method employed. 

Results of searches for parameter values for 
kapton and mylar were reported previously [6,7]. 
However, no uncertainties in the values were pro¬ 
vided in those reports. An important feature of the 
current study is the calculation of uncertainties in 
recommended values of I and b for polystyrene, 
mylar, and kapton. These uncertainties, calculated 
with the method described in detail previously 
[1,45], reflect the number and accuracy of the 
measurements of the stopping power for a 
given projectile-target combination. Part of the 
procedure for establishing uncertainties in param¬ 
eters I and b for mylar and kapton was to 
calculate best-fit central values with the current 
version of the fitting program. Hence, the recom¬ 
mended values of these parameters differ slightly 
from those previously published [6,7]. The partic¬ 
ular measurements chosen for analysis in this por¬ 
tion of the study, were, for each target material, 
those which reflected the greatest consistency with 
expectations based on modified Bethe-Bloch 
theory. All three sets of measurements were 
made with protons traversing thin-film targets, 
those utilized being from [10] for polystyrene 
and from [46] for mylar and kapton. Results of this 
portion of the analyses are displayed in Table IV. 
Each uncertainty shown was derived solely from 
the particular set of measurements analyzed, thus 
reflecting the number and accuracy of only those 
measurements. 


Summary 

Eight experimental studies of the stopping 
power of polystyrene, six for protons and two for 


alpha particles, have been reviewed in order to 
ascertain modified Bethe-Bloch stopping power 
parameters by consistent data analysis in each 
case. Four of the six proton studies were con¬ 
ducted with the thin-film transmission method 
[10,13,38,42], whereas the remaining four meas¬ 
urements were conducted with the Rutherford 
back-scattering (RBS) technique [11,12]. The ma¬ 
jority of results for mean excitation energy (I) 
were reasonably consistent with the Bragg value of 
69.5 eV or a reliable theoretical value of 68.7 eV 
[15], although considerable dispersion among de¬ 
rived values was found. All but two extracted 
values of the Barkas-effect parameter (b) were quite 
consistent with the expected range of values [43] 
of 1.4 ± 0.1, one exception occurring in the case of 
a proton transmission experiment [13] and the 
other occurring in the case of a proton RBS experi¬ 
ment [11]. Dispersion in extracted parameter val¬ 
ues has been encountered in previous reviews of 
stopping power data [1,8,36,47], but no convinc¬ 
ing explanation therefor has yet been advanced. 
Generally, this dispersion merely reflects obvious 
dispersion among sets of stopping power measure¬ 
ments—a concern within the stopping power com¬ 
munity for several decades. An additional factor 
contributing to dispersion among reported sets of 
stopping power measurements is undoubtedly the 
use of various smoothing functions before listing 
the actual measurements—a practice openly de¬ 
plored in a previous study [47]. (An excellent ex¬ 
ample of this smoothing procedure is the afore¬ 
mentioned report of measurements in [12].) The 
practice can distort the values of parameters ex¬ 
tracted from the data in an analysis based on firm 
theory. 

The situation for kapton targets also required 
careful assessment [6,7]. In one study [7], a set of 
proton measurements was rejected on the basis of 
quite implausible values of parameters extracted 
therefrom. Similarly, in the case of mylar targets, 
another study [7] resulted in doubt being cast on 
the validity of a set of alpha particle measure¬ 
ments. These selection processes will hopefully be 
justified by future sets of definitive measurements. 
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